IEEE Transactions on Emerging Topics in Computing

Covering aspects of computer science, computing technology, and computing applications not currently covered by other IEEE Computer Society Transactions


From the April-June 2015 issue

Wikipedia-Based Semantic Similarity Measurements for Noisy Short Texts Using Extended Naive Bayes

By Kaoru Ota, Mianxiong Dong, Shan Chang, and Hongzi Zhu

Featured article thumbnail imageThis paper proposes a Wikipedia-based semantic similarity measurement method that is intended for real-world noisy short texts. Our method is a kind of explicit semantic analysis (ESA), which adds a bag of Wikipedia entities (Wikipedia pages) to a text as its semantic representation and uses the vector of entities for computing the semantic similarity. Adding related entities to a text, not a single word or phrase, is a challenging practical problem because it usually consists of several subproblems, e.g., key term extraction from texts, related entity finding for each key term, and weight aggregation of related entities. Our proposed method solves this aggregation problem using extended naive Bayes, a probabilistic weighting mechanism based on the Bayes’ theorem. Our method is effective especially when the short text is semantically noisy, i.e., they contain some meaningless or misleading terms for estimating their main topic. Experimental results on Twitter message and Web snippet clustering revealed that our method outperformed ESA for noisy short texts. We also found that reducing the dimension of the vector to representative Wikipedia entities scarcely affected the performance while decreasing the vector size and hence the storage space and the processing time of computing the cosine similarity.

download PDF View the PDF of this article      csdl View this issue in the digital library


Editorials and Announcements

Announcements


Editorials


Guest Editorials


Reviewers list


Author Index


Call for Papers

Technical Tracks

View PDF.

IEEE Transactions on Emerging Topics in Computing (TETC) seeks original manuscripts for submission under Technical Tracks. In a track the technical contents of a submitted manuscript must be of an emerging nature and fall within the scope and competencies of the Computer Society. Manuscripts not abiding by these specifications will be administratively rejected. The topics of interest for the Technical Tracks are as follows:

  • Enterprise Computing Systems
  • Computational Networks
  • Hardware and Embedded System Security
  • Educational Computing
  • High Performance Computing
  • Next Generation Wireless Computing Systems

Submitted articles must describe original research which is not published or currently under review by other journals or conferences. Extended conference papers should be identified in the submission process and have considerable novel technical content; all submitted manuscripts will be screened using a similarity checker tool. As an author, you are responsible for understanding and adhering to our submission guidelines. You can access them at the IEEE Computer Society web site, www.computer.org. Please thoroughly read these before submitting your manuscript.

Please submit your paper to Manuscript Central at https://mc.manuscriptcentral.com/tetc-cs and select the "Technical Track" option in the drop-down menu for "Manuscript Type".

Please address all other correspondence regarding this Call For Papers to Fabrizio Lombardi, EIC of IEEE TETC, lombardi@ece.neu.edu

Special Issue on Approximate and Stochastic Computing Circuits, Systems and Algorithms

Submission deadline: September 1, 2015. View PDF.

The last decade has seen renewed interest in non-traditional computing paradigms. Several (re-)emerging paradigms are aimed at leveraging the error resiliency of many systems by releasing the strict requirement of exactness in computing. This special issue of TETC focuses on two specific lines of research, known as approximate and stochastic computing.

Approximate computing is driven by considerations of energy efficiency. Applications such as multimedia, recognition, and data mining are inherently error-tolerant and do not require perfect accuracy in computation. The results of signal processing algorithms used in image and video processing are ultimately left to human perception. Therefore, strict exactness may not be required and an imprecise result may suffice. In these applications, approximate circuits aim to improve energy-efficiency by maximally exploiting the tolerable loss of accuracy and trading it for energy and area savings.

Stochastic computing is a paradigm that achieves fault-tolerance and area savings through randomness. Information is represented by random binary bit streams, where the signal value is encoded by the probability of obtaining a one versus a zero. The approach is applicable for data intensive applications such as signal processing where small fluctuations can be tolerated but large errors are catastrophic. In such contexts, it offers savings in computational resources and provides tolerance to errors. This fault tolerance scales gracefully to high error rates. The focus of this special issue will be on the novel design and analysis of approximate and stochastic computing circuits, systems, algorithms and applications.

Special Issue/Section on Low-Power Image Recognition

Submission deadline: September 1, 2015. View PDF.

Digital images have become integral parts of everyday life. It is estimated that 10 million images are uploaded to social networks each hour and 100 hours of video uploaded for sharing each minute. Sophisticated image / video processing has fundamentally changed how people interact. For example, automatic classification or tagging can mediate how photographs are disseminated to friends. Many of today’s images are captured using smartphones, and cameras in smartphones can be used for a wide range of imaging applications, from high-fidelity location estimation to posture analysis. Image processing is computationally intense and can consume significant amounts of energy on mobile systems. This special issue focuses on the intersection of image recognition and energy conservation. Papers should describe energy efficient systems that perform object detection and recognition in images.

Special Issue on Security of Beyond CMOS Devices: Issues and Opportunities

Submission deadline: September 1, 2015. View PDF.

Continuous scaling of CMOS in the quest of smaller and faster transistors has brought us into the sub 50-nm technology era with the end of CMOS scaling in sight. Number of alternative nanoscale devices – both silicon and non-silicon – with interesting switching characteristics have come onto the horizon with the promise to replace CMOS as both computing and/or information carrier devices. These nanoscale devices are poised to profoundly impact the design of secure information processing systems by ushering in new attack modalities, and by enabling new approaches to secure design that leverage their unique characteristics. Continuing the scaling of integrated systems into the deep nanoscale era will require unprecedented innovation in device technology, including materials, device structures, and possibly entirely new state variables (e.g., mechanical state, electron spin) to represent information. These major shifts will impact the design of secure systems in profound and unanticipated ways. First, the properties of emerging nanoscale devices (e.g., switching behavior, non-volatility) could lead to new vulnerabilities. Second, new defense mechanisms (e.g., new design of security primitives or more powerful cryptographic solution) may be enabled by the unique characteristics of nanoscale devices. With these observations in mind, this special issue aims at comprehensively covering security issues with beyond-CMOS devices and emerging security solutions for systems built with these devices.

Special Issue/Section on Defect and Fault Tolerance in VLSI and Nanotechnology Systems

Submission deadline: December 1, 2015. View PDF.

The continuous scaling of CMOS devices as well as the increased interest in the use of emerging technologies make more and more important the topics related to defect and fault tolerance in VLSI and nanotechnology systems. All aspects of design, manufacturing, test, reliability, and availability that are affected by defects during manufacturing and by faults during system operation, are of interest. The IEEE Transaction on Emerging Topics in Computing (TETC) seeks original manuscripts for a Special Section on Defect and Fault Tolerance in VLSI Systems scheduled to appear in the December issue of 2016.

Special Issue/Section on Emerging Computational Paradigms and Architectures for Multicore Platforms

Submission deadline: December 1, 2015. View PDF.

Multicore and many core embedded architectures are emerging as computational platforms in many application domains ranging for high performance computing to deeply embedded systems. The new generations of parallel systems, both homogeneous and heterogeneous that are developed on top of these architectures represent what is called the emerging computing continuum paradigm. A successful evolution of this paradigm is however imposing various challenges from both an architectural and a programming point of view. The design of embedded multicores/manycores requires innovative hardware specification and modeling strategies, as well as low power simulation, analysis and testing. New synthesis approaches, possibly including reliability and variability compensation, are key issues in the coming technology nodes. Furthermore, thermal aware design is mandatory to manage power density issues. The design of effective interconnection networks is a key enabling technology in a manycore paradigm. New solutions such as photonics and RF NoCs architectures are emerging solutions on this regard. At the same time, these new interconnection systems have to be compliant with innovative 3D VLSI packaging technologies involving vertical interconnections in 3D and stacked ICs. These design solutions enable the integration of more and more IPs, resulting in heterogeneous platform where reconfigurable components, multi-DSP engines and GPUs collaborate to provide the target performance and energy requirements. Along with design and architectural innovations, many challenges have to be faced to enable an effective programming environment to many core systems. These challenges call from innovative solutions at the various levels of the programming toolchain, including compilers, programming models, runtime management and operating systems aspects. Holistic and cross-layer programming approaches have to be targeted considering not only performance, but also energy, dependability and real-time requirements. Finally, on the application side, multicore/manycore embedded systems are pushing developments in various domains such as biomedical, health care, internet of things, smart mobility, and aviation.

This special issue/section asks for emerging computation technology aspects related, but not limited to the mentioned topics. Contributions must be original and highlight emerging computation technologies in design, testing and programming multicore and manycore systems.

Special Issue/Section on New Paradigms in Ad Hoc, Sensor and Mesh Networks, From Theory to Practice

Submission deadline: December 1, 2015. View PDF.

Ad hoc, sensor and mesh networks have attracted significant attention by academia and industry in the past decade. In recent years however new paradigms have emerged due to the large increase in number and processing power of smart phones and other portable devices. Furthermore, new applications and emerging technologies have created new research challenges for ad hoc networks. The emergence of new operational paradigms such as Smart Home and Smart City, Body Area Networks and E-Health, Device-to-Device Communications, Machine-to-Machine Communications, Software Defined Networks, the Internet of Things, RFID, and Small Cells require substantial changes in traditional ad hoc networking. The focus of this special issue is on novel applications, protocols and architectures, non-traditional measurement, modeling, analysis and evaluation, prototype systems, and experiments in ad hoc, sensor and mesh networks.

Special Issue on Design & Technology of Integrated Systems in Deep Submicron Era

Submission deadline: March 1, 2016. View PDF.

IEEE Transaction on Emerging Topics in Computing (TETC) seeks original manuscripts for a Special Issue/Section on Design & Technology of Integrated Systems in Deep Submicron Era scheduled to appear in the first issue of 2017.

Over the past few years the continuation of Moore’s Law has been achieved through constant advances in manufacturing yield but also the development of dependability design techniques to keep the pace of the growing complexity of the computing systems. Dependable systems can be relied on both against failures caused by aging to random physical effects but also against hostile attacks caused by malicious third parts whose goal is to disrupt operation or leak sensible information. A combined effort to design safe and secure systems is therefore becoming more and more interesting and should be taken into account at the early design stages to optimize application specific solutions. The focus of this special issue includes the design, test and technology of electronic products ranging from integrated circuits modules and printed circuit boards to full systems and microsystems, as well as methodologies and tools used in the design verification and validation of such products. Also special interest is devoted to the design and verification approaches of safe and secure systems and how to leverage common solutions to both of these aspects that are becoming more and more mandatory in the design of today’s computing systems.

Special Issue/Section on Next Generation Wireless Computing Systems

Submission deadline: March 1, 2016. View PDF.

IEEE Transactions on Emerging Topics in Computing (TETC) seeks original manuscripts for the special issue on Next Generation Wireless Computing Systems (NGWCS). One aspect to take into account in NGWCS to ensure a better computing performance is the quality of service. Authors are encouraged to submit papers focused on the design, development, analysis or optimization of computing approaches for NGWCS in term of QoS and energy efficiency. The focus of this special issue is on the solutions of QoS and energy efficiency for NGWCS with emphasis on computing.

Special Issue on Sensor Data Computing as a Service in Internet of Things

Submission deadline: June 1, 2016. View PDF.

The internet-of-things (IoT) has gained significant attention over the past decade. It "allows people and things to be anytime, anyplace, with anything and anyone, ideally using any path/ network and any service." Sensor data computing as a service model (also called sensing as a service) envisions to offer sensor data to interested consumers on demand. It will provide access to sensors or sensor data as a service similar to other utility‐based models such as infrastructure-as-a-service (IaaS), platform-as-a-service (PaaS), and software-as-a-service (SaaS). Sensor data computing as a service model is also expected to be on top of an IoT infrastructure and creates novel business and financial opportunities to increase the sustainability of IoT. The number of things connected to the internet exceeded the number of people on earth in 2008. By 2020, there will be 50 to 100 billion devices connected to the internet, ranging from smartphones, pcs, and ATMs to manufacturing equipment in factories and products in shipping containers. It has been estimated that every individual on earth will have more than six devices connected to the internet by 2020. The sensory data these object produce have significant value to many different parties from supply chain management to healthcare services. Sensor data computing as a service model allows interested parties to buy data from an open market. Sensor data computing as a service model needs to be supported by wide range of sensing and communication technologies. Majority of these sensor data will come from the internet connected smart objects. This special issue will comprise a state of the art research finding related sensor data computing as a service model. This feature will include board range of technologies that involve in deploying and capturing sensor data to processing them in the cloud to deliver them to interested consumers on demand.


Access recently published TETC Articles

RSS Subscribe to the RSS Feed of latest TETC Content Added to the Digital Library.


Mail Sign up for the Transactions Connection Newsletter.