Large Scale Cross-media Data Retrieval based on Hadoop

Journal Title: EAI Endorsed Transactions on Cloud Systems - Year 2015, Vol 1, Issue 2

Abstract

With the rapid development of the Internet and speedy increase of the data size, there are more and more data intensive applications which often involve hundreds of megabytes of data. It is important and necessary to obtain the retrieval results from cross-media data quickly and accurately. Large scale cross-media data retrieval based on Hadoop is proposed to speed up the retrieval in this paper. We divide cross-media feature extraction and cross-media retrieval into paralleled pipeline, and implement with the combination of the HDFS, HBase and MapReduce framework. To verify the performance of the proposed method, comparisons with stand-alone mode on different sizes of the image dataset are conducted, and the experimental results demonstrate the good performances of proposed method, which sharply decreases time-consuming, and meanwhile keeps the same query precision.

Authors and Affiliations

Wenchen Cheng, Jiang Qian, Zhicheng Zhao, Fei Su

Keywords

Related Articles

Towards an Interactive Experiment Framework: DynamiQ

Interactive network experiments are useful for finding solutions to network problems, for teaching and for training purposes. In this demonstration we shall present an interactive experiment framework that allows users t...

Extensible and fine-grained characteristics-positioned information storage in cloud computing

With the improvement of distributed computing, outsourcing information to cloud server pulls in loads of considerations. To ensure the security and accomplish adaptable fine-grained record access control, (ABE) was propo...

Multipath Bandwidth Scavenging in the Internet of Things

To meet the infrastructure coverage and capacity needed by future IoT applications, service providers may engage in mutually-beneficial modes of collaboration such as cooperative packet forwarding and gatewaying through...

Savant: A Framework for Supporting Content Accountability in Information Centric Networks

The Information Centric Networking (ICN) paradigm offers solutions to some of the functional and performance limitations of the current Internet architecture by offering secure, efficient and scalable mechanisms for the...

Overview - Fog Computing and Internet-of-Things (IOT)

The Internet today is getting connected to a very large number of devices or sensors of IOT. It is expected that 50 billion devices will be connected to the Internet by 2020..The IOT driven global economy will have many...

Download PDF file
  • EP ID EP45558
  • DOI http://dx.doi.org/10.4108/eai.19-8-2015.2260108
  • Views 376
  • Downloads 0

How To Cite

Wenchen Cheng, Jiang Qian, Zhicheng Zhao, Fei Su (2015). Large Scale Cross-media Data Retrieval based on Hadoop. EAI Endorsed Transactions on Cloud Systems, 1(2), -. https://europub.co.uk/articles/-A-45558