Efficient Deep Learning Approach for Dimensionality Reduction using Micro blogs from Big data

Abstract

Nowadays Information Technology plays a vital role in every aspects of the human life. Now a world, the huge amount of stored information has been enormously increasing day by day which is generally in the unstructured form and cannot be used for any processing to extract useful information. Exploring potentially useful information from huge amount of textual data produced by micro blogging services has attracted much attention in recent years. An important preprocessing step of micro blog text mining is to convert natural language texts into proper numerical representations. Due to the short-length characteristics of micro blog texts, using term frequency vectors to represent micro blog texts will cause “sparse data” problem. Finding proper representations of micro blog texts is a challenging issue. In this project, we apply deep learning networks to map the high-dimensional representations of micro blog texts to low-dimensional representations. To improve the result of dimensionality reduction, we take advantage of the semantic similarity derived from two types of micro blog specific information, namely the retweet relationship and hash tags. Two types of approaches, including modifying training data and modifying the training objective of deep networks, are proposed to make use of micro blog-specific information. To improve the efficiency we implement the system in Hadoop. In addition to that to make services effective. To achieve the scalability and efficiency with help of map reduce framework in a big data environment.

Authors and Affiliations

Mr. M. Vengateshwaran, Mrs. C. Ramyapriyadarsini, Ms. N. Valarmathi

Keywords

Related Articles

Fuzzy HX Subring of a HX Ring

In this paper, we define the concept of a fuzzy HX ring and define a new algebraic structure of a fuzzy HX subring of a HX ring. We also discuss some related properties of it.

Generating Content Searchable Cipher Texts with Semantic Security

Many systems which uses various content distribution over the network requires access restriction, prevention from unauthorized access and hiding the identity of users. Current systems face problems there are some types...

Clustering Ensembles Using Evolutionary Algorithm

Data clustering is an important task and applied in various real-world problems. Since, not a single clustering algorithm is able to identify all types of cluster shapes and structures. Ensemble clustering was proposed...

Thermal Analysis of Radiator with Different Nano Fluids

The advancement in automobile technology is increasing day to day. The efficiency of the engine depends on heat transfer rate of radiator in automobile and further it relays on flow capacity of fluids and material used...

Performance Analysis of Three Phase Five-Level Inverters Using Multi-Carrier PWM Technique

This Paper presents investigation of most popular topologies so called Cascaded H-Bridge Multilevel Inverter, Neutral Point Clamped or the Diode Clamped Multilevel Inverter and flying Capacitor five-level Inverter. Deta...

Download PDF file
  • EP ID EP23199
  • DOI http://doi.org/10.22214/ijraset.2017.3002
  • Views 299
  • Downloads 6

How To Cite

Mr. M. Vengateshwaran, Mrs. C. Ramyapriyadarsini, Ms. N. Valarmathi (2017). Efficient Deep Learning Approach for Dimensionality Reduction using Micro blogs from Big data. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(3), -. https://europub.co.uk/articles/-A-23199