Approaches for Managing and Analyzing Unstructured Data
Journal Title: International Journal on Computer Science and Engineering - Year 2014, Vol 6, Issue 1
Abstract
Large volumes of data that will be stored and accessed in future is unstructured. The unstructured data is generated in a very fast pace and uses large storage areas. This increases the storage budget. Extracting value from this unstructured data which balances the budget is the most challenging task. Archives of interactive media, satellite and medical images, information from social network sites, legal documents, presentations and web pages from various data sources affects the data center's ability to maintain control over the unstructured data. Therefore, it is very essential to design systems to provide efficient storage, and access to these vast and continuously growing repositories of unstructured data. This can be achieved by retrieving structured information from the unstructured data. In this paper, we discuss approaches to process and manage such data. We also elaborate the architecture, technologies and applications to facilitate system design and evaluation.
Authors and Affiliations
N. Veeranjaneyulu , M. Nirupama Bhat , A. Raghunath
DCF Improvement for Satisfactory Throughput of 802.11 WLAN
As demand for deployment and usage is increased in WLAN environment, achieving satisfactory throughput is one of the challenging issues. Initially as WLAN environment is data centric, the best effort delivery based proto...
Intrusion Detection using unsupervised learning
Clustering is the one of the efficient datamining techniques for intrusion detection. In clustering algorithm kmean clustering is widely used for intrusion detection. Because it gives efficient results incase of huge dat...
MSMET: A MODIFIED & SECURE MULTILANGUAGE ENCRYPTION TECHNIQUE
Cryptography plays an integral role in secure communication and is usually the strongest link in the chain of security. Multilanguage cryptography, an advancement of classical cryptography, may evolve as a choice of clas...
GA Based Test Case Generation Approach for Formation of Efficient Set of Dynamic
Automated test case generation is an efficient approach for software testing. Slicing of program provides ease to testability and enhances debugging capacity. To generate the dynamic slice, slicing criterion is required...
Effective Term Based Text Clustering Algorithms
Text clustering methods can be used to group large sets of text documents. Most of the text clustering methods do not address the problems of text clustering such as very high dimensionality of the data and understandabi...