Discovery of Corrosion Patterns using Symbolic Time Series Representation and N-gram Model
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 12
Abstract
There are many factors that can contribute to corrosion in the pipeline. Therefore, it is important for decision makers to analyze and identify the main factor of corrosion in order to take appropriate actions. The factor of corrosion can be analyzed using data mining based on historical datasets collected from monitoring sensors. The purpose of this study is to analyze the trends of corroding agents for pipeline corrosion based on symbolic representation of time series corrosion dataset using Symbolic Aggregation Approximation (SAX). The paper presents the analysis and evaluation of the patterns using N-gram model. Text mining using N-gram model is proposed to mine trend changes from corrosion time series dataset that are transformed as symbolic representation. N-gram was applied for the analysis in order to find significant symbolic patterns that are represented as text. Pattern analysis is performed and the results are discussed according to each environmental factor of pipeline corrosion.
Authors and Affiliations
Shakirah Mohd Taib, Zahiah Akhma Mohd Zabidi, Izzatdin Abdul Aziz, Farahida Hanim Mousor, Azuraliza Abu Bakar, Ainul Akmar Mokhtar
Crowding Optimization Method to Improve Fractal Image Compressions Based Iterated Function
Fractals are geometric patterns generated by Iterated Function System theory. A popular technique known as fractal image compression is based on this theory, which assumes that redundancy in an image can be exploited by...
Competitive Representation Based Classification Using Facial Noise Detection
Linear representation based face recognition is hotly studied in recent years. Competitive representation classification is a linear representation based method which uses the most competitive training samples to sparsel...
Real-Time Analysis of Students’ Activities on an E-Learning Platform based on Apache Spark
Real time analytics is the capacity to extract valuables insights from data that comes continuously from activities on the web or network sensors. It is largely used in web based business to drive decisions based on user...
Tri-Band Fractal Patch Antenna for GSM and Satellite Communication Systems
Due to their smaller size and light weighted structures patch antennas are accustomed in modern communication Technology. With additional size in reduction, micro strip antennas are commonly used in handsets, GPS receive...
Capacitated Vehicle Routing Problem Solving using Adaptive Sweep and Velocity Tentative PSO
Vehicle Routing Problem (VRP) has become an integral part in logistic operations which determines optimal routes for several vehicles to serve customers. The basic version of VRP is Capacitated VRP (CVRP) which considers...