Discovery of Corrosion Patterns using Symbolic Time Series Representation and N-gram Model

Abstract

There are many factors that can contribute to corrosion in the pipeline. Therefore, it is important for decision makers to analyze and identify the main factor of corrosion in order to take appropriate actions. The factor of corrosion can be analyzed using data mining based on historical datasets collected from monitoring sensors. The purpose of this study is to analyze the trends of corroding agents for pipeline corrosion based on symbolic representation of time series corrosion dataset using Symbolic Aggregation Approximation (SAX). The paper presents the analysis and evaluation of the patterns using N-gram model. Text mining using N-gram model is proposed to mine trend changes from corrosion time series dataset that are transformed as symbolic representation. N-gram was applied for the analysis in order to find significant symbolic patterns that are represented as text. Pattern analysis is performed and the results are discussed according to each environmental factor of pipeline corrosion.

Authors and Affiliations

Shakirah Mohd Taib, Zahiah Akhma Mohd Zabidi, Izzatdin Abdul Aziz, Farahida Hanim Mousor, Azuraliza Abu Bakar, Ainul Akmar Mokhtar

Keywords

Related Articles

Crowding Optimization Method to Improve Fractal Image Compressions Based Iterated Function

Fractals are geometric patterns generated by Iterated Function System theory. A popular technique known as fractal image compression is based on this theory, which assumes that redundancy in an image can be exploited by...

Competitive Representation Based Classification Using Facial Noise Detection

Linear representation based face recognition is hotly studied in recent years. Competitive representation classification is a linear representation based method which uses the most competitive training samples to sparsel...

Real-Time Analysis of Students’ Activities on an E-Learning Platform based on Apache Spark

Real time analytics is the capacity to extract valuables insights from data that comes continuously from activities on the web or network sensors. It is largely used in web based business to drive decisions based on user...

Tri-Band Fractal Patch Antenna for GSM and Satellite Communication Systems

Due to their smaller size and light weighted structures patch antennas are accustomed in modern communication Technology. With additional size in reduction, micro strip antennas are commonly used in handsets, GPS receive...

Capacitated Vehicle Routing Problem Solving using Adaptive Sweep and Velocity Tentative PSO

Vehicle Routing Problem (VRP) has become an integral part in logistic operations which determines optimal routes for several vehicles to serve customers. The basic version of VRP is Capacitated VRP (CVRP) which considers...

Download PDF file
  • EP ID EP429248
  • DOI 10.14569/IJACSA.2018.091278
  • Views 111
  • Downloads 0

How To Cite

Shakirah Mohd Taib, Zahiah Akhma Mohd Zabidi, Izzatdin Abdul Aziz, Farahida Hanim Mousor, Azuraliza Abu Bakar, Ainul Akmar Mokhtar (2018). Discovery of Corrosion Patterns using Symbolic Time Series Representation and N-gram Model. International Journal of Advanced Computer Science & Applications, 9(12), 554-560. https://europub.co.uk/articles/-A-429248