Isolating Natural Problem Environments in Unconstrained Natural Language Processing: Corruption and Skew

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2017, Vol 5, Issue 3

Abstract

This work examines the full range of commonly available natural language processors' behaviors in a natural, unconstrained, and unguided environment. While permissible for typical research to constrain the language environment and to use in-depth knowledge to guide the processor for enhanced accuracy, this work purposefully avoids a clean laboratory in favor of a natural, chaotic, and uncontrollable environment. This shifts the focus towards natural processor behaviors in natural, unknown environments. This work provides a standardized comparison framework to compare and contrast each of a full range of processors' theoretical strengths. It continues to examine empirical behaviors on a full range of environments from typically used baseline sample documents, to actual raw natural texts used in an intent marketing business, to a series of increasingly corrupted and inconsistent sample documents to further differentiate processor behaviors. In all cases, the texts are unconstrained and the processors operate in their most na�ve, default forms. Results complement and extend prior work. It adds that accuracy-centric processors like artificial neural networks or support vector machines require both highly constrained environments and in-depth knowledge of the processor to operate. Descriptive-centric processors like k-nearest neighbors, Rocchio, and na�ve Bayes require only highly constrained environments. An explanatory-centric neurocognitive processor like Adaptive Resonance Theory can operate robustly with neither environmental constraint nor in-depth processing knowledge, but exposes operations to basic human temporal neurocognitive behaviors

Authors and Affiliations

Charles Wong

Keywords

Related Articles

Approaching Mental Disorders from the Engineering Point of View

Mental illness and mental disorders represent an increasing burden affectingthepopulation of all ages at all places, challenging mental health and health systems and contributing to the onset or to the acceleration of ma...

A Mobile Tropical Cooling System Design Using a Thermoelectric Module

The research conducted and described in this paper focuses on the design of a cooling system using thermoelectric or peltier module (TEM) and heatsinks with considerations for temperate climatic conditions. A case study...

Remote-controlled Laboratories of Experimental Physics: Measuring the Stiffness of a Spring

In modern ELearning, students and scientists will be able to access Web portals for scientific computer and data infrastructures, thus accessing large collections of data and digital objects using metadata, knowledge man...

Detection of Damage in Rail Head by Using SAFE Method

The first cause of train derailment incidents is due to the transverse defect which occurs in the railhead. It is typically an open or internal crack developing in a plane which is generally perpendicular to the directio...

Remote Assessment of Disease and Relapse– Central Nervous System-RADAR-CNS

Remote Assessment of Disease and Relapse in Central Nervous System Disorders (RADARCNS) is a major international research project. It aims to develop new ways of measuring major depressive disorder, epilepsy and multiple...

Download PDF file
  • EP ID EP275519
  • DOI 10.14738/tmlai.53.3229
  • Views 86
  • Downloads 0

How To Cite

Charles Wong (2017). Isolating Natural Problem Environments in Unconstrained Natural Language Processing: Corruption and Skew. Transactions on Machine Learning and Artificial Intelligence, 5(3), 28-42. https://europub.co.uk/articles/-A-275519