Tokenization and its challenges in Sindhi language
Journal Title: INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND EMERGING TECHNOLOGIES - Year 2017, Vol 1, Issue 1
Abstract
Natural language processing (NLP) is a branch of artificial intelligence (AI). It comprises computational techniques for the analysis and synthesis of natural language, together with their applications. Natural language understanding is the ability to understand spoken language. The Sindhi language has polymorphic characteristics. It is an old and complex language because of its semantic features, which makes tokenization a difficult task for Sindhi. Tokenization, also called word segmentation, divides text into words or other script units (numbers, letters). This research discusses the issues of tokenization. Many languages, such as Urdu, Sindhi, and Arabic, exhibit space insertion and space omission errors, so it is important to evaluate different algorithms on different corpora. In this research we use and develop the J. Mahar (JM) model on a corpus. The tokenizer was tested on 175,000 words (one lakh seventy-five thousand) of Sindhi text, and on this corpus the JM tokenizer achieves 96% accuracy.
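The two error types named in the abstract can be illustrated with a minimal sketch. This is not the J. Mahar model, whose details are not given here; it is a simple lexicon lookup chosen only for illustration. The function names (`merge_split_words`, `split_joined_word`, `tokenize`) and the toy lexicon are hypothetical, and the lexicon entries are Latin-script placeholder strings standing in for Sindhi words.

```python
def merge_split_words(tokens, lexicon):
    """Repair space-insertion errors: join two adjacent fragments when
    their concatenation is a known word and at least one fragment is not."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens):
            left, right = tokens[i], tokens[i + 1]
            if left + right in lexicon and (left not in lexicon or right not in lexicon):
                merged.append(left + right)
                i += 2
                continue
        merged.append(tokens[i])
        i += 1
    return merged


def split_joined_word(token, lexicon):
    """Repair space-omission errors with greedy longest-match segmentation.
    Returns [token] unchanged if the token cannot be fully segmented."""
    pieces = []
    start = 0
    while start < len(token):
        for end in range(len(token), start, -1):
            if token[start:end] in lexicon:
                pieces.append(token[start:end])
                start = end
                break
        else:
            # No lexicon entry starts at this position: give up on splitting.
            return [token]
    return pieces


def tokenize(text, lexicon):
    """Whitespace split, then repair insertion errors, then omission errors."""
    tokens = merge_split_words(text.split(), lexicon)
    result = []
    for token in tokens:
        if token in lexicon:
            result.append(token)
        else:
            result.extend(split_joined_word(token, lexicon))
    return result


# Hypothetical toy lexicon (placeholders, not real corpus data).
LEXICON = {"kitab", "ghar", "pani"}

# "kit ab" has an inserted space; "gharpani" has an omitted space.
tokens = tokenize("kit ab gharpani", LEXICON)
```

With the toy lexicon, `tokens` holds the three lexicon words in order. A greedy lexicon match like this fails on out-of-vocabulary words and ambiguous splits, which is part of why tokenization for such languages is treated as a research problem rather than a lookup.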