Associative Measures and Multi-word Unit Extraction in Turkish
Journal Title: Mersin Üniversitesi Dil ve Edebiyat Dergisi - Year 2015, Vol 12, Issue 1
Abstract
Associative measures are “mathematical formulas determining the strength of association between two or more words based on their occurrences and cooccurrences in a text corpus” (Pecina, 2010, p. 138). The purpose of this paper is to test the 12 associative measures that Text-NSP (Banerjee & Pedersen, 2003) contains on a 10-million-word subcorpus of Turkish National Corpus (TNC) (Aksan et.al., 2012). A statistical comparison of those measures is out of the scope of the study, and the measures will be evaluated according to the linguistic relevance of the rankings they provide. The focus of the study is basically on optimizing the corpus data, before applying the measures and then, evaluating the rankings produced by these measures as a whole, not on the linguistic relevance of individual n-grams. The findings include intra-linguistically relevant associative measures for a comma representative, 10-million-word corpus of Turkish. splitted, delimited, sentence lower-cased, well-balanced
Authors and Affiliations
Ümit Mersinli
The Analysis of Arkadaş Türkçe Sözlük (Arkadaş Turkish Dictionary) and the Suggested Modifications for its Learner's Version
This study aims to draw attention to two of the problems that Turkish lexicography faces today. One of these problems is that there are not any Turkish to Turkish dictionaries that have been prepared for the people who a...
Drama, Minorities and The Ottoman Empire
The birth of ‘Turkic Drama’ within the dramatic rise of nationalist eulogy is, as opposed to popular belief, principally grounded on the theatric activities of ethnic and religious minorities in a non-Western society, th...
Colligational Patterns of Turkish Multi-Word Units
In multi-word unit (MWU) extraction studies, most of the challenges for rich morphology languages like Turkish can be overcome by the study of how colligational filtering works in our minds, along with how statistical an...
Displaying Prior Knowledge and Emergent Interactional Troubles in Standardised Patient-Medical Student Interaction
Previous research on medical interaction has shown that there is a link between effective doctor-patient interaction and success in medical services (Drew, Chatwin ve Collins, 2001). In relation to this, there are now a...
The Teller/Receiver-Oriented Functions of Ondan Sonra As A Discourse Marker in Conversational Narratives
Discourse markers that are largely used in everyday talk carry out various functions in conversations. One of the conversational genres in which discourse markers are highly used is conversational narrative. Conversation...