A Comparison of Logistic Regression Models for DIF Detection in Polytomous Items: The Effect of Small Sample Sizes and Non-Normality of Ability Distributions
Journal Title: International Journal of Assessment Tools in Education - Year 2015, Vol 2, Issue 1
Abstract
This study investigated the effectiveness of logistic regression models in detecting uniform and non-uniform differential item functioning (DIF) in polytomous items under small sample sizes and non-normal ability distributions. A simulation study compared three logistic regression models: the cumulative logits model, the continuation ratio model, and the adjacent categories model. The results showed that logistic regression was a powerful method for detecting DIF in polytomous items but was not useful for distinguishing the type of DIF. The continuation ratio model worked best for detecting uniform DIF, but the cumulative logits model gave more acceptable type I error rates. As sample size increased, type I error rates increased for the cumulative logits model. Skewness of the ability distributions reduced the power of logistic regression to detect non-uniform DIF. Small sample sizes reduced the power of logistic regression.
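The three models named above differ in which pairs of response categories each logit contrasts. The following is a minimal sketch of the textbook definitions, not the authors' simulation code; the function names and the example probabilities are hypothetical, and the direction of each contrast (for example, category k versus the categories above it in the continuation ratio form) is an assumption, since the abstract does not state the parameterization used.

```python
import math

def cumulative_logits(p):
    """log[P(Y <= k) / P(Y > k)] for each cut point k (assumed direction)."""
    out, cum = [], 0.0
    for k in range(len(p) - 1):
        cum += p[k]
        out.append(math.log(cum / (1.0 - cum)))
    return out

def continuation_ratio_logits(p):
    """log[P(Y = k) / P(Y > k)] for each category k except the last (assumed form)."""
    return [math.log(p[k] / sum(p[k + 1:])) for k in range(len(p) - 1)]

def adjacent_category_logits(p):
    """log[P(Y = k + 1) / P(Y = k)] for each pair of neighbouring categories."""
    return [math.log(p[k + 1] / p[k]) for k in range(len(p) - 1)]

# Hypothetical response probabilities for a four-category item.
probs = [0.25, 0.25, 0.25, 0.25]
cum = cumulative_logits(probs)
cr = continuation_ratio_logits(probs)
adj = adjacent_category_logits(probs)
```

In a logistic regression DIF analysis, each of these logits would typically be modeled as a function of the matching ability score, the group indicator, and their interaction, with the group term associated with uniform DIF and the interaction term with non-uniform DIF.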
Authors and Affiliations
Yasemin KAYA, Walter L. LEITE, M. David MILLER
Development of Pamukkale Piano Learning Style Scale
In musical instrument training, piano has been taught as a compulsory instrument in all departments of Music Education. It is thought that as a major instrument, piano plays a crucial role in music education. Without que...
The Colorado Learning Attitudes about Science Survey (CLASS): The Study of Validity and Reliability
The aim of this research is to adapt the Colorado Learning Attitudes about Science Survey (Adams et al., 2006) to Turkish and to examine its psychometric properties. The research was conducted on 400 9th grade students f...
Development of Self Directed Learning Skills Scale for Pre-Service Science Teachers
The aim of this study is to develop a valid and reliable instrument that can assess pre-service teachers’ self-directed learning skills. 140 students were included in this study for validity and reliability. Explora...
Is This Reliable Enough? Examining Classification Consistency and Accuracy in a Criterion-Referenced Test
One important step for assessing the quality of a test is to examine the reliability of test score interpretation. Which aspect of reliability is the most relevant depends on what type of test it is and how the scores ar...
Developing a Proof-of-Concept Selection Test for Entry into Primary Teacher Education Programs
The purpose of the study is to measure students' performance through different measurement tools and compare the findings through G Theory in order to identify the errors associated with the raters and items to improve t...