Showing all 8 results
Jiangqiong Li – ProQuest LLC, 2024
When measuring latent constructs, for example, language ability, we use statistical models to specify appropriate relationships between the latent construct and observed responses to test items. These models rely on theoretical assumptions to ensure accurate parameter estimates for valid inferences based on the test results. This dissertation…
Descriptors: Goodness of Fit, Item Response Theory, Models, Measurement Techniques
Peer reviewed
Direct link
DeMars, Christine E. – Educational and Psychological Measurement, 2019
Previous work showing that revised parallel analysis can be effective with dichotomous items has used a two-parameter model and normally distributed abilities. In this study, both two- and three-parameter models were used with normally distributed and skewed ability distributions. Relatively minor skew and kurtosis in the underlying ability…
Descriptors: Item Analysis, Models, Error of Measurement, Item Response Theory
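The two- and three-parameter models referenced in the DeMars abstract are standard item response theory (IRT) response functions. A minimal sketch of both, with hypothetical item parameters chosen only for illustration:

```python
import numpy as np

def irt_prob(theta, a, b, c=0.0):
    """Probability of a correct response under the 3PL model:
    P = c + (1 - c) / (1 + exp(-a * (theta - b))).
    Setting c=0 reduces this to the 2PL model."""
    return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

# Illustrative item: discrimination a=1.2, difficulty b=0.5,
# guessing parameter c=0.2, examinee of average ability (theta=0).
p2 = irt_prob(0.0, a=1.2, b=0.5)          # 2PL probability
p3 = irt_prob(0.0, a=1.2, b=0.5, c=0.2)   # 3PL probability
```

The 3PL probability is always at least as large as the 2PL probability for the same `a` and `b`, since the guessing parameter `c` sets a lower asymptote.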
Peer reviewed
Direct link
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
Peer reviewed
Direct link
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
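In an Angoff study like the one Clauser et al. describe, each judge rates, for every item, the probability that a minimally competent candidate answers correctly; the panel cut score is typically the mean of the judges' mean ratings, and judge-to-judge variability contributes error to it. A sketch with fabricated ratings, for illustration only:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical Angoff ratings: 5 judges x 20 items, each entry the
# judged probability of a correct response by a borderline candidate.
ratings = rng.uniform(0.3, 0.9, size=(5, 20))

judge_means = ratings.mean(axis=1)   # each judge's implied cut score
cut_score = judge_means.mean()       # panel cut score (proportion correct)
# Standard error of the cut score attributable to judge variability
se_judges = judge_means.std(ddof=1) / np.sqrt(len(judge_means))
```

A full generalizability analysis, as in the abstract, would also partition variance due to items and panels; this sketch isolates only the judge facet.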
Peer reviewed
PDF on ERIC Download full text
Cikrikci, Nukhet; Yalcin, Seher; Kalender, Ilker; Gul, Emrah; Ayan, Cansu; Uyumaz, Gizem; Sahin-Kursad, Merve; Kamis, Omer – International Journal of Assessment Tools in Education, 2020
This study tested the applicability of the theoretical Examination for Candidates of Driving License (ECODL) in Turkey as a computerized adaptive test (CAT). Firstly, various simulation conditions were tested for the live CAT through an item response theory-based calibrated item bank. The application of the simulated CAT was based on data from…
Descriptors: Motor Vehicles, Traffic Safety, Computer Assisted Testing, Item Response Theory
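A core step in the IRT-based CAT the Cikrikci et al. abstract describes is selecting the next item from the calibrated bank. A common rule, sketched here under the 2PL model with a hypothetical item bank, is to pick the unadministered item with maximum Fisher information at the current ability estimate:

```python
import numpy as np

def p2pl(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def next_item(theta, a, b, administered):
    """Return the index of the unadministered item with maximum
    Fisher information at theta (2PL: I = a^2 * P * (1 - P))."""
    p = p2pl(theta, a, b)
    info = a ** 2 * p * (1.0 - p)
    info[list(administered)] = -np.inf  # exclude used items
    return int(np.argmax(info))

# Hypothetical 3-item bank; at theta=0 the item with b=0 is most informative.
a_bank = np.array([1.0, 1.0, 1.0])
b_bank = np.array([0.0, 2.0, -2.0])
first = next_item(0.0, a_bank, b_bank, set())
```

Maximum-information selection is only one of several rules used in operational CATs; exposure control and content balancing usually constrain it in practice.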
Peer reviewed
PDF on ERIC Download full text
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize tests in educational policies and quality assurance, their validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Peer reviewed
Direct link
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007
Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…
Descriptors: Psychiatry, Patients, Error of Measurement, Test Length
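The question Emons et al. pose, whether a short test's measurement error is small enough for classification, can be illustrated with classical test theory quantities. A minimal sketch, assuming normally distributed measurement error and hypothetical test statistics:

```python
import math

def sem(sd, reliability):
    """Classical standard error of measurement: SD * sqrt(1 - rxx)."""
    return sd * math.sqrt(1.0 - reliability)

def classification_certainty(observed, cutoff, sd, reliability):
    """Probability that the true score falls on the same side of the
    cutoff as the observed score, assuming normal error (a deliberate
    simplification of the classical test theory setup)."""
    z = abs(observed - cutoff) / sem(sd, reliability)
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))  # normal CDF

# Hypothetical short test: SD = 10, reliability = 0.75, cutoff = 50.
certainty = classification_certainty(55, 50, 10, 0.75)
```

With these illustrative numbers the SEM is 5 points, so an observed score one SEM above the cutoff yields roughly 84% certainty, showing how large error in short tests erodes classification confidence near the cut.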
Colton, Dean A.; Gao, Xiaohong; Harris, Deborah J.; Kolen, Michael J.; Martinovich-Barhite, Dara; Wang, Tianyou; Welch, Catherine J. – 1997
This collection consists of six papers, each dealing with some aspects of reliability and performance testing. Each paper has an abstract, and each contains its own references. Papers include: (1) "Using Reliabilities To Make Decisions" (Deborah J. Harris); (2) "Conditional Standard Errors, Reliability, and Decision Consistency…
Descriptors: Decision Making, Error of Measurement, Item Response Theory, Performance Based Assessment