ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4

Descriptor

Error of Measurement	4
Test Theory	4
Test Reliability	3
Foreign Countries	2
Generalizability Theory	2
Item Response Theory	2
Comparative Analysis	1
Evaluation Methods	1
Failure	1
Grade 8	1
High Stakes Tests	1
Interrater Reliability	1
Language Proficiency	1
Language Tests	1
Measurement	1
Models	1
Multiple Choice Tests	1
Probability	1
Scores	1
Scoring	1
Second Language Instruction	1
Second Language Learning	1
Students	1
Test Length	1
Time on Task	1
More ▼

Source

Educational and Psychological…	1
International Journal of…	1
International Online Journal…	1
Practical Assessment,…	1

Author

Aksu, Gökhan	1
Ellis, Jules L.	1
Eser, Mehmet Taha	1
Huebner, Alan	1
Polat, Murat	1
Skar, Gustaf B.	1

Publication Type

Journal Articles	4
Reports - Research	4

Education Level

Elementary Education	1
Grade 8	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Norway	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 4 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests