ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	5

Descriptor

Generalizability Theory	5
Grade 8	5
Test Reliability	5
Foreign Countries	3
Mathematics Tests	3
Test Items	3
Test Theory	3
Error of Measurement	2
Interrater Reliability	2
Item Response Theory	2
Scores	2
Academic Standards	1
Alignment (Education)	1
Alternative Assessment	1
Curriculum Based Assessment	1
Difficulty Level	1
Disabilities	1
Grade 10	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 9	1
Middle Schools	1
Reading Tests	1
Secondary School Students	1
More ▼

Source

Applied Measurement in…	1
Behavioral Research and…	1
Educational Sciences: Theory…	1
International Journal of…	1
Practical Assessment,…	1

Author

Alonzo, Julie	1
Anderson, Daniel	1
Atilgan, Hakan	1
Basokcu, Tahsin Oguz	1
Demir, Elif Kübra	1
Gelbal, Selahattin	1
Guler, Nese	1
Huebner, Alan	1
Ogretmen, Tuncay	1
Pastor, Dena A.	1
Skar, Gustaf B.	1
Taylor, Melinda Ann	1
Tindal, Gerald	1
More ▼

Publication Type

Reports - Research	5
Journal Articles	4
Numerical/Quantitative Data	1

Education Level

Grade 8	5
Elementary Education	4
Junior High Schools	4
Middle Schools	4
Secondary Education	4
Intermediate Grades	2
Elementary Secondary Education	1
Grade 10	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 9	1
High Schools	1
More ▼

Audience

Location

Norway	1
Turkey	1
Turkey (Ankara)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed
PDF on ERIC

Download full text

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Direct link

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Download full text

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Studying Reliability of Open Ended Mathematics Items According to the Classical Test Theory and Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010

In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability