Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Generalizability Theory | 5 |
Grade 8 | 5 |
Test Reliability | 5 |
Foreign Countries | 3 |
Mathematics Tests | 3 |
Test Items | 3 |
Test Theory | 3 |
Error of Measurement | 2 |
Interrater Reliability | 2 |
Item Response Theory | 2 |
Scores | 2 |
More ▼ |
Source
Applied Measurement in… | 1 |
Behavioral Research and… | 1 |
Educational Sciences: Theory… | 1 |
International Journal of… | 1 |
Practical Assessment,… | 1 |
Author
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Atilgan, Hakan | 1 |
Basokcu, Tahsin Oguz | 1 |
Demir, Elif Kübra | 1 |
Gelbal, Selahattin | 1 |
Guler, Nese | 1 |
Huebner, Alan | 1 |
Ogretmen, Tuncay | 1 |
Pastor, Dena A. | 1 |
Skar, Gustaf B. | 1 |
More ▼ |
Publication Type
Reports - Research | 5 |
Journal Articles | 4 |
Numerical/Quantitative Data | 1 |
Education Level
Grade 8 | 5 |
Elementary Education | 4 |
Junior High Schools | 4 |
Middle Schools | 4 |
Secondary Education | 4 |
Intermediate Grades | 2 |
Elementary Secondary Education | 1 |
Grade 10 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
More ▼ |
Audience
Location
Norway | 1 |
Turkey | 1 |
Turkey (Ankara) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020
It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…
Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability
Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013
Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…
Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability