Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Generalizability Theory | 4 |
Grade 8 | 4 |
Interrater Reliability | 4 |
Foreign Countries | 2 |
Test Items | 2 |
Test Reliability | 2 |
Automation | 1 |
Computer Assisted Testing | 1 |
Concept Mapping | 1 |
Equated Scores | 1 |
Essays | 1 |
More ▼ |
Source
Applied Measurement in… | 1 |
Educational Sciences: Theory… | 1 |
International Journal of… | 1 |
Journal of Technology,… | 1 |
Author
Atilgan, Hakan | 1 |
Basokcu, Tahsin Oguz | 1 |
Ben-Simon, Anat | 1 |
Bennett, Randy Elliott | 1 |
Demir, Elif Kübra | 1 |
Gelbal, Selahattin | 1 |
Guler, Nese | 1 |
Ogretmen, Tuncay | 1 |
Shavelson, Richard J. | 1 |
Yin, Yue | 1 |
Publication Type
Journal Articles | 4 |
Reports - Research | 4 |
Education Level
Grade 8 | 4 |
Elementary Education | 2 |
Grade 9 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
California | 1 |
Turkey | 1 |
Turkey (Ankara) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020
It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…
Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability
Yin, Yue; Shavelson, Richard J. – Applied Measurement in Education, 2008
In the first part of this article, the use of Generalizability (G) theory in examining the dependability of concept map assessment scores and designing a concept map assessment for a particular practical application is discussed. In the second part, the application of G theory is demonstrated by comparing the technical qualities of two frequently…
Descriptors: Generalizability Theory, Concept Mapping, Validity, Reliability
Ben-Simon, Anat; Bennett, Randy Elliott – Journal of Technology, Learning, and Assessment, 2007
This study evaluated a "substantively driven" method for scoring NAEP writing assessments automatically. The study used variations of an existing commercial program, e-rater[R], to compare the performance of three approaches to automated essay scoring: a "brute-empirical" approach in which variables are selected and weighted solely according to…
Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays