ERIC - Search Results

Showing 1 to 15 of 22 results

A Short Note on Optimizing Cost-Generalizability via a Machine-Learning Approach

Peer reviewed

Jiang, Zhehan; Shi, Dexin; Distefano, Christine – Educational and Psychological Measurement, 2021

The costs of an objective structured clinical examination (OSCE) are of concern to health profession educators globally. As OSCEs are usually designed under a generalizability theory (G-theory) framework, this article proposes a machine-learning-based approach to optimize the costs while maintaining the minimum required generalizability…

Descriptors: Artificial Intelligence, Generalizability Theory, Objective Tests, Foreign Countries

Comparison of G and Phi Coefficients Estimated in Generalizability Theory with Real Cases

Peer reviewed
PDF on ERIC

Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021

This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…

Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed
PDF on ERIC

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Reliability of Essay Ratings: A Study on Generalizability Theory

Peer reviewed
PDF on ERIC

Atilgan, Hakan – Eurasian Journal of Educational Research, 2019

Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…

Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability

Reliability of the Analytic Rubric and Checklist for the Assessment of Story Writing Skills: G and Decision Study in Generalizability Theory

Peer reviewed
PDF on ERIC

Uzun, N. Bilge; Alici, Devrim; Aktas, Mehtap – European Journal of Educational Research, 2019

The purpose of the study is to examine the reliability of analytical rubrics and checklists developed for the assessment of story writing skills by means of generalizability theory. The study group consisted of 52 students attending the 5th grade at primary school and 20 raters at Mersin University. The G study was carried out with the fully crossed…

Descriptors: Foreign Countries, Scoring Rubrics, Check Lists, Writing Tests

Psychometric Properties of MATE: A Study Focused on Testing the Generalizability of the Measure of Acceptance of the Theory of Evolution

Peer reviewed

Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021

The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…

Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed
PDF on ERIC

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

Generalizability Theory Research on Developing a Scoring Rubric to Assess Primary School Students' Problem Posing Skills

Peer reviewed

Cankoy, Osman; Özder, Hasan – EURASIA Journal of Mathematics, Science & Technology Education, 2017

The aim of this study is to develop a scoring rubric to assess primary school students' problem posing skills. The rubric including five dimensions namely solvability, reasonability, mathematical structure, context and language was used. The raters scored the students' problem posing skills both with and without the scoring rubric to test the…

Descriptors: Generalizability Theory, Elementary School Students, Foreign Countries, Problem Solving

Exploring the Reliability of Generic and Content-Specific Instructional Aspects in Physical Education Lessons

Peer reviewed

Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017

Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…

Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability

Measurement Quality of the Chinese Early Childhood Program Rating Scale: An Investigation Using Multivariate Generalizability Theory

Peer reviewed

Chen, Dezhi; Hu, Bi Ying; Fan, Xitao; Li, Kejian – Journal of Psychoeducational Assessment, 2014

Adapted from the Early Childhood Environment Rating Scale-Revised, the Chinese Early Childhood Program Rating Scale (CECPRS) is a culturally comparable measure for assessing the quality of early childhood education and care programs in the Chinese cultural/social contexts. In this study, 176 kindergarten classrooms were rated with CECPRS on eight…

Descriptors: Foreign Countries, Rating Scales, Early Childhood Education, Educational Environment

The Effects of Testlets on Reliability and Differential Item Functioning

Peer reviewed
PDF on ERIC

Teker, Gulsen Tasdelen; Dogan, Nuri – Educational Sciences: Theory and Practice, 2015

Reliability and differential item functioning (DIF) analyses were conducted on testlets displaying local item dependence in this study. The data set employed in the research was obtained from the answers given by 1,500 students to the 20 items included in six testlets given in English Proficiency Exam by the School of Foreign Languages of a state…

Descriptors: Foreign Countries, Test Items, Test Bias, Item Response Theory

Assessing Reading Comprehension in Adolescent Low Achievers: Subskills Identification and Task Specificity

Peer reviewed

van Steensel, Roel; Oostdam, Ron; van Gelderen, Amos – Language Testing, 2013

On the basis of a validation study of a new test for assessing low-achieving adolescents' reading comprehension skills--the SALT-reading--we analyzed two issues relevant to the field of reading test development. Using the test results of 200 seventh graders, we examined the possibility of identifying reading comprehension subskills and the effects…

Descriptors: Adolescents, Low Achievement, Reading Comprehension, Reading Tests

The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

Peer reviewed

Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…

Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance
