ERIC - Search Results

Generalizability Theory

Showing 151 to 165 of 734 results

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

Evaluating Score and Decision Consistency across Claims in a Validation Argument

Peer reviewed

Schmidgall, Jonathan – Applied Measurement in Education, 2017

This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…

Descriptors: Scores, Reliability, Validity, Generalizability Theory

Recommendations for Replication Research in Special Education: A Framework of Systematic, Conceptual Replications

Peer reviewed

Coyne, Michael D.; Cook, Bryan G.; Therrien, William J. – Remedial and Special Education, 2016

Special education researchers conduct studies that can be considered replications. However, they do not often refer to them as replication studies. The purpose of this article is to consider the potential benefits of conceptualizing special education intervention research within a framework of systematic, conceptual replication. Specifically, we…

Descriptors: Special Education, Replication (Evaluation), Research Needs, Research Methodology

Inter-Rater Reliability and Generalizability of Patient Note Scores Using a Scoring Rubric Based on the USMLE Step-2 CS Format

Peer reviewed

Park, Yoon Soo; Hyderi, Abbas; Bordage, Georges; Xing, Kuan; Yudkowsky, Rachel – Advances in Health Sciences Education, 2016

Recent changes to the patient note (PN) format of the United States Medical Licensing Examination have challenged medical schools to improve the instruction and assessment of students taking the Step-2 clinical skills examination. The purpose of this study was to gather validity evidence regarding response process and internal structure, focusing…

Descriptors: Interrater Reliability, Generalizability Theory, Licensing Examinations (Professions), Physicians

Three Conceptual Replication Studies in Group Theory

Peer reviewed

Melhuish, Kathleen – Journal for Research in Mathematics Education, 2018

Many studies in mathematics education research occur with a nonrepresentative sample and are never replicated. To challenge this paradigm, I designed a large-scale study evaluating student conceptions in group theory that surveyed a national, representative sample of students. By replicating questions previously used to build theory around student…

Descriptors: Replication (Evaluation), Scientific Research, Mathematics Education, Program Validation

Constructing and Evaluating a Validity Argument for the Final-Year Ward Simulation Exercise

Peer reviewed

Till, Hettie; Ker, Jean; Myford, Carol; Stirling, Kevin; Mires, Gary – Advances in Health Sciences Education, 2015

The authors report final-year ward simulation data from the University of Dundee Medical School. Faculty who designed this assessment intend for the final score to represent an individual senior medical student's level of clinical performance. The results are included in each student's portfolio as one source of evidence of the student's…

Descriptors: Foreign Countries, Simulation, Clinical Experience, Medical Education

Rater Reliability and Score Discrepancy under Holistic and Analytic Scoring of Second Language Writing

Peer reviewed

Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015

Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…

Descriptors: Evaluators, Reliability, Scores, Holistic Approach

Peer reviewed

Smith, Martin M.; Saklofske, Donald H.; Yan, Gonggu; Sherry, Simon B. – Measurement and Evaluation in Counseling and Development, 2016

This study supports the generalizability of perfectionistic strivings and concerns across Canadian and Chinese university students (N = 1,006) and demonstrates the importance of establishing measurement invariance prior to hypothesis testing with different groups. No latent mean difference in perfectionistic concerns was observed, but Canadian…

Descriptors: Foreign Countries, Cultural Differences, Personality Traits, Hypothesis Testing

On Generalizability of MOOC Models

Peer reviewed
PDF on ERIC

Kidzinski, Lukasz; Sharma, Kshitij; Boroujeni, Mina Shirvani; Dillenbourg, Pierre – International Educational Data Mining Society, 2016

The big data imposes the key problem of generalizability of the results. In the present contribution, we discuss statistical tools which can help to select variables adequate for target level of abstraction. We show that a model considered as over-fitted in one context can be accurate in another. We illustrate this notion with an example analysis…

Descriptors: Generalizability Theory, Online Courses, Large Group Instruction, Models

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

An Evaluation of the Answer Key Used in Determining the 7th Grade Students' Levels of Disciplined Mind in Terms of Generalizability Theory

Peer reviewed

Guler, Nese – Educational Research and Reviews, 2014

Nowadays, rapid changes in science and technology increase the demand of qualified individuals who have signs of disciplined mind which is highlighted in Howard Gardner's (2006) five minds as one type of mind. So, it is important to measure whether individuals have disciplined mind or not. Based on this idea, it is aimed to evaluate the…

Descriptors: Answer Keys, Reliability, Grade 7, Generalizability Theory

Cross-Cultural Generalizability of Year in School Effects: Negative Effects of Acceleration and Positive Effects of Retention on Academic Self-Concept

Peer reviewed

Marsh, Herbert W. – Journal of Educational Psychology, 2016

Given that the Big-Fish-Little-Pond-Effect, the negative effect of school-average achievement on academic self-concept, is one of the most robust findings in educational psychology (Marsh, Seaton et al., 2007), this research extends the theoretical model, based on social comparison theory, to study relative year in school effects (e.g., being 1…

Descriptors: Cross Cultural Studies, Acceleration (Education), Grade Repetition, Self Concept

Exploring the Reliability of Generic and Content-Specific Instructional Aspects in Physical Education Lessons

Peer reviewed

Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017

Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…

Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability

Using Generalizability Theory to Examine Different Concept Map Scoring Methods

Peer reviewed
PDF on ERIC

Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016

Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…

Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas

An Application of Multivariate Generalizability in Selection of Mathematically Gifted Students

Peer reviewed

Kim, Sungyeun; Berebitsky, Dan – EURASIA Journal of Mathematics, Science & Technology Education, 2016

This study investigates error sources and the effects of each error source to determine optimal weights of the composite score of teacher recommendation letters and self-introduction letters using multivariate generalizability theory. Data were collected from the science education institute for the gifted attached to the university located within…

Descriptors: Academically Gifted, Foreign Countries, Mathematics, Mathematics Instruction
