Showing 1 to 15 of 57 results
Peer reviewed
Maria Bolsinova; Jesper Tijmstra; Leslie Rutkowski; David Rutkowski – Journal of Educational and Behavioral Statistics, 2024
Profile analysis is one of the main tools for studying whether differential item functioning can be related to specific features of test items. While relevant, profile analysis in its current form has two restrictions that limit its usefulness in practice: It assumes that all test items have equal discrimination parameters, and it does not test…
Descriptors: Test Items, Item Analysis, Generalizability Theory, Achievement Tests
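The equal-discrimination restriction the authors flag is what separates the Rasch model from the two-parameter logistic (2PL) model. A minimal sketch in Python (the item parameters here are hypothetical, chosen only to show crossing response curves):

```python
import math

def irf_2pl(theta: float, a: float, b: float) -> float:
    """Two-parameter logistic item response function:
    P(X = 1 | theta) = 1 / (1 + exp(-a * (theta - b)))."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def irf_rasch(theta: float, b: float) -> float:
    """Rasch model: the 2PL with every discrimination fixed at a = 1."""
    return irf_2pl(theta, 1.0, b)

# Under the Rasch restriction, two items' response curves never cross;
# with unequal discriminations (a = 0.5 vs. a = 2.0) they do, which is
# the case an equal-discrimination profile analysis cannot represent.
p_low_a_below  = irf_2pl(-2.0, 0.5, 0.0)  # flat item, low ability
p_high_a_below = irf_2pl(-2.0, 2.0, 0.0)  # steep item, low ability
p_low_a_above  = irf_2pl(2.0, 0.5, 0.0)   # flat item, high ability
p_high_a_above = irf_2pl(2.0, 2.0, 0.0)   # steep item, high ability
```

At low ability the flat item is easier; at high ability the steep item is, so the curves cross exactly where the Rasch restriction says they cannot.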
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and the precision of item parameter estimates under Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
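The diminishing-returns pattern the study examines follows from the usual large-sample behavior of estimators: standard errors shrink roughly as 1/sqrt(N), so each doubling of the sample buys less precision than the last. A hedged illustration (the scale constant c is arbitrary, not an estimate from the Battelle data):

```python
import math

def approx_se(n: int, c: float = 1.0) -> float:
    """Rough large-sample standard error of an item parameter estimate,
    assuming the usual 1/sqrt(N) rate; c is an arbitrary scale constant."""
    return c / math.sqrt(n)

# Precision gained by doubling the sample, at successively larger N:
gains = [approx_se(n) - approx_se(2 * n) for n in (100, 400, 1600)]
# Each doubling yields a smaller absolute gain than the one before it.
```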
Peer reviewed
Anthony, Christopher J.; Styck, Kara M.; Volpe, Robert J.; Robert, Christopher R. – School Psychology, 2023
Although originally conceived of as a marriage of direct behavioral observation and indirect behavior rating scales, recent research has indicated that Direct Behavior Ratings (DBRs) are affected by rater idiosyncrasies (rater effects) similar to other indirect forms of behavioral assessment. Most of this research has been conducted using…
Descriptors: Item Response Theory, Generalizability Theory, Interrater Reliability, Behavior Rating Scales
Peer reviewed
Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022
In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…
Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity
Peer reviewed
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, different methods exist for disentangling variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
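For a fully crossed person x prompt x rater design, generalizability theory expresses score reliability as the ratio of person variance to person variance plus averaged error variance. A minimal sketch of the relative generalizability coefficient (the variance components below are made-up numbers, not results from the article):

```python
def g_coefficient(var_p: float, var_pt: float, var_pr: float, var_ptre: float,
                  n_prompts: int, n_raters: int) -> float:
    """Relative G coefficient for a crossed person (p) x prompt (t) x
    rater (r) design: the relative error variance averages the
    person-by-facet interactions and the residual over the numbers of
    prompts and raters."""
    rel_error = (var_pt / n_prompts + var_pr / n_raters
                 + var_ptre / (n_prompts * n_raters))
    return var_p / (var_p + rel_error)

# Adding prompts or raters shrinks the averaged error term, so the
# coefficient rises toward 1 -- the "decision study" logic of GT.
g_small = g_coefficient(1.0, 0.5, 0.5, 0.5, n_prompts=2, n_raters=2)
g_large = g_coefficient(1.0, 0.5, 0.5, 0.5, n_prompts=4, n_raters=4)
```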
Peer reviewed
Wind, Stefanie A.; Jones, Eli – Educational Researcher, 2019
Teacher evaluation systems often include classroom observations in which raters use rating scales to evaluate teachers' effectiveness. Recently, researchers have promoted the use of multifaceted approaches to investigating reliability using Generalizability theory, instead of rater reliability statistics. Generalizability theory allows analysts to…
Descriptors: Teacher Evaluation, Observation, Generalizability Theory, Item Response Theory
Peer reviewed
Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019
The R software provides packages and functions for data analysis in classical true score theory, generalizability theory, item response theory, and Rasch measurement theory. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…
Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory
Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Grantee Submission, 2020
In this study, we examined the scoring and generalizability assumptions of an Explicit Instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…
Descriptors: Direct Instruction, Teacher Evaluation, Classroom Observation Techniques, Validity
Peer reviewed
Robert Schoen; Lanrong Li; Xiaotong Yang; Ahmet Guven; Claire Riddell – Society for Research on Educational Effectiveness, 2021
Many classroom-observation instruments have been developed (e.g., Gleason et al., 2017; Nava et al., 2019; Sawada et al., 2002), but a very small number of studies published in refereed journals have rigorously examined the quality of the ratings and the instrument using measurement models. For example, Gleason et al. developed a mathematics…
Descriptors: Item Response Theory, Models, Measurement, Mathematics Instruction
Peer reviewed
Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021
The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…
Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)
Peer reviewed
Dogan, C. Deha; Uluman, Müge – Educational Sciences: Theory and Practice, 2017
The aim of this study was to determine the extent to which graded-category rating scales and rubrics contribute to inter-rater reliability. The research was designed as a correlational study. The study group consisted of 82 sixth-grade students and three writing-course teachers in a private elementary school. A performance task was…
Descriptors: Comparative Analysis, Scoring Rubrics, Rating Scales, Interrater Reliability
Peer reviewed
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
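The underestimation Li describes can be seen in miniature with Fisher information: a unidimensional conditional-independence IRT model sums information over items, and when testlet items are actually locally dependent, that sum overstates precision. A simplified Rasch-based sketch (an illustration of the problem, not Li's actual correction method):

```python
import math

def p_rasch(theta: float, b: float) -> float:
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def naive_information(theta: float, difficulties: list) -> float:
    """Test information under conditional independence: sum of P(1 - P).
    With testlet dependence, effective information is lower, so the
    standard error SE(theta) = 1/sqrt(I) computed this way is too small."""
    return sum(p_rasch(theta, b) * (1.0 - p_rasch(theta, b))
               for b in difficulties)

# Four items of difficulty 0 evaluated at theta = 0:
info = naive_information(0.0, [0.0, 0.0, 0.0, 0.0])  # 4 * 0.25 = 1.0
se_naive = 1.0 / math.sqrt(info)                     # 1.0
```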
Peer reviewed
Till, Hettie; Ker, Jean; Myford, Carol; Stirling, Kevin; Mires, Gary – Advances in Health Sciences Education, 2015
The authors report final-year ward simulation data from the University of Dundee Medical School. Faculty who designed this assessment intend for the final score to represent an individual senior medical student's level of clinical performance. The results are included in each student's portfolio as one source of evidence of the student's…
Descriptors: Foreign Countries, Simulation, Clinical Experience, Medical Education
Peer reviewed
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018
This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…
Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods
Peer reviewed
Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016
As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. In peer assessment, however, a persistent problem is that reliability depends on rater characteristics. For this reason, several item response models that incorporate rater parameters have been proposed. Those models are expected to improve…
Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation
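Rater-parameter IRT models of the kind studied here typically extend the Rasch form with a rater severity term: a harsher rater lowers the probability of credit for the same examinee and item. A minimal dichotomous sketch in the many-facet Rasch style (the parameter values are hypothetical):

```python
import math

def p_mfrm(theta: float, b: float, severity: float) -> float:
    """Many-facet Rasch style probability: examinee ability theta, item
    difficulty b, and rater severity all enter on the same logit scale."""
    return 1.0 / (1.0 + math.exp(-(theta - b - severity)))

# Same examinee and item, two raters: the severe rater (severity 1.0)
# is less likely to award credit than the lenient one (severity -1.0).
p_lenient = p_mfrm(0.5, 0.0, -1.0)
p_severe  = p_mfrm(0.5, 0.0,  1.0)
```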