ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	27

Descriptor

Statistical Analysis	79
Test Theory	79
Test Items	28
Test Reliability	24
Item Response Theory	18
Correlation	17
Foreign Countries	15
Item Analysis	15
Mathematical Models	15
Scores	14
Test Construction	14
Comparative Analysis	13
Psychometrics	13
Test Validity	13
Criterion Referenced Tests	12
Career Development	10
Error of Measurement	10
Testing Problems	10
Factor Analysis	9
Latent Trait Theory	9
Difficulty Level	8
Reading Tests	8
Simulation	8
Language Tests	7
Scoring	7
More ▼

Publication Type

Reports - Research	79
Journal Articles	40
Speeches/Meeting Papers	18
Tests/Questionnaires	3
Guides - Non-Classroom	1
Information Analyses	1
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Higher Education	9
Postsecondary Education	8
Elementary Education	6
Secondary Education	6
Middle Schools	5
Junior High Schools	4
Grade 8	3
Grade 7	2
Intermediate Grades	2
Early Childhood Education	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 6	1
High Schools	1
Primary Education	1
More ▼

Audience

Researchers

Location

Turkey	3
Belgium	1
Brazil	1
Canada	1
Colorado	1
Cyprus	1
Germany	1
Hong Kong	1
Indonesia	1
Italy	1
Pakistan	1
Spain	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	2
Comprehensive Tests of Basic…	2
National Assessment of…	2
Wechsler Intelligence Scale…	2
Armed Forces Qualification…	1
Armed Services Vocational…	1
Defining Issues Test	1
Law School Admission Test	1
Piers Harris Childrens Self…	1
SAT (College Admission Test)	1
SRA Primary Mental Abilities…	1
Strengths and Difficulties…	1
Tennessee Self Concept Scale	1
Test of English as a Foreign…	1
Writing Apprehension Test	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 79 results Save | Export

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Peer reviewed

Direct link

Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…

Descriptors: Test Bias, Interrater Reliability, Responses, Correlation

Estimating Treatment Effects with the Explanatory Item Response Model. EdWorkingPaper No. 22-677

Download full text

Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022

This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…

Descriptors: Item Response Theory, Models, Test Theory, Computation

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Modifying Spearman's Attenuation Equation to Yield Partial Corrections for Measurement Error--With Application to Sample Size Calculations

Peer reviewed

Direct link

Nicewander, W. Alan – Educational and Psychological Measurement, 2018

Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…

Descriptors: Error of Measurement, Correlation, Sample Size, Computation

Detecting and Treating Errors in Tests and Surveys

Peer reviewed

Direct link

von Davier, Matthias – Quality Assurance in Education: An International Perspective, 2018

Purpose: Surveys that include skill measures may suffer from additional sources of error compared to those containing questionnaires alone. Examples are distractions such as noise or interruptions of testing sessions, as well as fatigue or lack of motivation to succeed. This paper aims to provide a review of statistical tools based on latent…

Descriptors: Statistical Analysis, Surveys, International Assessment, Error Patterns

Investigating the Impact of Missing Data Handling Methods on the Detection of Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Selvi, Hüseyin; Özdemir Alici, Devrim – International Journal of Assessment Tools in Education, 2018

In this study, it is aimed to investigate the impact of different missing data handling methods on the detection of Differential Item Functioning methods (Mantel Haenszel and Standardization methods based on Classical Test Theory and Likelihood Ratio Test method based on Item Response Theory). In this regard, on the data acquired from 1046…

Descriptors: Test Bias, Test Theory, Item Response Theory, Multiple Choice Tests

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study, determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Determination of Differential Item Functioning (DIF) According to SIBTEST, Lord's [Chi-squared], Raju's Area Measurement and Breslow-Day Methods

Peer reviewed
PDF on ERIC

Download full text

Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019

The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…

Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Gender Fairness within the Force Concept Inventory

Peer reviewed

Direct link

Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca – Physical Review Physics Education Research, 2018

Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…

Descriptors: Physics, Science Instruction, Sex Fairness, Gender Differences

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

Group, Team, or Something in Between? Conceptualising and Measuring Team Entitativity

Peer reviewed
PDF on ERIC

Download full text

Vangrieken, Katrien; Boon, Anne; Dochy, Filip; Kyndt, Eva – Frontline Learning Research, 2017

The current gap between traditional team research and research focusing on non-strict teams or groups such as teacher teams hampers boundary-crossing investigations of and theorising on teamwork and collaboration. The main aim of this study includes bridging this gap by proposing a continuum-based team concept, describing the distinction between…

Descriptors: Teamwork, Teacher Researchers, Teacher Collaboration, Questionnaires

An Evaluation of the Psychometric Properties of Three Different Forms of Daly and Miller's Writing Apprehension Test through Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Güler, Nese; Ilhan, Mustafa; Güneyli, Ahmet; Demir, Süleyman – Educational Sciences: Theory and Practice, 2017

This study evaluates the psychometric properties of three different forms of the Writing Apprehension Test (WAT; Daly & Miller, 1975) through Rasch analysis. For this purpose, the fit statistics and correlation coefficients, and the reliability, separation ratio, and chi-square values for the facets of item and person calculated for the…

Descriptors: Writing Apprehension, Psychometrics, Item Response Theory, Tests

Students' Epistemologies about Experimental Physics: Validating the Colorado Learning Attitudes about Science Survey for Experimental Physics

Peer reviewed

Direct link

Wilcox, Bethany R.; Lewandowski, H. J. – Physical Review Physics Education Research, 2016

Student learning in instructional physics labs represents a growing area of research that includes investigations of students' beliefs and expectations about the nature of experimental physics. To directly probe students' epistemologies about experimental physics and support broader lab transformation efforts at the University of Colorado Boulder…

Descriptors: Physics, Epistemology, Surveys, Science Instruction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

ETS Research Report Series	5
Educational and Psychological…	5
Journal of Educational…	3
International Journal of…	2
Language Testing	2
Physical Review Physics…	2
Alberta Journal of…	1
Annenberg Institute for…	1
Chemistry Education Research…	1
Current Issues in Education	1
EURASIA Journal of…	1
Early Education and…	1
Edinburgh Working Papers in…	1
Educational Sciences: Theory…	1
Frontline Learning Research	1
International Journal of…	1
Journal of Emotional and…	1
Journal of Interactive Online…	1
Journal of Pedagogical…	1
Journal of Research in…	1
New Horizons in Education	1
Psychological Bulletin	1
Quality Assurance in…	1
School Psychology Quarterly	1
School Psychology Review	1
More ▼

Bormuth, John R.	2
Levine, Michael V.	2
Livingston, Samuel A.	2
Yen, Wendy M.	2
Agus, Mirian	1
Atar, Hakan Yavuz	1
Ayva Yörü, Fatma Gökçen	1
Balch, William R.	1
Belfry, M. Joan	1
Bernholt, S.	1
Bhatti, Muhammad Tariq	1
Bigras, Marc	1
Boon, Anne	1
Botton, Chris	1
Broussard, Rolland L.	1
Brown, Chris	1
Brown, James Dean	1
Buchanan, Aaron	1
Budescu, David	1
Buhr, Dianne C.	1
Cantwell, Emily D.	1
Cliff, Norman	1
Common, Eric Alan	1
Coniam, David	1
More ▼