Publication Date
In 2025 (0)
Since 2024 (1)
Since 2021, last 5 years (2)
Since 2016, last 10 years (8)
Since 2006, last 20 years (50)
Descriptor
Item Response Theory (81)
Testing Programs (81)
Test Construction (27)
Test Items (25)
Equated Scores (19)
Achievement Tests (18)
State Programs (18)
Mathematics Tests (15)
Psychometrics (15)
Models (13)
Scaling (13)
Author
Alonzo, Julie (4)
Irvin, P. Shawn (4)
Lai, Cheng-Fei (4)
Park, Bitnara Jasmine (4)
Tindal, Gerald (4)
Wyse, Adam E. (4)
van der Linden, Wim J. (3)
Albano, Anthony D. (2)
Baghi, Heibatollah (2)
Cai, Li (2)
Fan, Xitao (2)
Education Level
Elementary Education (12)
Elementary Secondary Education (11)
Grade 4 (10)
Grade 6 (10)
Grade 8 (9)
Secondary Education (9)
Grade 3 (8)
Grade 5 (8)
Grade 7 (7)
Intermediate Grades (7)
Junior High Schools (7)
Location
New York (5)
Tunisia (2)
Australia (1)
Azerbaijan (1)
Botswana (1)
Canada (1)
China (Shanghai) (1)
Finland (1)
Florida (1)
Greece (1)
Hawaii (1)
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
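The Albano, French, and Vo abstract contrasts traditional (single grouping variable) and intersectional (crossed grouping variables) DIF analyses. As an illustrative sketch only — the paper's actual method and data are not shown here — a Mantel-Haenszel common odds ratio can be computed for either kind of focal group; the function names and grouping variables below are hypothetical:

```python
from collections import defaultdict

def mantel_haenszel_or(responses, focal, scores):
    """Mantel-Haenszel common odds ratio for one dichotomous item,
    stratifying examinees by total test score. An odds ratio far
    from 1 flags potential DIF against the focal group."""
    strata = defaultdict(lambda: [[0, 0], [0, 0]])  # score -> 2x2 table
    for resp, f, s in zip(responses, focal, scores):
        strata[s][f][resp] += 1  # rows: reference/focal, cols: wrong/right
    num = den = 0.0
    for (r_wrong, r_right), (f_wrong, f_right) in strata.values():
        n = r_wrong + r_right + f_wrong + f_right
        if n:
            num += r_right * f_wrong / n  # reference correct, focal wrong
            den += r_wrong * f_right / n  # reference wrong, focal correct
    return num / den if den else float("nan")

# A traditional analysis flags the focal group with one variable; an
# intersectional analysis crosses two (e.g., gender x language status).
def intersectional_focal(gender, lang, target=("f", "ell")):
    return [int((g, l) == target) for g, l in zip(gender, lang)]
```

Running the same statistic once per intersectional subgroup is what lets DIF that is masked under a single grouping variable become visible.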
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis
Wyse, Adam E.; Babcock, Ben – Educational and Psychological Measurement, 2016
Continuously administered examination programs, particularly credentialing programs that require graduation from educational programs, often experience seasonality where distributions of examinee ability may differ over time. Such seasonality may affect the quality of important statistical processes, such as item response theory (IRT) item…
Descriptors: Test Items, Item Response Theory, Computation, Licensing Examinations (Professions)
Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016
The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…
Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs
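One standard way to place IRT difficulty estimates from different administrations on a common scale, as the Keller et al. abstract describes, is a linear transformation of the new form's parameters. A minimal sketch of mean-sigma linking, assuming 2PL-style difficulty (b) estimates for a set of anchor items appearing on both forms:

```python
import statistics

def mean_sigma_link(b_base, b_new):
    """Mean-sigma linking constants (A, B) that place new-form
    difficulty estimates onto the base-form scale: b* = A*b_new + B.
    Both lists hold estimates for the same anchor items."""
    A = statistics.stdev(b_base) / statistics.stdev(b_new)
    B = statistics.mean(b_base) - A * statistics.mean(b_new)
    return A, B

def to_base_scale(b_new, A, B):
    """Apply the linking constants to new-form difficulties."""
    return [A * b + B for b in b_new]
```

Once A and B are estimated from the anchors, every new-form parameter (and hence every score) can be transformed, which is what makes scores from different administrations directly comparable.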
Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
It is a well-known problem in testing the fit of models to multinomial data that the full underlying contingency table will inevitably be sparse for tests of reasonable length and for realistic sample sizes. Under such conditions, full-information test statistics such as Pearson's X² and the likelihood ratio statistic G²…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
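The two full-information statistics named in the Hansen et al. abstract are simple sums over cells of the contingency table; the sparseness problem the abstract raises is that with many cells these sums no longer follow their nominal chi-square reference distribution. A minimal sketch of both statistics over observed and model-expected cell counts:

```python
import math

def pearson_x2(observed, expected):
    """Pearson's X^2: sum of (O - E)^2 / E over contingency-table cells."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def likelihood_ratio_g2(observed, expected):
    """Likelihood ratio statistic G^2 = 2 * sum O * ln(O / E);
    empty observed cells contribute zero to the sum."""
    return 2.0 * sum(o * math.log(o / e)
                     for o, e in zip(observed, expected) if o > 0)
```

For a test of I dichotomous items the full table has 2^I cells, so at realistic sample sizes most cells are empty — the motivation for the limited-information alternatives the CRESST report develops.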
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016
Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015
Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…
Descriptors: Item Response Theory, Test Format, Language Usage, Test Items
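The conventional first step the Wang, Chen, and Jin abstract refers to — and argues may be insufficient on its own — is reverse recoding of negatively worded items before scoring. A sketch of that step (function names illustrative):

```python
def reverse_code(response, n_points):
    """Reverse-code one Likert response on a 1..n_points scale,
    e.g. 5 -> 1 and 1 -> 5 on a five-point scale."""
    return n_points + 1 - response

def recode_scale(responses, negative_items, n_points=5):
    """Recode only the negatively worded items (0-based indices) in
    one examinee's response vector; positive items pass through."""
    return [reverse_code(r, n_points) if i in negative_items else r
            for i, r in enumerate(responses)]
```

The abstract's point is that even after this recoding, negatively worded items can carry a shared method factor, which is why the authors turn to bi-factor IRT models rather than recoding alone.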
Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014
In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Programme for International Student Assessment…
Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015
This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs
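The CAT designs that the Wyse and Albano abstract examines rest on a common selection rule: administer the unexposed item carrying maximum Fisher information at the current ability estimate. A minimal 2PL sketch of that rule, not the paper's actual implementation:

```python
import math

def p_2pl(theta, a, b):
    """2PL probability of a correct response at ability theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def fisher_info(theta, a, b):
    """Item information under the 2PL: a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def select_next_item(theta_hat, pool, administered):
    """Pick the unadministered item with maximum information at the
    current ability estimate; pool is a list of (a, b) tuples."""
    candidates = [i for i in range(len(pool)) if i not in administered]
    return max(candidates, key=lambda i: fisher_info(theta_hat, *pool[i]))
```

Mixing general and modified items in one pool implicitly assumes both item types are calibrated on, and informative about, the same ability scale — the assumption the study finds may not hold for some student groups.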
Wyse, Adam E.; Reckase, Mark D. – Applied Psychological Measurement, 2011
An essential concern in the application of any equating procedure is determining whether tests can be considered equated after the tests have been placed onto a common scale. This article clarifies one equating criterion, the first-order equity property of equating, and develops a new method for evaluating equating that is linked to this…
Descriptors: Lawyers, Licensing Examinations (Professions), Testing Programs, Graphs
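First-order equity, the criterion the Wyse and Reckase abstract clarifies, requires that at every ability level an examinee expect the same score regardless of which form is taken. One simple way to inspect this — a sketch under the 2PL, not the authors' method — is to compare test characteristic curves across an ability grid:

```python
import math

def expected_score(theta, form):
    """Test characteristic curve: expected number-correct score at
    ability theta; form is a list of 2PL (a, b) item parameters."""
    return sum(1.0 / (1.0 + math.exp(-a * (theta - b))) for a, b in form)

def max_equity_gap(form_x, form_y, thetas):
    """Largest absolute difference in expected scores over an ability
    grid; values near zero suggest first-order equity roughly holds."""
    return max(abs(expected_score(t, form_x) - expected_score(t, form_y))
               for t in thetas)
```

A gap that is small at some abilities but large at others indicates the forms are equated on average yet inequitable for particular examinees, which is the distinction the first-order equity property captures.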
Keller, Lisa A.; Keller, Robert R. – Educational and Psychological Measurement, 2011
This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…
Descriptors: Item Response Theory, Scaling, Sustainability, Classification