ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	15

Descriptor

Difficulty Level	15
Grade 4	15
Item Response Theory	15
Test Items	15
Grade 3	9
Elementary School Students	7
Grade 5	7
Mathematics Tests	7
Grade 6	6
Grade 7	6
Grade 8	6
Reading Tests	5
Achievement Tests	4
Test Bias	4
Error of Measurement	3
Foreign Countries	3
Mathematics Achievement	3
Pilot Projects	3
Public Schools	3
Reading Comprehension	3
Test Construction	3
Test Format	3
Test Theory	3
Test Validity	3
Computation	2
More ▼

Source

Behavioral Research and…	5
Grantee Submission	2
Applied Measurement in…	1
Educational Assessment	1
Educational Assessment,…	1
Educational Research and…	1
Educational and Psychological…	1
Journal of Educational and…	1
Large-scale Assessments in…	1
ProQuest LLC	1

Publication Type

Reports - Research	12
Journal Articles	7
Numerical/Quantitative Data	5
Dissertations/Theses -…	1
Reports - Descriptive	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Elementary Education	15
Grade 4	15
Grade 3	9
Intermediate Grades	8
Grade 5	7
Grade 6	6
Grade 7	6
Grade 8	6
Junior High Schools	6
Middle Schools	6
Secondary Education	6
Early Childhood Education	4
Primary Education	4
Elementary Secondary Education	3
Grade 1	1
Grade 2	1
Kindergarten	1
More ▼

Audience

Location

Austria	1
Belgium	1
California	1
Colorado	1
Florida	1
Germany	1
Illinois	1
Indiana	1
Luxembourg	1
New York	1
Wisconsin	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Progress in International…	2
Gates MacGinitie Reading Tests	1
Trends in International…	1
Wechsler Individual…	1
Wisconsin Knowledge and…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

IRTrees for Skipping Items in PIRLS

Peer reviewed

Direct link

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

Investigating Item Complexity as a Source of Cross-National DIF in TIMSS Math and Science

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024

Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…

Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity

Item Characteristic Curve Asymmetry: A Better Way to Accommodate Slips and Guesses than a Four-Parameter Model?

Peer reviewed

Direct link

Liao, Xiangyi; Bolt, Daniel M. – Journal of Educational and Behavioral Statistics, 2021

Four-parameter models have received increasing psychometric attention in recent years, as a reduced upper asymptote for item characteristic curves can be appealing for measurement applications such as adaptive testing and person-fit assessment. However, applications can be challenging due to the large number of parameters in the model. In this…

Descriptors: Test Items, Models, Mathematics Tests, Item Response Theory

Reader-Test Interactions: An Explanatory Item Response Study on Reading Comprehension

Direct link

Ping Wang – ProQuest LLC, 2021

According to the RAND model framework, reading comprehension test performance is influenced by readers' reading skills or reader characteristics, test properties, and their interactions. However, little empirical research has systematically compared the impacts of reader characteristics, test properties, and reader-test interactions across…

Descriptors: Reading Comprehension, Reading Tests, Reading Research, Test Items

Psychometric Report for the Early Fractions Test Administered with Third- and Fourth-Grade Students in Fall 2016. Research Report No. 2017-10

Download full text

Schoen, Robert C.; Liu, Sicong; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test is to serve as a student pretest covariate and a test of baseline equivalence in the larger study. In this report, we discuss our…

Descriptors: Mathematics Achievement, Fractions, Mathematics Tests, Grade 3

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Download full text

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

Effects of Item Parameter Drift on Vertical Scaling with the Nonequivalent Groups with Anchor Test (NEAT) Design

Peer reviewed

Direct link

Ye, Meng; Xin, Tao – Educational and Psychological Measurement, 2014

The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…

Descriptors: Scaling, Test Items, Equated Scores, Achievement Gains

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

The Development and Scaling of the easyCBM CCSS Elementary Mathematics Measures: Grade 4. Technical Report #1318

Download full text

Irvin, P. Shawn; Saven, Jessica L.; Alonzo, Julie; Park, Bitnara Jasmine; Anderson, Daniel; Tindal, Gerald – Behavioral Research and Teaching, 2012

The results of formative assessments are regularly used to inform important instructional decisions (e.g., targeted intervention) within a response to intervention (RTI) system of teaching and learning. The validity of such instructional decision-making depends, in part, on the alignment between formative measures and the academic content…

Descriptors: Elementary School Mathematics, Curriculum Based Assessment, Mathematics Tests, Academic Standards

How Do Different Versions of a Test Instrument Function in a Single Language? A DIF Analysis of the PIRLS 2006 German Assessments

Peer reviewed

Direct link

Stubbe, Tobias C. – Educational Research and Evaluation, 2011

The challenge inherent in cross-national research of providing instruments in different languages measuring the same construct is well known. But even instruments in a single language may be biased towards certain countries or regions due to local linguistic specificities. Consequently, it may be appropriate to use different versions of an…

Descriptors: Test Items, International Studies, Foreign Countries, German

Linguistic Complexity, Schematic Representations, and Differential Item Functioning for English Language Learners in Math Tests

Peer reviewed

Direct link

Martiniello, Maria – Educational Assessment, 2009

This article examines nonmathematical linguistic complexity as a source of differential item functioning (DIF) in math word problems for English language learners (ELLs). Specifically, this study investigates the relationship between item measures of linguistic complexity, nonlinguistic forms of representation and DIF measures based on item…

Descriptors: Mathematics Tests, Grade 4, Test Bias, Word Problems (Mathematics)

Instrument Development Procedures for Rapid Reading Rate Measures. Technical Report # 08-05

Download full text

Liu, Kimy; Carling, Kristy; Geller, Leanne Ketterlin; Tindal, Gerald – Behavioral Research and Teaching, 2008

In this study, we describe the development of rapid reading measures, sentences presented to students in a nearly subliminal manner, with a literal comprehension question asked following their removal. After administering alternate forms of these measures to students, we present the results from three statistical analyses to ascertain their…

Descriptors: Test Construction, Speed Reading, Reading Rate, Sentences

Instrument Development Procedures for Silent Reading Measures. Technical Report Number 08-03

Download full text

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop and gather validity evidence for silent reading fluency passages. A number of passages were written following a traditional story grammar structure (character, setting, events) and placed on a computer for students to read silently. We describe in detail, the manner in which content-related evidence was…

Descriptors: Silent Reading, Reading Fluency, Reading Tests, Test Validity

Instrument Development Procedures for Maze Measures. Technical Report # 08-06

Download full text

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…

Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests

The Development of Early Literacy Measures for Use in a Progress Monitoring Assessment System: Letter Names, Letter Sounds and Phoneme Segmenting. Technical Report # 39

Download full text

Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2007

In this technical report, the authors describe the development alternate forms of three types of early literacy measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fourth grade. They begin with a brief overview of the two conceptual frameworks underlying…

Descriptors: Emergent Literacy, Measures (Individuals), Naming, Alphabets

Tindal, Gerald	5
Liu, Kimy	3
Alonzo, Julie	2
Ketterlin-Geller, Leanne R.	2
Liu, Sicong	2
Paek, Insu	2
Schoen, Robert C.	2
Sundstrom-Hebert, Krystal	2
Yang, Xiaotong	2
Anderson, Daniel	1
Andrés Christiansen	1
Bolt, Daniel M.	1
Carling, Kristy	1
Cho, Hyun-Jeong	1
Daniel M. Bolt	1
Geller, Leanne Ketterlin	1
Irvin, P. Shawn	1
Kingston, Neal	1
Lee, Jaehoon	1
Liao, Xiangyi	1
Martiniello, Maria	1
Park, Bitnara Jasmine	1
Ping Wang	1
Qi Huang	1
Rianne Janssen	1
More ▼