Showing 1 to 15 of 107 results
Peer reviewed
Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024
Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty frequently rely on human judgments or shallow metrics, often failing to accurately capture the intricate cognitive demands of answering a question. This…
Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension
Peer reviewed
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Peer reviewed
Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021
Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…
Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests
Peer reviewed
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Peer reviewed
Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022
A C-Test is a gap-filling test for measuring language competence in the first and second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…
Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests
Peer reviewed
Ferrara, Steve; Steedle, Jeffrey T.; Frantz, Roger S. – Applied Measurement in Education, 2022
Item difficulty modeling studies involve (a) hypothesizing item features, or item response demands, that are likely to predict item difficulty with some degree of accuracy; and (b) entering the features as independent variables into a regression equation or other statistical model to predict difficulty. In this review, we report findings from 13…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Item Response Theory
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2024
Analyzing heterogeneous treatment effects (HTE) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and pre-intervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
He, Wei – NWEA, 2021
New MAP® Growth™ assessments are being developed that administer items more closely matched to the grade level of the student. However, MAP Growth items are calibrated with samples that typically consist of students from a variety of grades, including the target grade to which an item is aligned. While this choice of calibration sample is…
Descriptors: Achievement Tests, Test Items, Instructional Program Divisions, Difficulty Level
Peer reviewed
Huu Thanh Minh Nguyen; Nguyen Van Anh Le – TESL-EJ, 2024
Comparing language tests and test preparation materials holds important implications for the latter's validity and reliability. However, not enough studies compare such materials across a wide range of indices. Therefore, this study investigated the text complexity of IELTS academic reading tests (IRT) and IELTS reading practice tests (IRPrT).…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Readability
Peer reviewed
Budi Waluyo; Ali Zahabi; Luksika Ruangsung – rEFLections, 2024
The increasing popularity of the Common European Framework of Reference (CEFR) in non-native English-speaking countries has generated a demand for concrete examples in the creation of CEFR-based tests that assess the four main English skills. In response, this research endeavors to provide insight into the development and validation of a…
Descriptors: Language Tests, Language Proficiency, Undergraduate Students, Language Skills
He, Wei – NWEA, 2022
To ensure that student academic growth in a subject area is accurately captured, it is imperative that the underlying scale remains stable over time. As item parameter stability constitutes one of the factors that affects scale stability, NWEA® periodically conducts studies to check for the stability of the item parameter estimates for MAP®…
Descriptors: Achievement Tests, Test Items, Test Reliability, Academic Achievement
Peer reviewed
Monika Grotek; Agnieszka Slezak-Swiat – Reading in a Foreign Language, 2024
The study investigates the effect of the perception of text and task difficulty on adults' performance in reading tests in L1 and L2. The relationship between the following variables is studied: (a) readers' perception of text and task difficulty in L1 and L2 measured in a self-reported post-task questionnaire, (b) the number of correct answers to…
Descriptors: Difficulty Level, Second Language Learning, Eye Movements, Task Analysis
Peer reviewed
Cifci, Musa; Kaplan, Kadir – Turkish Online Journal of Educational Technology - TOJET, 2020
An achievement test was prepared to determine students' caricature reading skills. In the first draft of the achievement test, 32 test items were prepared, each with four answer choices. An item analysis of the data obtained from the pre-application was conducted, and the internal consistency coefficient (KR-20) was calculated as 0.67 for the…
Descriptors: Reading Tests, Achievement Tests, Reading Skills, Literary Devices
Peer reviewed
Becker, Anthony; Nekrasova-Beker, Tatiana – Educational Assessment, 2018
While previous research has identified numerous factors that contribute to item difficulty, studies involving large-scale reading tests have provided mixed results. This study examined five selected-response item types used to measure reading comprehension in the Pearson Test of English Academic: a) multiple-choice (choose one answer), b)…
Descriptors: Reading Comprehension, Test Items, Reading Tests, Test Format
Peer reviewed
Geramipour, Masoud – Language Testing in Asia, 2021
Rasch testlet and bifactor models are two measurement models that could deal with local item dependency (LID) in assessing the dimensionality of reading comprehension testlets. This study aimed to apply the measurement models to real item response data of the Iranian EFL reading comprehension tests and compare the validity of the bifactor models…
Descriptors: Foreign Countries, Second Language Learning, English (Second Language), Reading Tests