ERIC - Search Results

Publication Date

In 2026	0
Since 2025	18
Since 2022 (last 5 years)	66
Since 2017 (last 10 years)	165
Since 2007 (last 20 years)	324

Descriptor

Test Length	639
Test Items	228
Item Response Theory	201
Test Construction	151
Sample Size	141
Test Reliability	133
Computer Assisted Testing	121
Test Validity	113
Simulation	107
Adaptive Testing	101
Comparative Analysis	99
Test Format	91
Scores	88
Error of Measurement	78
Foreign Countries	73
Statistical Analysis	72
Correlation	68
Item Analysis	65
Computation	62
Models	62
Accuracy	61
Higher Education	61
Difficulty Level	57
Testing Problems	54
Monte Carlo Methods	52

Education Level

Higher Education	50
Postsecondary Education	42
Secondary Education	23
Elementary Education	21
Middle Schools	12
High Schools	11
Elementary Secondary Education	10
Junior High Schools	9
Early Childhood Education	8
Primary Education	7
Grade 3	6
Intermediate Grades	6
Grade 6	5
Grade 8	5
Grade 2	3
Grade 4	3
Grade 5	3
Grade 7	3
Kindergarten	3
Grade 11	2
Grade 12	2
Grade 9	2
Grade 1	1
Grade 10	1
Preschool Education	1

Audience

Researchers	23
Practitioners	7
Administrators	2
Community	1
Students	1
Support Staff	1
Teachers	1

Location

Turkey	8
Australia	7
Canada	7
China	5
Netherlands	5
Japan	4
Taiwan	4
United Kingdom	4
Germany	3
Michigan	3
Singapore	3
South Korea	3
Ireland	2
New York	2
New Zealand	2
Pennsylvania	2
Peru	2
Alabama	1
Armenia	1
Asia	1
Brazil	1
California	1
Colombia	1
Florida	1
Ghana	1

Laws, Policies, & Programs

Americans with Disabilities…	1
Equal Access	1
Job Training Partnership Act…	1
Race to the Top	1
Rehabilitation Act 1973…	1

Showing 1 to 15 of 639 results

IRT Scoring and Recursion for Estimating Reliability and Other Accuracy Indices

Peer reviewed

Tim Moses; YoungKoung Kim – Journal of Educational Measurement, 2025

This study considers the estimation of marginal reliability and conditional accuracy measures using a generalized recursion procedure with several IRT-based ability and score estimators. The estimators include MLE, TCC, and EAP abilities, and corresponding test scores obtained with different weightings of the item scores. We consider reliability…

Descriptors: Item Response Theory, Scoring, Reliability, Accuracy

Toward Cross-Disciplinary Translation of the Testing Effect: A Systematic Replication

Peer reviewed

Kathryn R. Glodowski; Yusuke Hayashi – Journal of Applied Behavior Analysis, 2025

The testing effect is a well-established phenomenon in cognitive psychology that refers to enhanced long-term retention of information due to active recalling through testing. Following a cross-disciplinary translation of the testing effect into behavioral principles, we systematically replicated the previous findings in a behavior-analytic…

Descriptors: Testing, Replication (Evaluation), Tests, Test Length

Interpreting Scores on the Enhanced ACT: Guidance for K-12 and Higher Education Institutions. ACT State and Federal Policy

James Riddlesperger – ACT Education Corp., 2025

ACT announced a series of enhancements designed to modernize the ACT test and offer students more choice and flexibility in demonstrating their readiness for life after high school. The enhancements provide students more flexibility by allowing them to choose whether to take the science assessment, thereby reducing the test length by up to…

Descriptors: College Entrance Examinations, Testing, Change, Test Length

Assessing Model Fit of the Generalized Graded Unfolding Model

Peer reviewed

Abdulla Alzarouni; R. J. De Ayala – Practical Assessment, Research & Evaluation, 2025

The assessment of model fit in latent trait modeling is an integral part of correctly applying the model. Still, the assessment of model fit has been less utilized for ideal point models such as the Generalized Graded Unfolding Model (GGUM). The current study assesses the performance of the relative fit indices AIC and BIC,…

Descriptors: Goodness of Fit, Models, Statistical Analysis, Sample Size

Two-Phase Content-Balancing CD-CAT Online Item Calibration

Peer reviewed

Jing Huang; Yuxiao Zhang; Jason W. Morphew; Jayson M. Nissen; Ben Van Dusen; Hua Hua Chang – Journal of Educational Measurement, 2025

Online calibration estimates new item parameters alongside previously calibrated items, supporting efficient item replenishment. However, most existing online calibration procedures for Cognitive Diagnostic Computerized Adaptive Testing (CD-CAT) lack mechanisms to ensure content balance during live testing. This limitation can lead to uneven…

Descriptors: Adaptive Testing, Computer Assisted Testing, Cognitive Measurement, Test Items

Number of Response Categories and Sample Size Requirements in Polytomous IRT Models

Peer reviewed

Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024

Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…

Descriptors: Item Response Theory, Sample Size, Models, Classification

The NEAT Equating via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method

Peer reviewed

Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023

The part of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be treated as a planned missing data scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…

Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence

Item Reduction of the "Support Intensity Scale" for People with Intellectual Disabilities, Using Machine Learning

Peer reviewed

Félix González-Carrasco; Felipe Espinosa Parra; Izaskun Álvarez-Aguado; Sebastián Ponce Olguín; Vanessa Vega Córdova; Miguel Roselló-Peñaloza – British Journal of Learning Disabilities, 2025

Background: The study focuses on the need to optimise assessment scales for support needs in individuals with intellectual and developmental disabilities. Current scales are often lengthy and redundant, leading to exhaustion and response burden. The goal is to use machine learning techniques, specifically item-reduction methods and selection…

Descriptors: Artificial Intelligence, Intellectual Disability, Developmental Disabilities, Individual Needs

The Effect of Polytomous Item Ratio on Ability Estimation in Multistage Tests

Peer reviewed

Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025

The aim of the study is to examine the effect of polytomous item ratio on ability estimation in different conditions in multistage tests (MST) using mixed tests. The study is simulation-based research. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…

Descriptors: Test Items, Test Format, Accuracy, Test Length

Designing a Shorter Form of the Big Three Perfectionism Scale: An Application of Ant Colony Optimization

Peer reviewed

Kilmen, Sevilay – Journal of Psychoeducational Assessment, 2022

The present study has two main purposes. The first is to create a short form of the BTPS and to evaluate the psychometric properties of the short form. The second is to evaluate the performance of the ant colony optimization procedure and discuss the applicability of the ant colony optimization procedure in creating a short form. Results revealed…

Descriptors: Personality Measures, Test Length, Psychometrics, Undergraduate Students

Effect of Sample Length on MLU in Mandarin-Speaking Hard-of-Hearing Children

Peer reviewed

Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024

This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…

Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition

Spanish Validation of Hewitt's Multidimensional Perfectionism Scale-Short Form

Peer reviewed

María Vicent; Andrea Fuster; María Pérez-Marco; María del Pilar Aparicio-Flores – Journal of Psychoeducational Assessment, 2025

Although the original long version of the Hewitt Multidimensional Perfectionism Scale (HMPS) has been translated and validated in a Spanish population, no study to date has examined the psychometric properties of a short version of the HMPS with a Spanish-speaking sample. For this reason, the aim of this study is to analyze the psychometric…

Descriptors: Personality Measures, Personality Traits, Spanish, Psychometrics

Chained Computerized Adaptive Testing for the Force Concept Inventory

Peer reviewed

Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae; Kentaro Kojima – Physical Review Physics Education Research, 2025

Although conceptual assessment tests are commonly administered at the beginning and end of a semester, this pre-post approach has inherent limitations. Specifically, education researchers and instructors have limited ability to observe the progression of students' conceptual understanding throughout the course. Furthermore, instructors are limited…

Descriptors: Computer Assisted Testing, Adaptive Testing, Science Tests, Scientific Concepts

An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior

Peer reviewed

He, Yinhong – Journal of Educational Measurement, 2023

Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…

Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods

Evaluating Six Approaches to Handling Zero-Frequency Scores under Equipercentile Equating

Peer reviewed

Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021

In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…

Descriptors: Equated Scores, Test Length, Sample Size, Methods

Source

Educational and Psychological…	86
Applied Psychological…	45
Journal of Educational…	31
ProQuest LLC	28
Applied Measurement in…	21
ETS Research Report Series	15
Journal of Psychoeducational…	15
Psychological Assessment	12
International Journal of…	11
International Journal of…	11
Psychometrika	10
Measurement:…	9
Journal of Educational and…	7
Journal of Experimental…	6
Educational Sciences: Theory…	5
Journal of Speech, Language,…	5
Language Testing	5
Assessment	4
Educational Measurement:…	4
Grantee Submission	4
Physical Review Physics…	4
ACT Education Corp.	3
Eurasian Journal of…	3
Field Methods	3
Journal of Clinical Psychology	3

Author

Hambleton, Ronald K.	15
Wang, Wen-Chung	9
Livingston, Samuel A.	6
Sijtsma, Klaas	6
Wainer, Howard	6
Weiss, David J.	6
Wilcox, Rand R.	6
Cheng, Ying	5
Gessaroli, Marc E.	5
Lee, Won-Chan	5
Lewis, Charles	5
Reckase, Mark D.	5
Cohen, Allan S.	4
De Ayala, R. J.	4
Drasgow, Fritz	4
Huynh, Huynh	4
Kim, Seock-Ho	4
Meijer, Rob R.	4
Paek, Insu	4
Schumacker, Randall E.	4
Tay, Louis	4
Wang, Chun	4
Wells, Craig S.	4
Axelrod, Bradley N.	3

Publication Type

Reports - Research	424
Journal Articles	405
Reports - Evaluative	125
Speeches/Meeting Papers	92
Dissertations/Theses -…	28
Reports - Descriptive	22
Numerical/Quantitative Data	14
Tests/Questionnaires	12
Guides - Non-Classroom	11
Information Analyses	10
Opinion Papers	7
Reference Materials -…	2
Reports - General	2
Collected Works - General	1
Collected Works - Serials	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - General	1
Historical Materials	1

Assessments and Surveys

Test of English as a Foreign…	9
Wechsler Adult Intelligence…	9
SAT (College Admission Test)	8
Program for International…	6
Law School Admission Test	5
Minnesota Multiphasic…	5
Wechsler Intelligence Scale…	5
Graduate Record Examinations	4
Trends in International…	4
ACT Assessment	3
Iowa Tests of Basic Skills	3
Kaufman Brief Intelligence…	3
National Assessment of…	3
Advanced Placement…	2
Bem Sex Role Inventory	2
Comprehensive Tests of Basic…	2
MacArthur Communicative…	2
McCarthy Scales of Childrens…	2
Medical College Admission Test	2
Nelson Denny Reading Tests	2
Peabody Picture Vocabulary…	2
Self Description Questionnaire	2
Stanford Binet Intelligence…	2
Wechsler Intelligence Scales…	2
ACTFL Oral Proficiency…	1