Showing 1 to 15 of 624 results
Peer reviewed
Direct link
Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024
Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…
Descriptors: Item Response Theory, Sample Size, Models, Classification
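The entry above concerns parameter recovery for polytomous IRT models under different numbers of response categories and sample sizes. As a purely illustrative sketch (not the authors' design), the following Python snippet simulates responses from Samejima's graded response model, the kind of data-generation step such a recovery study typically begins with; all parameter distributions and values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def simulate_grm(n_persons, n_items, n_categories):
    """Simulate polytomous responses from a graded response model (illustrative values)."""
    theta = rng.normal(0, 1, n_persons)                      # latent traits
    a = rng.uniform(0.8, 2.0, n_items)                       # discriminations
    # ordered category thresholds for each item
    b = np.sort(rng.normal(0, 1, (n_items, n_categories - 1)), axis=1)

    data = np.zeros((n_persons, n_items), dtype=int)
    for j in range(n_items):
        # P(X >= k) for k = 1..K-1 under the GRM
        p_star = 1 / (1 + np.exp(-a[j] * (theta[:, None] - b[j])))
        # category probabilities as adjacent differences of the cumulative curves
        upper = np.hstack([np.ones((n_persons, 1)), p_star])
        lower = np.hstack([p_star, np.zeros((n_persons, 1))])
        probs = upper - lower
        cum = probs.cumsum(axis=1)
        u = rng.uniform(size=(n_persons, 1))
        data[:, j] = np.minimum((u > cum).sum(axis=1), n_categories - 1)
    return data

responses = simulate_grm(n_persons=500, n_items=10, n_categories=5)
```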
Peer reviewed
Direct link
Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023
The portion of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be treated as a planned missing scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…
Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence
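The chaining random forests (CRF) technique named above is specific to that paper. As a hedged illustration of the general idea, treating the unadministered form in a NEAT design as planned-missing data and imputing it with random forests, the sketch below uses scikit-learn's IterativeImputer with a RandomForestRegressor; the toy data layout and settings are assumptions, not the authors' implementation.

```python
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Toy NEAT-like layout: group P took form X plus the anchor, group Q took form Y plus the anchor.
n_p, n_q, n_x, n_y, n_a = 150, 150, 10, 10, 5
abil_p = rng.normal(0.0, 1.0, n_p)
abil_q = rng.normal(0.3, 1.0, n_q)          # nonequivalent groups
diff_x = rng.normal(0, 1, n_x)
diff_y = rng.normal(0, 1, n_y)
diff_a = rng.normal(0, 1, n_a)

def respond(ability, difficulty):
    """Rasch-like dichotomous responses (illustrative only)."""
    p = 1 / (1 + np.exp(-(ability[:, None] - difficulty)))
    return (rng.uniform(size=p.shape) < p).astype(float)

X_p, A_p = respond(abil_p, diff_x), respond(abil_p, diff_a)
Y_q, A_q = respond(abil_q, diff_y), respond(abil_q, diff_a)

# Assemble one matrix with the unadministered form left missing by design.
top = np.hstack([X_p, A_p, np.full((n_p, n_y), np.nan)])
bottom = np.hstack([np.full((n_q, n_x), np.nan), A_q, Y_q])
data = np.vstack([top, bottom])

# Random-forest-based chained imputation of the planned-missing blocks.
imputer = IterativeImputer(estimator=RandomForestRegressor(n_estimators=50),
                           max_iter=5, random_state=0)
completed = imputer.fit_transform(data)
```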
Peer reviewed
Direct link
Félix González-Carrasco; Felipe Espinosa Parra; Izaskun Álvarez-Aguado; Sebastián Ponce Olguín; Vanessa Vega Córdova; Miguel Roselló-Peñaloza – British Journal of Learning Disabilities, 2025
Background: The study focuses on the need to optimise assessment scales for support needs in individuals with intellectual and developmental disabilities. Current scales are often lengthy and redundant, leading to exhaustion and response burden. The goal is to use machine learning techniques, specifically item-reduction methods and selection…
Descriptors: Artificial Intelligence, Intellectual Disability, Developmental Disabilities, Individual Needs
Peer reviewed
Direct link
Kilmen, Sevilay – Journal of Psychoeducational Assessment, 2022
The present study has two main purposes. The first is to create a short form of the BTPS and to evaluate the psychometric properties of the short form. The second is to evaluate the performance of the ant colony optimization procedure and to discuss its applicability in creating a short form. Results revealed…
Descriptors: Personality Measures, Test Length, Psychometrics, Undergraduate Students
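Ant colony optimization for short-form construction, as in the entry above, essentially lets pheromone weights guide repeated sampling of item subsets, reinforcing items that appear in well-performing subsets. The following is a minimal, simplified sketch with Cronbach's alpha as the selection criterion; the objective, parameters, and data are illustrative assumptions rather than the procedure used in the study.

```python
import numpy as np

rng = np.random.default_rng(42)

def cronbach_alpha(items):
    """Cronbach's alpha for an (n_persons, n_items) score matrix."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars / total_var)

def aco_short_form(data, n_select, n_ants=30, n_iter=50, evaporation=0.9):
    """Very simplified ant-colony item selection: pheromone weights guide which
    items each 'ant' samples; the best subset found reinforces its items."""
    n_items = data.shape[1]
    pheromone = np.ones(n_items)
    best_subset, best_score = None, -np.inf
    for _ in range(n_iter):
        for _ in range(n_ants):
            probs = pheromone / pheromone.sum()
            subset = rng.choice(n_items, size=n_select, replace=False, p=probs)
            score = cronbach_alpha(data[:, subset])
            if score > best_score:
                best_subset, best_score = subset, score
        pheromone *= evaporation               # evaporation step
        pheromone[best_subset] += best_score   # reinforce the best items found so far
    return np.sort(best_subset), best_score

# Toy data: 300 respondents, 20 items loading on one factor; pick a 10-item short form.
theta = rng.normal(size=(300, 1))
data = theta + rng.normal(size=(300, 20))
subset, alpha = aco_short_form(data, n_select=10)
```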
Peer reviewed
Direct link
Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024
This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…
Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition
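Mean length of utterance (MLU) itself is a simple ratio; the sketch below computes it over progressively larger utterance samples, approximating morphemes by word tokens, to illustrate the stability question the study above examines. The example utterances are invented.

```python
def mean_length_of_utterance(utterances):
    """MLU: average number of morphemes per utterance (word tokens used as a rough proxy here)."""
    lengths = [len(u.split()) for u in utterances]
    return sum(lengths) / len(lengths)

# Illustrative only: watch how the MLU estimate changes as more utterances are included.
sample = ["mama ball", "want more juice", "doggie go outside now", "no"]
for n in range(1, len(sample) + 1):
    print(n, round(mean_length_of_utterance(sample[:n]), 2))
```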
Peer reviewed
Direct link
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is a commonly observed careless response behavior. Accurately detecting BRR behavior can improve test validity. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
Peer reviewed
Direct link
Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021
In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…
Descriptors: Equated Scores, Test Length, Sample Size, Methods
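Equipercentile equating, as referenced above, maps each form-X score to the form-Y score holding the same percentile rank. The sketch below shows one textbook formulation with simple linear interpolation, including a zero-frequency score on form Y to illustrate the problem the study addresses; the six methods evaluated in the study are not reproduced here, and the frequency data are invented.

```python
import numpy as np

def percentile_ranks(freqs):
    """Percentile rank at each integer score: cumulative % below plus half the % at the score."""
    probs = freqs / freqs.sum()
    cum_below = np.concatenate([[0.0], np.cumsum(probs)[:-1]])
    return 100 * (cum_below + probs / 2)

def equipercentile_equate(freq_x, freq_y):
    """Map each form-X score to the form-Y score with the same percentile rank
    (linear interpolation over the form-Y percentile-rank function)."""
    pr_x = percentile_ranks(freq_x)
    pr_y = percentile_ranks(freq_y)
    scores_y = np.arange(len(freq_y), dtype=float)
    return np.interp(pr_x, pr_y, scores_y)

# Toy frequency distributions over scores 0..10; note the zero-frequency score on form Y,
# the situation discussed in the entry above.
freq_x = np.array([2, 5, 9, 14, 20, 22, 15, 8, 3, 1, 1], dtype=float)
freq_y = np.array([1, 4, 8, 15, 0, 25, 18, 14, 9, 4, 2], dtype=float)
equated = equipercentile_equate(freq_x, freq_y)
```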
Peer reviewed
PDF on ERIC
Ebru Dogruöz; Hülya Kelecioglu – International Journal of Assessment Tools in Education, 2024
In this research, multistage adaptive tests (MST) were compared according to sample size, panel pattern and module length for top-down and bottom-up test assembly methods. Within the scope of the research, data from PISA 2015 were used and simulation studies were conducted according to the parameters estimated from these data. Analysis results for…
Descriptors: Adaptive Testing, Test Construction, Foreign Countries, Achievement Tests
Peer reviewed
PDF on ERIC
Handan Narin Kiziltan; Hatice Cigdem Bulut – International Journal of Assessment Tools in Education, 2024
Mental imagery is a vital cognitive skill that significantly influences how reality is perceived while creating art. Its multifaceted nature reveals various dimensions of creative expression, amplifying the inherent complexities of measuring it. This study aimed to shorten the Mental Imagery Scale in Artistic Creativity (MISAC) via the Ant Colony…
Descriptors: Foreign Countries, Undergraduate Students, Art Education, Imagery
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, test lengths, and numbers and locations of polytomous items. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Peer reviewed
Direct link
Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…
Descriptors: True Scores, Equated Scores, Test Items, Sample Size
Peer reviewed
Direct link
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Peer reviewed
Direct link
Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021
The accuracy of certain internal consistency estimators has been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors and varying…
Descriptors: Reliability, Computation, Accuracy, Sample Size
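For context on the estimators compared above, the sketch below computes just two of them, Cronbach's alpha and a one-factor McDonald's omega, on simulated congeneric data; the loadings, sample size, and use of scikit-learn's FactorAnalysis are illustrative assumptions, not the study's design.

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(7)

# Congeneric one-factor data with unequal loadings (illustrative values only).
n, loadings = 1000, np.array([0.4, 0.5, 0.6, 0.7, 0.8])
factor = rng.normal(size=(n, 1))
errors = rng.normal(size=(n, loadings.size)) * np.sqrt(1 - loadings**2)
X = factor * loadings + errors

# Cronbach's alpha from the observed covariance matrix.
k = X.shape[1]
cov = np.cov(X, rowvar=False)
alpha = k / (k - 1) * (1 - np.trace(cov) / cov.sum())

# McDonald's omega (total) from a one-factor fit: loadings and uniquenesses.
fa = FactorAnalysis(n_components=1).fit(X)
lam = np.abs(fa.components_.ravel())
omega = lam.sum() ** 2 / (lam.sum() ** 2 + fa.noise_variance_.sum())

print(round(alpha, 3), round(omega, 3))
```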
Peer reviewed
Direct link
Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021
Subscore added value analyses assume invariance across test-taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…
Descriptors: Scores, Test Length, Ability, Correlation