ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	24

Descriptor

Measurement	25
Test Length	25
Test Items	13
Item Response Theory	11
Accuracy	10
Computer Assisted Testing	10
Models	8
Simulation	8
Adaptive Testing	7
Computation	7
Foreign Countries	7
Test Bias	6
Comparative Analysis	5
Correlation	5
Reliability	4
Ability	3
Educational Testing	3
Error of Measurement	3
Psychometrics	3
Test Validity	3
Bayesian Statistics	2
Decision Making	2
Evaluation Methods	2
Factor Structure	2
Item Analysis	2
More ▼

Source

Educational and Psychological…	5
Journal of Educational…	4
Applied Psychological…	3
ProQuest LLC	3
Universal Journal of…	2
Advanced Education	1
International Journal of…	1
Journal of Educational and…	1
Journal of Special Education…	1
Measurement:…	1
Pearson	1
Physical Review Physics…	1
Psychometrika	1
More ▼

Publication Type

Journal Articles	21
Reports - Research	16
Reports - Evaluative	4
Dissertations/Theses -…	3
Reports - Descriptive	2
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Postsecondary Education	3
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 3	1
Primary Education	1

Audience

Location

Taiwan	2
Germany	1
Japan	1
Thailand	1
Ukraine	1

Laws, Policies, & Programs

Assessments and Surveys

Force Concept Inventory

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior

Peer reviewed

Direct link

He, Yinhong – Journal of Educational Measurement, 2023

Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…

Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods

Differential Performance of Computerized Adaptive Testing in Students with and without Disabilities -- A Simulation Study

Peer reviewed

Direct link

Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024

Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…

Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

Improving Test Security and Efficiency of Computerized Adaptive Testing for the Force Concept Inventory

Peer reviewed

Direct link

Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022

This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) in regards to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management

How the Length and Characteristics of Routing Module Affect Ability Estimation in ca-MST?

Peer reviewed
PDF on ERIC

Download full text

Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019

In this study, how the length and characteristics of routing module in different panel designs affect measurement precision is examined. In the scope of the study, six different routing module length, nine different routing module characteristics, and two different panel design are handled. At the end of the study, the effects of conditions on…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format

Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019

This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…

Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory

Factor Structure and Measurement Invariance of the Self-Discipline Model Using the Different-Length Questionnaires: Application of Multiple Matrix Sampling

Peer reviewed
PDF on ERIC

Download full text

Damrongpanit, Suntonrapot – Universal Journal of Educational Research, 2019

The purposes of this study were to test the structural validity and to test the parameters invariance of the self-discipline measurement model for good student citizenship among the models, using the data from the 1,047 complete questionnaires and the reducing length questionnaires with multiple matrix sampling technique. The sample size of this…

Descriptors: Factor Structure, Questionnaires, Test Length, Citizenship

Examining the Impact of Differential Item Functioning on Growth Models

Direct link

Samonte, Kelli Marie – ProQuest LLC, 2017

Longitudinal data analysis assumes that scales meet the assumption of longitudinal measurement invariance (i.e., that scales function equivalently across measurement occasions). This simulation study examines the impact of violations to the assumption of longitudinal measurement invariance on growth models and whether modeling the invariance…

Descriptors: Test Bias, Growth Models, Longitudinal Studies, Simulation

Computer Adaptive Language Testing According to NATO STANAG 6001 Requirements

Peer reviewed
PDF on ERIC

Download full text

Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021

The article deals with the innovative, cutting age solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction

Mixture IRT Model with a Higher-Order Structure for Latent Traits

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2017

Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment

Peer reviewed

Direct link

Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016

The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…

Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Improving Measurement Precision of Hierarchical Latent Traits Using Adaptive Testing

Peer reviewed

Direct link

Wang, Chun – Journal of Educational and Behavioral Statistics, 2014

Many latent traits in social sciences display a hierarchical structure, such as intelligence, cognitive ability, or personality. Usually a second-order factor is linearly related to a group of first-order factors (also called domain abilities in cognitive ability measures), and the first-order factors directly govern the actual item responses.…

Descriptors: Measurement, Accuracy, Item Response Theory, Adaptive Testing

Deriving Stopping Rules for Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Chun; Chang, Hua-Hua; Boughton, Keith A. – Applied Psychological Measurement, 2013

Multidimensional computerized adaptive testing (MCAT) is able to provide a vector of ability estimates for each examinee, which could be used to provide a more informative profile of an examinee's performance. The current literature on MCAT focuses on the fixed-length tests, which can generate less accurate results for those examinees whose…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Item Banks

Test Length and Decision Quality in Personnel Selection: When Is Short Too Short?

Peer reviewed

Direct link

Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012

Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…

Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2

Wang, Chun	2
Boughton, Keith A.	1
Chang, Hua-Hua	1
Cheng, Ying-Yao	1
Chien, Yuehmei	1
Cui, Ying	1
Culpepper, Steven Andrew	1
Damrongpanit, Suntonrapot	1
Dodd, Barbara G.	1
Eggen, Theo J. H. M.	1
Ellis, Jules L.	1
Embretson, Susan E.	1
Emons, Wilco H. M.	1
Galindo, Jennifer L.	1
Gawliczek, Piotr	1
Geisinger, Kurt F.	1
He, Yinhong	1
Ho, Yi-Hui	1
Huang, Hung-Yu	1
Hull, Michael M.	1
Kieftenbeld, Vincent	1
Kim, Jihye	1
Kruyen, Peter M.	1
Krykun, Viktoriia	1
Lee, HyeSun	1
More ▼