ERIC - Search Results

Showing all 13 results

Classification Consistency and Results Reporting of a Digital-First Computer-Adaptive Language Proficiency Test

Cardwell, Ramsey Lee – ProQuest LLC, 2022

The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…

Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing

Attribute-Level Item Selection Method for DCM-CAT

Peer reviewed

Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…

Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing

Improving Measurement Precision of Hierarchical Latent Traits Using Adaptive Testing

Peer reviewed

Wang, Chun – Journal of Educational and Behavioral Statistics, 2014

Many latent traits in social sciences display a hierarchical structure, such as intelligence, cognitive ability, or personality. Usually a second-order factor is linearly related to a group of first-order factors (also called domain abilities in cognitive ability measures), and the first-order factors directly govern the actual item responses.…

Descriptors: Measurement, Accuracy, Item Response Theory, Adaptive Testing

Technical Report of the Survey of Adult Skills (PIAAC)

OECD Publishing, 2013

The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…

Descriptors: International Assessment, Adults, Skills, Test Construction

The Value of Item Response Theory in Clinical Assessment: A Review

Peer reviewed

Thomas, Michael L. – Assessment, 2011

Item response theory (IRT) and related latent variable models represent modern psychometric theory, the successor to classical test theory in psychological assessment. Although IRT has become prevalent in the measurement of ability and achievement, its contributions to clinical domains have been less extensive. Applications of IRT to clinical…

Descriptors: Item Response Theory, Psychological Evaluation, Reliability, Error of Measurement

Development and Application of Detection Indices for Measuring Guessing Behaviors and Test-Taking Effort in Computerized Adaptive Testing

Peer reviewed

Chang, Shu-Ren; Plake, Barbara S.; Kramer, Gene A.; Lien, Shu-Mei – Educational and Psychological Measurement, 2011

This study examined the amount of time that different ability-level examinees spend on questions they answer correctly or incorrectly across different pretest item blocks presented on a fixed-length, time-restricted computerized adaptive testing (CAT). Results indicate that different ability-level examinees require different amounts of time to…

Descriptors: Evidence, Test Items, Reaction Time, Adaptive Testing

A Monte Carlo Simulation Investigating the Validity and Reliability of Ability Estimation in Item Response Theory with Speeded Computer Adaptive Tests

Peer reviewed

Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010

Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…

Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing

Technical Adequacy and Cost Benefit of Four Measures of Early Literacy

Peer reviewed

McBride, James R.; Ysseldyke, Jim; Milone, Michael; Stickney, Eric – Canadian Journal of School Psychology, 2010

Technical adequacy and information/cost return were examined for four early reading measures: the Dynamic Indicators of Basic Early Literacy Skills (DIBELS), STAR Early Literacy (SEL), Group Reading Assessment and Diagnostic Evaluation (GRADE), and the Texas Primary Reading Inventory (TPRI). All four assessments were administered to the same…

Descriptors: Early Reading, Reading Achievement, Adaptive Testing, Phonemic Awareness

A "Rearrangement Procedure" for Scoring Adaptive Tests with Review Options

Peer reviewed

Papanastasiou, Elena C.; Reckase, Mark D. – International Journal of Testing, 2007

Because of the increased popularity of computerized adaptive testing (CAT), many admissions tests, as well as certification and licensure examinations, have been transformed from their paper-and-pencil versions to computerized adaptive versions. A major difference between paper-and-pencil tests and CAT from an examinee's point of view is that in…

Descriptors: Simulation, Adaptive Testing, Computer Assisted Testing, Test Items

The Effect of Using Item Parameters Calibrated from Paper Administrations in Computer Adaptive Test Administrations

Peer reviewed

Pommerich, Mary – Journal of Technology, Learning, and Assessment, 2007

Computer administered tests are becoming increasingly prevalent as computer technology becomes more readily available on a large scale. For testing programs that utilize both computer and paper administrations, mode effects are problematic in that they can result in examinee scores that are artificially inflated or deflated. As such, researchers…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Scores

Estimation of Reliability Coefficients Using the Test Information Function and Its Modifications

Peer reviewed

Samejima, Fumiko – Applied Psychological Measurement, 1994

The reliability coefficient is predicted from the test information function (TIF) or two modified TIF formulas and a specific trait distribution. Examples illustrate the variability of the reliability coefficient across different trait distributions, and results are compared with empirical reliability coefficients. (SLD)

Descriptors: Adaptive Testing, Error of Measurement, Estimation (Mathematics), Reliability

Some Reliability Estimates for Computerized Adaptive Tests

Peer reviewed

Nicewander, W. Alan; Thomasson, Gary L. – Applied Psychological Measurement, 1999

Derives three reliability estimates for the Bayes modal estimate (BME) and the maximum-likelihood estimate (MLE) of theta in computerized adaptive tests (CATs). Computes the three reliability estimates and the true reliabilities of both BME and MLE for seven simulated CATs. Results show the true reliabilities for BME and MLE to be nearly identical…

Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

The Rasch Model for Dichotomous Items: Theory, Applications and a Computer Program. No. 63.

Gustafsson, Jan-Eric – 1977

The Rasch model for test analysis is described and compared with two-parameter and three-parameter latent-trait models. Conditional maximum likelihood equations for estimating item parameters are derived, and estimates of person parameters are described together with their confidence intervals. Goodness of fit tests are discussed, including a…

Descriptors: Adaptive Testing, Computer Programs, Equated Scores, Error of Measurement
