ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	8

Descriptor

Test Theory	10
Models	4
Multiple Choice Tests	3
Test Items	3
Classification	2
Computation	2
Maximum Likelihood Statistics	2
National Competency Tests	2
Psychometrics	2
Responses	2
Sampling	2
Scores	2
Statistical Analysis	2
Test Bias	2
Test Reliability	2
Accuracy	1
Bias	1
Cheating	1
Cognitive Measurement	1
College Entrance Examinations	1
Comparative Analysis	1
Democracy	1
Diagnostic Tests	1
Educational Assessment	1
Educational Policy	1
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	10
Reports - Descriptive	4
Reports - Research	4
Reports - Evaluative	2

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Sweden	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Armed Services Vocational…	1
Program for International…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Latent Trait Item Response Models for Continuous Responses

Peer reviewed

Direct link

Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024

A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…

Descriptors: Item Response Theory, Responses, Scores, Models

Modeling Partial Knowledge in Multiple-Choice Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Kentaro Fukushima; Nao Uchida; Kensuke Okada – Journal of Educational and Behavioral Statistics, 2025

Diagnostic tests are typically administered in a multiple-choice (MC) format due to their advantages of objectivity and time efficiency. The MC-deterministic input, noisy "and" gate (DINA) family of models, a representative class of cognitive diagnostic models for MC items, efficiently and parsimoniously estimates the mastery profiles of…

Descriptors: Diagnostic Tests, Cognitive Measurement, Multiple Choice Tests, Educational Assessment

Expertise on Offer: Why Isn't Anyone Buying?

Peer reviewed

Direct link

Braun, Henry – Journal of Educational and Behavioral Statistics, 2023

It is a much-lamented fact that research with the potential to inform or influence education policy instead remains policy inert. There are many reasons for this frustrating state of affairs, including a lack of strategic thinking on the part of researchers on how to successfully accomplish outreach--as opposed to communication with peers…

Descriptors: Educational Policy, Educational Research, Educational Researchers, Persuasive Discourse

A Cognitive Diagnosis Model for Continuous Response

Peer reviewed

Direct link

Minchen, Nathan D.; de la Torre, Jimmy; Liu, Ying – Journal of Educational and Behavioral Statistics, 2017

Nondichotomous response models have been of greater interest in recent years due to the increasing use of different scoring methods and various performance measures. As an important alternative to dichotomous scoring, the use of continuous response formats has been found in the literature. To assess finer-grained skills or attributes and to…

Descriptors: Models, Psychometrics, Test Theory, Maximum Likelihood Statistics

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Sampling Variability and Axioms of Classical Test Theory

Peer reviewed

Direct link

Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 2011

Many well-known equations in classical test theory are mathematical identities in populations of individuals but not in random samples from those populations. First, test scores are subject to the same sampling error that is familiar in statistical estimation and hypothesis testing. Second, the assumptions made in derivation of formulas in test…

Descriptors: Test Theory, Equations (Mathematics), Scores, Sampling

Toward a Coherent View of Reliability in Test Theory.

Peer reviewed

Li, Heng; Wainer, Howard – Journal of Educational and Behavioral Statistics, 1997

Provides a general mathematical framework is provided that can be specialized to four different reliability coefficients. Consideration of this general framework makes it easier to convey to students the individual character of the formulations of reliability and the extent of their underlying similarity. (SLD)

Descriptors: Mathematical Models, Reliability, Teaching Methods, Test Theory

Detecting Answer Copying when the Regular Response Process Follows a Known Response Model

Peer reviewed

Direct link

van der Linden, Wim J.; Sotaridona, Leonardo – Journal of Educational and Behavioral Statistics, 2006

A statistical test for detecting answer copying on multiple-choice items is presented. The test is based on the exact null distribution of the number of random matches between two test takers under the assumption that the response process follows a known response model. The null distribution can easily be generalized to the family of distributions…

Descriptors: Test Items, Multiple Choice Tests, Cheating, Responses

Test Equating from Biased Samples, with Application to the Armed Services Vocational Aptitude Battery.

Peer reviewed

Little, Roderick J. A.; Rubin, Donald B. – Journal of Educational and Behavioral Statistics, 1994

Equating a new standard test to an old reference test is considered when samples for equating are not randomly selected from the target population of test takers, identifying two problems from equating from biased samples. An empirical example with data from the Armed Services Vocational Aptitude Battery illustrates the approach. (SLD)

Descriptors: Equated Scores, Military Personnel, Sampling, Statistical Analysis

Braun, Henry	1
Gerhard Tutz	1
Kensuke Okada	1
Kentaro Fukushima	1
Li, Heng	1
Little, Roderick J. A.	1
Liu, Ying	1
Longford, Nicholas T.	1
Minchen, Nathan D.	1
Nao Uchida	1
Pascal Jordan	1
Ramsay, James O.	1
Rubin, Donald B.	1
Sotaridona, Leonardo	1
Wainer, Howard	1
Wiberg, Marie	1
Zimmerman, Donald W.	1
de la Torre, Jimmy	1
van der Linden, Wim J.	1
More ▼