Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022
Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…
Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring
Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019
The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…
Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics
LeBeau, Brandon; Assouline, Susan G.; Mahatmya, Duhita; Lupkowski-Shoplik, Ann – Gifted Child Quarterly, 2020
This study investigated the application of item response theory (IRT) to expand the range of ability estimates for gifted (hereinafter referred to as high-achieving) students' performance on an above-level test. Using a sample of fourth- to sixth-grade high-achieving students (N = 1,893), we conducted a study to compare estimates from two…
Descriptors: Item Response Theory, Test Theory, Academically Gifted, High Achievement

Divgi, D. R. – Applied Psychological Measurement, 1980
The dependence of reliability indices for mastery tests on mean and cutoff scores was examined in the case of three decision-theoretic indices. Dependence of kappa on mean and cutoff scores was opposite to that of the proportion of correct decisions, which was linearly related to average threshold loss. (Author/BW)
Descriptors: Classification, Cutting Scores, Mastery Tests, Test Reliability
Mapuranga, Raymond; Dorans, Neil J.; Middleton, Kyndra – ETS Research Report Series, 2008
In many practical settings, essentially the same differential item functioning (DIF) procedures have been in use since the late 1980s. Since then, examinee populations have become more heterogeneous, and tests have included more polytomously scored items. This paper summarizes and classifies new DIF methods and procedures that have appeared since…
Descriptors: Test Bias, Educational Development, Evaluation Methods, Statistical Analysis
Hoffman, R. Gene; Wise, Lauress L. – 2000
Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…
Descriptors: Achievement, Classification, Observation, Probability

Strein, William – Journal of School Psychology, 1990
Compared the Woodcock-Johnson Tests of Cognitive Ability (WJTCA) score profiles of different cultural groups, using 442 White and 435 non-White subjects drawn from the kindergarten through grade 12 subset of WJTCA standardization data. Determined that data allowed for classification of the subtests by both curve and cultural effects criteria.…
Descriptors: Classification, Cognitive Ability, Cognitive Measurement, Elementary School Students
van der Linden, Wim J. – 1987
The use of Bayesian decision theory to solve problems in test-based decision making is discussed. Four basic decision problems are distinguished: (1) selection; (2) mastery; (3) placement; and (4) classification, the situation where each treatment has its own criterion. Each type of decision can be identified as a specific configuration of one or…
Descriptors: Bayesian Statistics, Classification, Decision Making, Foreign Countries
van der Linden, Wim J. – 1985
This paper reviews recent research in the Netherlands on the application of decision theory to test-based decision making about personnel selection and student placement. The review is based on an earlier model proposed for the classification of decision problems, and emphasizes an empirical Bayesian framework. Classification decisions with…
Descriptors: Bayesian Statistics, Classification, Cutting Scores, Decision Making
Haladyna, Tom; Roid, Gale – 1980
The problems associated with misclassifying students when pass-fail decisions are based on test scores are discussed. One protection against misclassification is to set a confidence interval around the cutting score. Those whose scores fall above the interval are passed; those whose scores fall below the interval are failed; and those whose scores…
Descriptors: Bayesian Statistics, Classification, Comparative Analysis, Criterion Referenced Tests