Publication Date
In 2025 | 2 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 18 |
Since 2016 (last 10 years) | 48 |
Since 2006 (last 20 years) | 112 |
Descriptor
Classification | 267 |
Test Construction | 267 |
Test Items | 78 |
Test Validity | 57 |
Foreign Countries | 39 |
Evaluation Methods | 34 |
Models | 34 |
Test Reliability | 30 |
Scores | 25 |
Higher Education | 24 |
Multiple Choice Tests | 23 |
More ▼ |
Source
Author
Klausmeier, Herbert J. | 6 |
Haladyna, Thomas M. | 4 |
Bennett, Randy Elliot | 3 |
Downing, Steven M. | 3 |
Gierl, Mark J. | 3 |
Huff, Kristen | 3 |
Liu, Ren | 3 |
Sheehan, Kathleen M. | 3 |
Bradshaw, Laine | 2 |
Diamond, James J. | 2 |
Futagi, Yoko | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 4 |
Administrators | 3 |
Teachers | 3 |
Researchers | 2 |
Location
United States | 5 |
Australia | 4 |
United Kingdom | 4 |
Canada | 3 |
China | 3 |
Philippines | 3 |
Spain | 3 |
Brazil | 2 |
Florida | 2 |
Germany | 2 |
Japan | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024
Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) are abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…
Descriptors: Item Response Theory, Sample Size, Models, Classification
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Kuhaneswaran Banujan; Samantha Kumara; Senthan Prasanth; Nirubikaa Ravikumar – International Journal of Education and Development using Information and Communication Technology, 2023
Examinations are one way of evaluating students. To ensure the production of valid exams, frameworks such as Bloom's taxonomy are utilised when preparing questions. Bloom's taxonomy is a well-known framework that categorises educational objectives into six hierarchical levels of cognitive complexity. However, manually categorising exam questions…
Descriptors: Artificial Intelligence, Test Construction, Classification, Foreign Countries
Joanna Williamson – Cambridge University Press & Assessment, 2023
There is a lot of interest in providing detailed reports to schools indicating which skills pupils have mastered and which still need development -- and, more broadly, the knowledge, skills and understanding that pupils have acquired and not yet acquired. Cognitive diagnostic assessment is an approach designed to provide this kind of insight.…
Descriptors: Intelligence Tests, Diagnostic Tests, Test Construction, Mastery Learning
Ge, Yuan – ProQuest LLC, 2022
My dissertation research explored responder behaviors (e.g., demonstrating response styles, carelessness, and possessing misconceptions) that compromise psychometric quality and impact the interpretation and use of assessment results. Identifying these behaviors can help researchers understand and minimize their potentially construct-irrelevant…
Descriptors: Test Wiseness, Response Style (Tests), Item Response Theory, Psychometrics
Fombonne, Eric; MacFarlane, Heather; Salem, Alexandra C. – Journal of Autism and Developmental Disorders, 2021
Recent worldwide epidemiological surveys of autism conducted in 37 countries are reviewed; the median prevalence of autism is 0.97% in 26 high-income countries. Methodological advances and remaining challenges in designing and executing surveys are discussed, including the effects on prevalence of variable case definitions and nosography, of…
Descriptors: Epidemiology, Surveys, Autism, Pervasive Developmental Disorders
Cai, Liuhan; Albano, Anthony D.; Roussos, Louis A. – Measurement: Interdisciplinary Research and Perspectives, 2021
Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item…
Descriptors: Adaptive Testing, Test Items, Item Response Theory, Test Construction
Lovisa Alehagen; Sven Bölte; Melissa H Black – Autism: The International Journal of Research and Practice, 2025
The International Classification of Functioning, Disability, and Health is a biopsychosocial framework of health-related functioning designed to provide a unifying system for health care, social services, education, and policy sectors. Since its publication in 2001, the International Classification of Functioning has been used to guide clinical…
Descriptors: Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Classification, Functional Behavioral Assessment
Marzieh Haghayeghi; Ali Moghadamzadeh; Hamdollah Ravand; Mohamad Javadipour; Hossein Kareshki – Journal of Psychoeducational Assessment, 2025
This study aimed to address the need for a comprehensive assessment tool to evaluate the mathematical abilities of first-grade students through cognitive diagnostic assessment (CDA). The primary challenge involved in this endeavor was to delineate the specific cognitive skills and sub-skills pertinent to first-grade mathematics (FG-M) and to…
Descriptors: Test Construction, Cognitive Measurement, Check Lists, Mathematics Tests
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
Ketabi, Somaye; Alavi, Seyyed Mohammed; Ravand, Hamdollah – International Journal of Language Testing, 2021
Although Diagnostic Classification Models (DCMs) were introduced to education system decades ago, it seems that these models were not employed for the original aims upon which they had been designed. Using DCMs has been mostly common in analyzing large-scale non-diagnostic tests and these models have been rarely used in developing Cognitive…
Descriptors: Diagnostic Tests, Test Construction, Goodness of Fit, Classification
National Centre for Vocational Education Research (NCVER), 2022
"Apprentice and Trainee Outcomes 2021" provides a summary of the outcomes of apprentices and trainees who completed an apprenticeship or traineeship during 2020, with the data collected in mid-2021. The figures are derived from apprentices' and trainees' responses to the National Student Outcomes Survey (SOS), which is an annual survey…
Descriptors: Foreign Countries, Outcomes of Education, Apprenticeships, Trainees
Bradshaw, Laine; Levy, Roy – Educational Measurement: Issues and Practice, 2019
Although much research has been conducted on the psychometric properties of cognitive diagnostic models, they are only recently being used in operational settings to provide results to examinees and other stakeholders. Using this newer class of models in practice comes with a fresh challenge for diagnostic assessment developers: effectively…
Descriptors: Data Interpretation, Probability, Classification, Diagnostic Tests
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions