Rios, Joseph – Applied Measurement in Education, 2022
To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…
Descriptors: Accuracy, Guessing (Tests), Scoring, Classification
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022
Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…
Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
Lai, Kelly Y. C.; Yuen, Emily C. W.; Hung, Se Fong; Leung, Patrick W. L. – Journal of Autism and Developmental Disorders, 2022
This study examines the psychometric properties of the Autism Diagnostic Interview-Revised (ADI-R) in the context of DSM-5 in a sample of Chinese children. Using re-mapped ADI-R items and algorithms matched to DSM-5 criteria, and administering to children with autism spectrum disorder (ASD) with and without intellectual disability,…
Descriptors: Autism, Pervasive Developmental Disorders, Diagnostic Tests, Observation
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
Wanxue Zhang; Lingling Meng; Bilan Liang – Interactive Learning Environments, 2023
With the continuous development of education, personalized learning has attracted great attention. How to evaluate students' learning effects has become increasingly important. In information technology courses, the traditional academic evaluation focuses on the student's learning outcomes, such as "scores" or "right/wrong,"…
Descriptors: Information Technology, Computer Science Education, High School Students, Scoring
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020
The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…
Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques
Demir, Ergul – Eurasian Journal of Educational Research, 2018
Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…
Descriptors: College Students, Cheating, Test Construction, Student Behavior
Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013
The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…
Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items
Sun, Jianan; Xin, Tao; Zhang, Shumei; de la Torre, Jimmy – Applied Psychological Measurement, 2013
This article proposes a generalized distance discriminating method for test with polytomous response (GDD-P). The new method is the polytomous extension of an item response theory (IRT)-based cognitive diagnostic method, which can identify examinees' ideal response patterns (IRPs) based on a generalized distance index. The similarities between…
Descriptors: Item Response Theory, Cognitive Tests, Diagnostic Tests, Matrices
Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012
This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…
Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring
Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016
The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…
Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability
Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model
Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009
For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…
Descriptors: Classification, Reliability, Test Items, Scoring
