ERIC - Search Results

Showing 1 to 15 of 29 results

An Examination of Individual Ability Estimation and Classification Accuracy under Rapid Guessing Misidentifications

Peer reviewed

Rios, Joseph – Applied Measurement in Education, 2022

To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…

Descriptors: Accuracy, Guessing (Tests), Scoring, Classification

The Impact of Scoring Later on Mixed Format Adaptive Testing

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Polytomous Testlet Response Models for Technology-Enhanced Innovative Items: Implications on Model Fit and Trait Inference

Peer reviewed

Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022

The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…

Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring

Autism Diagnostic Interview-Revised within DSM-5 Framework: Test of Reliability and Validity in Chinese Children

Peer reviewed

Lai, Kelly Y. C.; Yuen, Emily C. W.; Hung, Se Fong; Leung, Patrick W. L. – Journal of Autism and Developmental Disorders, 2022

This study examines the psychometric properties of the Autism Diagnostic Interview-Revised (ADI-R) in the context of DSM-5 in a sample of Chinese children. Using re-mapped ADI-R items and algorithms matched to DSM-5 criteria, and administering to children with autism spectrum disorder (ASD) with and without intellectual disability,…

Descriptors: Autism, Pervasive Developmental Disorders, Diagnostic Tests, Observation

Evaluating the Effectiveness of the Expectation-Maximization (EM) Algorithm for Bayesian Network Calibration

Tingir, Seyfullah – ProQuest LLC, 2019

Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…

Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability

EW-KNN: Evaluating Information Technology Courses in High School with a Non-Parametric Cognitive Diagnosis Method

Peer reviewed

Wanxue Zhang; Lingling Meng; Bilan Liang – Interactive Learning Environments, 2023

With the continuous development of education, personalized learning has attracted great attention. How to evaluate students' learning effects has become increasingly important. In information technology courses, the traditional academic evaluation focuses on the student's learning outcomes, such as "scores" or "right/wrong,"…

Descriptors: Information Technology, Computer Science Education, High School Students, Scoring

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Validation Trajectory for the Washington Assessment of Risks and Needs of Students

Peer reviewed

Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020

The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…

Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

Peer reviewed

Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013

The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items

A Polytomous Extension of the Generalized Distance Discriminating Method

Peer reviewed

Sun, Jianan; Xin, Tao; Zhang, Shumei; de la Torre, Jimmy – Applied Psychological Measurement, 2013

This article proposes a generalized distance discriminating method for tests with polytomous responses (GDD-P). The new method is the polytomous extension of an item response theory (IRT)-based cognitive diagnostic method, which can identify examinees' ideal response patterns (IRPs) based on a generalized distance index. The similarities between…

Descriptors: Item Response Theory, Cognitive Tests, Diagnostic Tests, Matrices

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

Test Review: Kaufman, A. S., & Kaufman, N. L. (2014), "Kaufman Test of Educational Achievement, Third Edition." Bloomington, MN: NCS Pearson

Peer reviewed

Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016

The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…

Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability

Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model

Peer reviewed

Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009

For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…

Descriptors: Classification, Reliability, Test Items, Scoring
