ERIC - Search Results

Showing 1 to 15 of 30 results

Reliability and Validity Evidence of Diagnostic Methods: Comparison of Diagnostic Classification Models and Item Response Theory-Based Methods

Yoo Jeong Jang – ProQuest LLC, 2022

Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…

Descriptors: Classification, Accuracy, Item Response Theory, Correlation

Autism Diagnostic Interview-Revised within DSM-5 Framework: Test of Reliability and Validity in Chinese Children

Peer reviewed

Lai, Kelly Y. C.; Yuen, Emily C. W.; Hung, Se Fong; Leung, Patrick W. L. – Journal of Autism and Developmental Disorders, 2022

This study examines the psychometric properties of the Autism Diagnostic Interview-Revised (ADI-R) in the context of DSM-5 in a sample of Chinese children. Using re-mapped ADI-R items and algorithms matched to DSM-5 criteria, and administering to children with autism spectrum disorder (ASD) with and without intellectual disability,…

Descriptors: Autism, Pervasive Developmental Disorders, Diagnostic Tests, Observation

Examining Psychometric Properties and Level Classification of the van Hiele Geometry Test Using CTT and CDM Frameworks

Peer reviewed

Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019

The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…

Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics

Investigating the Classification Accuracy of Rasch and Nominal Weights Mean Equating with Very Small Samples

Peer reviewed

Furter, Robert T.; Dwyer, Andrew C. – Applied Measurement in Education, 2020

Maintaining equivalent performance standards across forms is a psychometric challenge exacerbated by small samples. In this study, the accuracy of two equating methods (Rasch anchored calibration and nominal weights mean) and four anchor item selection methods were investigated in the context of very small samples (N = 10). Overall, nominal…

Descriptors: Classification, Accuracy, Item Response Theory, Equated Scores

Exploring Confidence Accuracy and Item Difficulty in Changing Multiple-Choice Answers of Scientific Reasoning Test

Peer reviewed

Fadillah, Sarah Meilani; Ha, Minsu; Nuraeni, Eni; Indriyanti, Nurma Yunita – Malaysian Journal of Learning and Instruction, 2023

Purpose: Researchers discovered that when students were given the opportunity to change their answers, a majority changed their responses from incorrect to correct, and this change often increased the overall test results. What prompts students to modify their answers? This study aims to examine the modification of scientific reasoning test, with…

Descriptors: Science Tests, Multiple Choice Tests, Test Items, Decision Making

Applying Psychometric Modeling to Aid Feature Engineering in Predictive Log-Data Analytics: The NAEP EDM Competition

Peer reviewed

Zehner, Fabian; Eichmann, Beate; Deribo, Tobias; Harrison, Scott; Bengs, Daniel; Andersen, Nico; Hahnel, Carolin – Journal of Educational Data Mining, 2021

The NAEP EDM Competition required participants to predict efficient test-taking behavior based on log data. This paper describes our top-down approach for engineering features by means of psychometric modeling, aiming at machine learning for the predictive classification task. For feature engineering, we employed, among others, the Log-Normal…

Descriptors: National Competency Tests, Engineering Education, Data Collection, Data Analysis

Computer-Based Assessment of Mathematics into the Twenty-First Century: Pressures and Tensions

Peer reviewed

Hoogland, Kees; Tout, Dave – ZDM: The International Journal on Mathematics Education, 2018

In recent decades, technology has influenced various aspects of assessment in mathematics education: (1) supporting the assessment of higher-order thinking skills in mathematics, (2) representing authentic problems from the world around us to use and apply mathematical knowledge and skills, and (3) making the delivery of tests and the analysis of…

Descriptors: Computer Assisted Testing, At Risk Persons, Mathematics Education, Thinking Skills

Developing a Learning Progression for Number Sense Based on the Rule Space Model in China

Peer reviewed

Chen, Fu; Yan, Yue; Xin, Tao – Educational Psychology, 2017

The current study focuses on developing the learning progression of number sense for primary school students, and it applies a cognitive diagnostic model, the rule space model, to data analysis. The rule space model analysis firstly extracted nine cognitive attributes and their hierarchy model from the analysis of previous research and the…

Descriptors: Numeracy, Learning Processes, Elementary School Students, Foreign Countries

Adapting the Autistic Behavioural Indicators Instrument (ABII) as a Parent Questionnaire (ABII-PQ)

Peer reviewed

Ward, Samantha L.; Sullivan, Karen A.; Gilmore, Linda – Journal of Intellectual & Developmental Disability, 2017

Background: Both parent-report and clinician-administered autism spectrum disorder (ASD) screening instruments are important to accurately inform ASD risk ascertainment. The aim of this study was to adapt a clinician-administered ASD screening instrument, the Autistic Behavioural Indicators Instrument (ABII), as a parent questionnaire equivalent…

Descriptors: Foreign Countries, Autism, Diagnostic Tests, Observation

Applying the Rule Space Model to Develop a Learning Progression for Thermochemistry

Peer reviewed

Chen, Fu; Zhang, Shanshan; Guo, Yanfang; Xin, Tao – Research in Science Education, 2017

We used the Rule Space Model, a cognitive diagnostic model, to measure the learning progression for thermochemistry for senior high school students. We extracted five attributes and proposed their hierarchical relationships to model the construct of thermochemistry at four levels using a hypothesized learning progression. For this study, we…

Descriptors: Chemistry, High School Students, Secondary School Science, Correlation

Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

Peer reviewed

Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013

The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items

Content-Rich versus Content Deficient Video-Based Visuals in L2 Academic Listening Tests: Pilot Study

Peer reviewed

Lesnov, Roman Olegovich – International Journal of Computer-Assisted Language Learning and Teaching, 2018

This article compares second language test-takers' performance on an academic listening test in an audio-only mode versus an audio-video mode. A new method of classifying video-based visuals was developed and piloted, which used L2 expert opinions to place the video on a continuum from being content-deficient (not helpful for answering…

Descriptors: Second Language Learning, Second Language Instruction, Video Technology, Classification

Using a Model of Analysts' Judgments to Augment an Item Calibration Process

Peer reviewed

Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015

When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…

Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making

Physics Assessment and the Development of a Taxonomy

Peer reviewed

Buick, J. M. – European Journal of Physics Education, 2011

Aspects of assessment in physics are considered with the aim of designing assessments that will encourage a deep approach to student learning and will ultimately lead to higher levels of achievement. A range of physics questions are considered and categorized by the level of knowledge and understanding which is required for a successful answer.…

Descriptors: Physics, Taxonomy, Science Achievement, Knowledge Level

A Multilevel Testlet Model for Dual Local Dependence

Peer reviewed

Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012

The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…

Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods
