Showing 1 to 15 of 100 results
Peer reviewed; full-text PDF on ERIC
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Peer reviewed; full-text PDF on ERIC
Qiwei He – International Journal of Assessment Tools in Education, 2023
Collaborative problem solving (CPS) is inherently an interactive, conjoint, dual-strand process that considers how a student reasons about a problem as well as how s/he interacts with others to regulate social processes and exchange information (OECD, 2013). Measuring CPS skills presents a challenge for obtaining consistent, accurate, and reliable…
Descriptors: Cooperative Learning, Problem Solving, Test Items, International Assessment
Peer reviewed; full-text PDF on ERIC
Park, Jaesuk – International Educational Data Mining Society, 2021
We propose an adaptation of the Glicko-2 rating system in a K-12 math learning software setting, where variable time intervals between solution attempts and the stratification of student-item pairings by grade levels necessitate modification of the original model. The discrete-time stochastic process underlying the original system has been…
Descriptors: Academic Ability, Difficulty Level, Evaluation Methods, Elementary Secondary Education
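The core idea Park adapts — a rating whose uncertainty grows during gaps between attempts, so that long-inactive students move more per observation — can be illustrated with a simplified Glicko-style update. This is not the paper's adapted model; the constants (`c`, `rd_max`), the item uncertainty, and the single-observation form are all illustrative assumptions.

```python
import math

def update_rating(r, rd, item_b, item_rd, outcome, days_since_last,
                  c=5.0, rd_max=350.0):
    """One simplified Glicko-style update for a student-item attempt.
    r, rd        : student rating and rating deviation (uncertainty)
    item_b       : item difficulty on the same rating scale
    item_rd      : uncertainty of the item difficulty
    outcome      : 1 if answered correctly, 0 otherwise
    days_since_last : elapsed time since the student's previous attempt
    """
    q = math.log(10) / 400.0
    # Uncertainty inflates with inactivity (the variable-time-interval idea).
    rd = min(math.sqrt(rd**2 + c**2 * days_since_last), rd_max)
    # Discount factor for the item's own uncertainty.
    g = 1.0 / math.sqrt(1.0 + 3.0 * (q * item_rd)**2 / math.pi**2)
    # Expected probability of a correct response (logistic, Elo/Glicko form).
    e = 1.0 / (1.0 + 10 ** (-g * (r - item_b) / 400.0))
    d2 = 1.0 / (q**2 * g**2 * e * (1.0 - e))
    new_rd = math.sqrt(1.0 / (1.0 / rd**2 + 1.0 / d2))
    new_r = r + q * new_rd**2 * g * (outcome - e)
    return new_r, new_rd
```

A correct answer raises the rating, and each observation shrinks the (time-inflated) deviation, so recently active students have more stable ratings.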
Peer reviewed
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
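The basic Angoff aggregation the abstract presupposes can be sketched as follows: each judge estimates, per item, the probability that a minimally competent examinee answers correctly; a judge's implied cut is the sum of those probabilities, and judge-to-judge variability contributes standard error to the panel cut score. The ratings below are hypothetical, and the sketch ignores the panel nesting and item-variance subtleties the article analyzes.

```python
import statistics

# ratings[j][i]: judge j's estimated probability that a minimally
# competent examinee answers item i correctly (hypothetical data).
ratings = [
    [0.6, 0.7, 0.5, 0.8],
    [0.5, 0.8, 0.4, 0.7],
    [0.7, 0.6, 0.6, 0.9],
]

judge_cuts = [sum(r) for r in ratings]   # each judge's implied raw cut score
cut = statistics.mean(judge_cuts)        # panel cut score
# Error attributable to judge variability (judges treated as a random sample).
se = statistics.stdev(judge_cuts) / len(ratings) ** 0.5
```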
Peer reviewed
Allen, Jeff; Mattern, Krista – Educational Measurement: Issues and Practice, 2019
We examined summary indices of high school performance (coursework, grades, and test scores) based on the graded response model (GRM). The indices varied by inclusion of ACT test scores and whether high school courses were constrained to have the same difficulty and discrimination across groups of schools. The indices were examined with respect to…
Descriptors: High School Students, Academic Achievement, Secondary School Curriculum, Difficulty Level
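The graded response model underlying these indices treats an ordered outcome (e.g. a course grade) as governed by cumulative logistic curves; a category's probability is the difference of adjacent cumulative curves. A minimal sketch, with illustrative parameter values rather than anything estimated in the study:

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Graded response model category probabilities.
    theta      : student ability
    a          : item (course) discrimination
    thresholds : ordered boundaries b_1 < ... < b_K, where
                 P(X >= k) = logistic(a * (theta - b_k)).
    Returns P(X = k) for k = 0..K as differences of cumulative curves."""
    cum = ([1.0]
           + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
           + [0.0])
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]
```

Constraining courses to share `a` and `thresholds` across school groups, as the abstract describes, amounts to fitting one such parameter set rather than one per group.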
Peer reviewed
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Peer reviewed
Lee, Hee-Sun; Liu, Ou Lydia; Pallant, Amy; Roohr, Katrina Crotts; Pryputniewicz, Sarah; Buck, Zoë E. – Journal of Research in Science Teaching, 2014
Though addressing sources of uncertainty is an important part of doing science, it has largely been neglected in assessing students' scientific argumentation. In this study, we initially defined a scientific argumentation construct in four structural elements consisting of claim, justification, uncertainty qualifier, and uncertainty…
Descriptors: Persuasive Discourse, Student Evaluation, High School Students, Science Tests
Peer reviewed
Partchev, Ivailo; De Boeck, Paul; Steyer, Rolf – Assessment, 2013
An old issue in psychological assessment is to what extent power and speed each are measured by a given intelligence test. Starting from accuracy and response time data, an approach based on posterior time limits (cut-offs of recorded response time) leads to three kinds of recoded data: time data (whether or not the response precedes the cut-off),…
Descriptors: Psychological Testing, Intelligence Tests, Time, Item Response Theory
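The posterior-time-limit recoding the abstract describes can be sketched directly: a cut-off applied after the fact to recorded response times yields derived variables from each (accuracy, response time) pair. The abstract names only the first kind before truncating, so the other two codings below are one plausible scheme, not necessarily the authors' exact definitions.

```python
def recode(correct, rt, cutoff):
    """Recode one (accuracy, response-time) observation at a posterior
    time limit. Three derived variables (illustrative scheme):
      time     : did the response beat the cut-off?
      accuracy : correctness among in-time responses only (None if too slow)
      combined : correct AND in time (slow responses scored as incorrect)"""
    in_time = rt <= cutoff
    return {
        "time": int(in_time),
        "accuracy": int(correct) if in_time else None,
        "combined": int(correct and in_time),
    }
```

Varying `cutoff` from strict to lenient is what lets power and speed be separated: at a strict limit the "combined" score is dominated by speed, at a lenient one by power.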
Peer reviewed
Li, Feiming; Cohen, Allan; Shen, Linjun – Journal of Educational Measurement, 2012
Computer-based tests (CBTs) often use random ordering of items in order to minimize item exposure and reduce the potential for answer copying. Little research has been done, however, to examine item position effects for these tests. In this study, different versions of a Rasch model and different response time models were examined and applied to…
Descriptors: Computer Assisted Testing, Test Items, Item Response Theory, Models
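One simple way a position effect can enter a Rasch model — an item administered later behaves as if it were harder — is a linear shift in difficulty. This is an illustration of the general idea, not the specific model variants the study compares.

```python
import math

def rasch_with_position(theta, b, position, delta=0.0):
    """Rasch probability of a correct response with an optional linear
    position effect: for delta > 0, later serial positions make the item
    effectively harder. The linear form and delta are illustrative."""
    return 1.0 / (1.0 + math.exp(-(theta - (b + delta * position))))
```

With `delta = 0` this reduces to the standard Rasch model, so a test of position effects amounts to testing whether `delta` differs from zero.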
Peer reviewed
Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012
The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…
Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling
Peer reviewed
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
Peer reviewed
Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011
Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability
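A common engine for CCT of the kind this article discusses is Wald's sequential probability ratio test: accumulate the log-likelihood ratio of the responses under two fixed ability points bracketing the cut score, and stop as soon as it crosses an error-controlled bound. The sketch below uses Rasch response probabilities and illustrative choices for the bracketing points and error rates.

```python
import math

def rasch_p(theta, b):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def sprt_decision(responses, difficulties, theta_fail=-0.5, theta_pass=0.5,
                  alpha=0.05, beta=0.05):
    """Sequential probability ratio test for pass/fail classification.
    Compares the likelihood of the response string at two ability points
    bracketing the cut score; Wald's bounds control the error rates.
    theta_fail, theta_pass, alpha, beta are illustrative settings."""
    upper = math.log((1 - beta) / alpha)   # cross above -> classify "pass"
    lower = math.log(beta / (1 - alpha))   # cross below -> classify "fail"
    llr = 0.0
    for x, b in zip(responses, difficulties):
        p_hi, p_lo = rasch_p(theta_pass, b), rasch_p(theta_fail, b)
        llr += math.log(p_hi / p_lo) if x else math.log((1 - p_hi) / (1 - p_lo))
        if llr >= upper:
            return "pass"
        if llr <= lower:
            return "fail"
    return "continue"  # undecided: administer another item
```

Because the test stops as soon as a bound is crossed, consistent examinees are classified with far fewer items than a fixed-length test would need.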
Peer reviewed
Miller, G. Edward; Fitzpatrick, Steven J. – Educational and Psychological Measurement, 2009
Incorrect handling of item parameter drift during the equating process can result in equating error. If the item parameter drift is due to construct-irrelevant factors, then inclusion of these items in the estimation of the equating constants can be expected to result in equating error. On the other hand, if the item parameter drift is related to…
Descriptors: Equated Scores, Computation, Item Response Theory, Test Items
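The "equating constants" at issue are typically the slope and intercept of a linear link between item parameter scales. A minimal mean-sigma sketch (one common linking method, not necessarily the one the article studies); screening drifted items would simply mean dropping them from `b_old`/`b_new` before estimating the constants:

```python
import statistics

def mean_sigma_constants(b_old, b_new):
    """Mean-sigma linking: find A, B such that b_old is approximately
    A * b_new + B, matching the means and standard deviations of the
    common-item difficulty estimates on the two scales."""
    A = statistics.stdev(b_old) / statistics.stdev(b_new)
    B = statistics.mean(b_old) - A * statistics.mean(b_new)
    return A, B
```

Including drifted items distorts `A` and `B`, and hence every equated score, which is why the source of the drift (construct-irrelevant or not) matters for the include/exclude decision.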
Peer reviewed
Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012
Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…
Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education
Peer reviewed
Severo, Milton; Tavares, Maria A. Ferreira – Anatomical Sciences Education, 2010
The nature of anatomy education has changed substantially in recent decades, though the traditional multiple-choice written examination remains the cornerstone of assessing students' knowledge. This study sought to measure the quality of a clinical anatomy multiple-choice final examination using item response theory (IRT) models. One hundred…
Descriptors: Evaluation Methods, Anatomy, Item Response Theory, Medical Education