Publication Date
In 2025: 5
Since 2024: 15
Since 2021 (last 5 years): 52
Since 2016 (last 10 years): 131
Since 2006 (last 20 years): 586
Education Level
Higher Education: 102
Elementary Secondary Education: 83
Secondary Education: 71
Elementary Education: 61
Postsecondary Education: 48
Grade 8: 39
High Schools: 29
Middle Schools: 29
Grade 5: 23
Grade 4: 22
Grade 7: 18
Audience
Practitioners: 18
Teachers: 13
Researchers: 12
Administrators: 3
Policymakers: 3
Community: 2
Location
United States: 19
Canada: 16
Netherlands: 16
Australia: 15
California: 14
Oregon: 13
Taiwan: 13
New York: 12
China: 9
United Kingdom: 9
Texas: 8
Laws, Policies, & Programs
Individuals with Disabilities…: 11
No Child Left Behind Act 2001: 11
Elementary and Secondary…: 2
Race to the Top: 2
Education for All Handicapped…: 1
Kentucky Education Reform Act…: 1
Lau v Nichols: 1
What Works Clearinghouse Rating
Meets WWC Standards without Reservations: 1
Meets WWC Standards with or without Reservations: 1
Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023
When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…
Descriptors: Test Construction, Automation, Models, Test Items
Randall, Jennifer – Educational Assessment, 2023
In a justice-oriented antiracist assessment process, attention to the disruption of white supremacy must occur at every stage--from construct articulation to score reporting. An important step in the assessment development process is the item review stage often referred to as Bias/Fairness and Sensitivity Review. I argue that typical approaches to…
Descriptors: Social Justice, Racism, Test Bias, Test Items
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. This definition contrasts with Lord's foundational paper, which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Michelle Cheong – Journal of Computer Assisted Learning, 2025
Background: Increasingly, students are using ChatGPT to assist them in learning and even in completing their assessments, raising concerns about academic integrity and the loss of critical thinking skills. Many articles have suggested that educators redesign assessments to be more 'Generative-AI-resistant' and focus on assessing students on higher order…
Descriptors: Artificial Intelligence, Performance Based Assessment, Spreadsheets, Models
Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024
Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by skewed scaling, where there is either no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…
Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods
Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024
I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…
Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education
Welzel, Christian; Brunkert, Lennart; Kruse, Stefan; Inglehart, Ronald F. – Sociological Methods & Research, 2023
Scholars study representative international surveys to understand cross-cultural differences in mentality patterns, which are measured via complex multi-item constructs. Methodologists in this field insist with increasing vigor that detecting "non-invariance" in how a construct's items associate with each other in different national…
Descriptors: Cross Cultural Studies, Social Science Research, Factor Analysis, Measurement Techniques
Sharma, Harsh; Mathur, Rohan; Chintala, Tejas; Dhanalakshmi, Samiappan; Senthil, Ramalingam – Education and Information Technologies, 2023
Examination assessments undertaken by educational institutions are pivotal, as they are among the fundamental steps in determining students' understanding and achievement in a distinct subject or course. Questions must be framed on the topics to meet the learning objectives and assess the student's capability in a particular subject. The…
Descriptors: Taxonomy, Student Evaluation, Test Items, Questioning Techniques
Semere Kiros Bitew; Amir Hadifar; Lucas Sterckx; Johannes Deleu; Chris Develder; Thomas Demeester – IEEE Transactions on Learning Technologies, 2024
Multiple-choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, owing to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Test Construction, Test Items
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to advances in technology and its flexibility. Online examinations measure students' knowledge and skills. Traditional question papers suffer from inconsistent difficulty levels, arbitrary question allocation, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Stemler, Steven E.; Naples, Adam – Practical Assessment, Research & Evaluation, 2021
When students receive the same score on a test, does that mean they know the same amount about the topic? The answer to this question is more complex than it may first appear. This paper compares classical and modern test theories in terms of how they estimate student ability. Crucial distinctions between the aims of Rasch Measurement and IRT are…
Descriptors: Item Response Theory, Test Theory, Ability, Computation
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimations of Item Response Theory under maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
Qiao, Chen; Hu, Xiao – IEEE Transactions on Learning Technologies, 2023
Free text answers to short questions can reflect students' mastery of concepts and their relationships relevant to learning objectives. However, automating the assessment of free text answers has been challenging due to the complexity of natural language. Existing studies often predict the scores of free text answers in a "black box"…
Descriptors: Computer Assisted Testing, Automation, Test Items, Semantics
Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023
In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…
Descriptors: Test Theory, Test Validity, True Scores, Definitions