Publication Date
In 2025 | 0
Since 2024 | 5
Since 2021 (last 5 years) | 43
Since 2016 (last 10 years) | 102
Since 2006 (last 20 years) | 159
Audience
Researchers | 151
Practitioners | 20
Teachers | 14
Administrators | 2
Counselors | 2
Policymakers | 1
Students | 1
Location
Australia | 18
Canada | 11
Netherlands | 10
Turkey | 8
United States | 8
Germany | 6
Israel | 6
Texas | 4
United Kingdom (England) | 4
Virginia | 4
California | 3
Laws, Policies, & Programs
Comprehensive Education… | 2
Elementary and Secondary… | 2
No Child Left Behind Act 2001 | 1

Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024
Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty frequently rely on human judgments or shallow metrics, often failing to accurately capture the intricate cognitive demands of answering a question. This…
Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension
Kaldes, Gal; Tighe, Elizabeth; He, Qiwei – AERA Online Paper Repository, 2023
This study used PIAAC process data to examine time allocation patterns (time to first action, total time, time of last action) of low-skilled adults, relative to higher-skilled adults, on digital literacy items. Results suggest that less-skilled (Level 2) and higher-skilled (Levels 3-5) adults exhibited similar time allocation patterns; however,…
Descriptors: Time Management, Literacy Education, Adult Literacy, Adult Education
Olney, Andrew M. – Grantee Submission, 2022
Multi-angle question answering models have recently been proposed that promise to perform related tasks like question generation. However, performance on related tasks has not been thoroughly studied. We investigate a leading model called Macaw on the task of multiple choice question generation and evaluate its performance on three angles that…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Models
Plackner, Christie; Kim, Dong-In – Online Submission, 2022
The application of item response theory (IRT) is almost universal in the development, implementation, and maintenance of large-scale assessments. Therefore, establishing the fit of IRT models to data is essential, as the viability of calibration and equating implementations depends on it. In a typical test administration situation, measurement…
Descriptors: COVID-19, Pandemics, Item Response Theory, Goodness of Fit
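The model-data fit check this abstract calls essential can be illustrated with a toy residual analysis; a minimal sketch under a Rasch model with simulated data (not the paper's method):

```python
import numpy as np

def rasch_p(theta, b):
    """Rasch model probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def item_fit_table(thetas, responses, b, n_groups=5):
    """Crude fit check for one item: bin examinees by ability and compare
    observed vs. model-expected proportion correct in each bin."""
    order = np.argsort(thetas)
    for group in np.array_split(order, n_groups):
        observed = responses[group].mean()
        expected = rasch_p(thetas[group], b).mean()
        print(f"obs={observed:.2f}  exp={expected:.2f}  resid={observed - expected:+.2f}")

rng = np.random.default_rng(1)
thetas = rng.normal(0.0, 1.0, 2000)
b_item = 0.3
responses = (rng.random(2000) < rasch_p(thetas, b_item)).astype(float)
item_fit_table(thetas, responses, b_item)  # residuals near zero -> good fit
```

Because the responses are generated from the same Rasch model being checked, the residuals hover near zero; systematic residual patterns in real data would signal misfit.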
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
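Masters' Partial Credit Model, the model this study calibrates, has a closed form for category probabilities; a minimal sketch with illustrative step difficulties (not values from the BDI-3 standardization):

```python
import numpy as np

def pcm_probs(theta, deltas):
    """Masters' Partial Credit Model: P(score = k | theta) for k = 0..m.

    theta  : examinee ability (logits)
    deltas : step difficulties delta_1..delta_m for one polytomous item
    """
    # Cumulative sums of (theta - delta_j); the score-0 "empty sum" is 0.
    steps = np.concatenate(([0.0], np.cumsum(theta - np.asarray(deltas))))
    expo = np.exp(steps - steps.max())  # subtract max for numerical stability
    return expo / expo.sum()

# A 0-3 point item with three illustrative step difficulties:
print(pcm_probs(theta=0.5, deltas=[-1.0, 0.0, 1.2]))
```

Each step difficulty governs the transition between adjacent score categories, which is why precision of these estimates depends on having enough examinees in every category.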
Jahangeer Mohamed Jahabar; Toh Tin Lam; Tay Eng Guan; Tong Cherng Luen – Mathematics Education Research Group of Australasia, 2024
Big Ideas can be seen as overarching concepts that occur in various mathematical topics and strands within a syllabus. Within our project on Big Ideas in School Mathematics, we developed instruments to measure two Big Ideas: Equivalence and Proportionality. These instruments seek to assess students' ability to see these Big Ideas as…
Descriptors: Mathematical Concepts, Mathematics Tests, Test Items, Test Construction
Condor, Aubrey; Litster, Max; Pardos, Zachary – International Educational Data Mining Society, 2021
We explore how different components of an Automatic Short Answer Grading (ASAG) model affect the model's ability to generalize to questions outside of those used for training. For supervised automatic grading models, human ratings are primarily used as ground-truth labels. Producing such ratings can be resource-intensive, as subject matter experts…
Descriptors: Automation, Grading, Test Items, Generalization
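The generalization question this abstract raises can be made concrete by holding out whole questions rather than individual responses; a minimal sketch with a TF-IDF + logistic-regression stand-in for an ASAG model and invented toy data (not the paper's model or dataset):

```python
# Hold out an entire question (q3) so the score measures transfer to
# unseen items, not memorization of seen ones.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

data = [  # (question_id, student_answer, human_score 0/1) -- invented examples
    ("q1", "photosynthesis converts light to chemical energy", 1),
    ("q1", "plants eat sunlight", 0),
    ("q2", "mitochondria produce ATP for the cell", 1),
    ("q2", "it is the control center", 0),
    ("q3", "osmosis moves water across a membrane", 1),
    ("q3", "osmosis is when cells divide", 0),
]
train = [d for d in data if d[0] != "q3"]  # train on q1 and q2 only
test = [d for d in data if d[0] == "q3"]   # evaluate on the unseen q3

model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit([d[1] for d in train], [d[2] for d in train])
print(model.score([d[1] for d in test], [d[2] for d in test]))
```

Splitting by question ID rather than by response is what separates "grading new answers to known questions" from the harder cross-question generalization the study examines.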
Andrew M. Olney – Grantee Submission, 2023
Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully controlled…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms
Hanif Akhtar – International Society for Technology, Education, and Science, 2023
For efficiency, a Computerized Adaptive Test (CAT) algorithm selects items with the maximum information, typically with a 50% probability of being answered correctly. However, examinees may not be satisfied if they answer only 50% of the items correctly. Researchers discovered that changing the item selection algorithm to choose easier items (i.e.,…
Descriptors: Success, Probability, Computer Assisted Testing, Adaptive Testing
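The item-selection trade-off described here is easy to sketch under a 2PL model; `target_p` below is a hypothetical knob standing in for the easier-item rules the abstract mentions:

```python
import numpy as np

def p_correct(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a ** 2 * p * (1.0 - p)

def select_item(theta, a, b, administered, target_p=None):
    """Pick the next item from the pool.

    target_p=None -> classic maximum-information selection (success
                     probability near 50% for equal discriminations).
    target_p=0.7  -> choose the item whose predicted success probability
                     is closest to 70%, an 'easier items' rule of the kind
                     the abstract describes (hypothetical parameterisation).
    """
    free = np.where(~administered)[0]
    if target_p is None:
        return free[np.argmax(item_information(theta, a[free], b[free]))]
    return free[np.argmin(np.abs(p_correct(theta, a[free], b[free]) - target_p))]

rng = np.random.default_rng(0)
a, b = rng.uniform(0.8, 2.0, 50), rng.normal(0.0, 1.0, 50)
used = np.zeros(50, dtype=bool)
print(select_item(0.0, a, b, used))                # max-information pick
print(select_item(0.0, a, b, used, target_p=0.7))  # easier-item pick
```

The easier-item rule trades some measurement efficiency per item for a higher examinee success rate, which is exactly the tension the study investigates.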
Zur, Amir; Applebaum, Isaac; Nardo, Jocelyn Elizabeth; DeWeese, Dory; Sundrani, Sameer; Salehi, Shima – International Educational Data Mining Society, 2023
Detailed learning objectives foster an effective and equitable learning environment by clarifying what instructors expect students to learn, rather than requiring students to use prior knowledge to infer these expectations. When questions are labeled with relevant learning goals, students understand which skills are tested by those questions.…
Descriptors: Equal Education, Prior Learning, Educational Objectives, Chemistry
Chioma C. Ezeh – AERA Online Paper Repository, 2023
Culturally relevant assessments (CRA) account for multiple socio-cultural identities, experiences, and values that mediate how students know, think, and respond to test items. Given the diversity of modern classrooms, it is critical that education researchers and practitioners understand and strive to implement CRA practices. This systematic…
Descriptors: Educational Practices, Culturally Relevant Education, Culture Fair Tests, Classroom Techniques
Michelle Cheung; Bronwyn Reid O’Connor; Ben Zunica – Mathematics Education Research Group of Australasia, 2024
Progressing from additive to multiplicative thinking is a key outcome of school mathematics, making ratios an essential topic of study in junior secondary. In this study, 15 Australian Year 8 students were administered a ratio test followed by semi-structured interviews to explore their conceptions of ratio prior to formal instruction. In this…
Descriptors: Secondary School Students, Mathematics Instruction, Foreign Countries, Multiplication
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Grantee Submission, 2020
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Item Response Theory, Test Bias, Test Items
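The bias concern can be illustrated with a deliberately naive difficulty estimate (an assumed shortcut, not the paper's approach): the same item looks easier when calibrated only on the self-selected, higher-ability students who attempt it in a VLE than on a representative pilot sample.

```python
import numpy as np

rng = np.random.default_rng(2)
B_TRUE = 0.5  # true Rasch difficulty of the item

def simulate(thetas):
    """Simulate 0/1 responses to the item under a Rasch model."""
    p = 1.0 / (1.0 + np.exp(-(thetas - B_TRUE)))
    return (rng.random(thetas.size) < p).astype(float)

def naive_b(responses, assumed_mean_ability=0.0):
    """Unpiloted difficulty estimate: logit of proportion-incorrect,
    anchored at an assumed mean ability (hypothetical shortcut)."""
    p = responses.mean()
    return assumed_mean_ability + np.log((1.0 - p) / p)

pilot = rng.normal(0.0, 1.0, 5000)  # representative pilot sample
vle = rng.normal(0.8, 1.0, 5000)    # self-selected, higher-ability VLE users
print("pilot estimate:", naive_b(simulate(pilot)))  # near 0.5
print("VLE estimate:  ", naive_b(simulate(vle)))    # biased low
```

When the anchoring assumption about the respondents' ability is wrong, the difficulty estimate absorbs the difference, which is the kind of bias the study addresses for unpiloted VLE items.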
Martha L. Epstein; Hamza Malik; Kun Wang; Chandra Hawley Orrill – Grantee Submission, 2022
Response Process Validity (RPV) reflects the degree to which items are interpreted as intended by item developers. In this study, teachers' responses to constructed response (CR) items designed to assess the pedagogical content knowledge (PCK) of middle school mathematics teachers were evaluated to determine which types of responses signaled weak RPV. We…
Descriptors: Teacher Response, Test Items, Pedagogical Content Knowledge, Mathematics Teachers
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests