ERIC - Search Results

Publication Date

In 2025	7
Since 2024	22
Since 2021 (last 5 years)	71
Since 2016 (last 10 years)	154
Since 2006 (last 20 years)	418

Descriptor

Evaluation Methods	667
Test Items	667
Test Construction	219
Student Evaluation	158
Item Response Theory	142
Foreign Countries	122
Item Analysis	104
Test Validity	104
Computer Assisted Testing	94
Psychometrics	89
Test Bias	85
Scores	83
Measurement Techniques	75
Models	74
Comparative Analysis	73
Simulation	72
Difficulty Level	69
Educational Assessment	56
Elementary Secondary Education	50
Test Reliability	50
Correlation	49
Multiple Choice Tests	49
Statistical Analysis	49
Scoring	46
Higher Education	45
More ▼

Education Level

Higher Education	83
Postsecondary Education	57
Elementary Secondary Education	54
Elementary Education	52
Secondary Education	49
Grade 8	23
Middle Schools	22
High Schools	20
Grade 4	17
Grade 6	14
Junior High Schools	13
Grade 5	12
Intermediate Grades	11
Early Childhood Education	10
Grade 3	7
Grade 10	5
Grade 7	5
Adult Education	4
Grade 12	4
Grade 2	4
Grade 9	4
Preschool Education	4
Primary Education	4
Grade 11	3
Kindergarten	3
More ▼

Audience

Practitioners	38
Teachers	37
Researchers	12
Administrators	9
Students	4
Support Staff	3
Community	1
Parents	1
Policymakers	1

Location

Canada	14
Oregon	9
Germany	8
Turkey	8
United States	8
Australia	7
Netherlands	6
Taiwan	6
United Kingdom	6
United Kingdom (England)	6
China	4
Hong Kong	4
India	4
Italy	4
Japan	4
South Africa	4
California	3
Indonesia	3
Massachusetts	3
North Carolina	3
South Korea	3
Asia	2
Ethiopia	2
Ghana	2
Iran	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	11
No Child Left Behind Act 2001	5
Every Student Succeeds Act…	3
Rehabilitation Act 1973…	3
Education for All Handicapped…	1
Elementary and Secondary…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 667 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Evaluating Methodological Enhancements to the Yes/No Angoff Standard-Setting Method in Language Proficiency Assessment

Peer reviewed

Direct link

Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024

This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…

Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods

Examination of the Aggregate Scoring Method in a Judgment Concordance Test

Peer reviewed
PDF on ERIC

Download full text

Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023

The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…

Descriptors: Scoring, Tests, Evaluation Methods, Test Items

Exploring Quality Criteria and Evaluation Methods in Automated Question Generation: A Comprehensive Survey

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Education and Information Technologies, 2024

In light of the widespread adoption of technology-enhanced learning and assessment platforms, there is a growing demand for innovative, high-quality, and diverse assessment questions. Automatic Question Generation (AQG) has emerged as a valuable solution, enabling educators and assessment developers to efficiently produce a large volume of test…

Descriptors: Computer Assisted Testing, Test Construction, Test Items, Automation

Students' Acceptance of and Preferences Regarding Online Exams: A Systematic Literature Review

Peer reviewed

Direct link

Arif Cem Topuz; Kinshuk – Educational Technology Research and Development, 2024

Online assessments of learning, or online exams, have become increasingly widespread with the rise of distance learning. Online exams are preferred by many students and are perceived as a quick and easy tool to measure knowledge. On the contrary, some students are concerned about the possibility of cheating and technological difficulties in online…

Descriptors: Computer Assisted Testing, Student Evaluation, Evaluation Methods, Student Attitudes

Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items

Peer reviewed

Direct link

Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024

Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…

Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Validating a Novel Digital Performance-Based Assessment of Data Literacy: Psychometric and Eye-Tracking Analyses

Peer reviewed

Direct link

Fu Chen; Ying Cui; Alina Lutsyk-King; Yizhu Gao; Xiaoxiao Liu; Maria Cutumisu; Jacqueline P. Leighton – Education and Information Technologies, 2024

Post-secondary data literacy education is critical to students' academic and career success. However, the literature has not adequately addressed the conceptualization and assessment of data literacy for post-secondary students. In this study, we introduced a novel digital performance-based assessment for teaching and evaluating post-secondary…

Descriptors: Performance Based Assessment, College Students, Information Literacy, Evaluation Methods

On the Uses of Process Data in Psychometric Research: Response Process Validity, Theory-Building, and Operational Research

Direct link

Matthew John Davidson – ProQuest LLC, 2022

Digitally-based assessments create opportunities for collecting moment to moment information about how students are responding to assessment items. This information, called log or process data, has long been regarded as a vast and valuable source of data about student performance. Despite repeated assurances of its vastness and value, process data…

Descriptors: Data Use, Psychometrics, Item Response Theory, Test Items

Accessibility of GCSE Science Questions That Ask Students to Create and Augment Visuals: Evidence from Question Omit Rates

Download full text

Santi Lestari – Research Matters, 2025

The ability to draw visual representations such as diagrams and graphs is considered fundamental to science learning. Science exams therefore often include questions which require students to draw a visual representation, or to augment a partially provided one. The design features of such questions (e.g., layout of diagrams, amount of answer…

Descriptors: Science Education, Secondary Education, Visual Aids, Foreign Countries

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Reverse Engineering a Multiple-Choice Test Blueprint to Improve Course Alignment

Peer reviewed
PDF on ERIC

Download full text

Maristela Petrovic-Dzerdz – Collected Essays on Learning and Teaching, 2024

Large introductory classes, with their expansive curriculum, demand assessment strategies that blend efficiency with reliability, prompting the consideration of multiple-choice (MC) tests as a viable option. Crafting a high-quality MC test, however, necessitates a meticulous process involving reflection on assessment format appropriateness, test…

Descriptors: Multiple Choice Tests, Test Construction, Test Items, Alignment (Education)

Confirmatory Factor Analysis of the Teacher Efficacy for Inclusive Practices Scale: A Study of Teachers in Bosnia and Herzegovina

Peer reviewed

Direct link

Edinalda Jakubovic; Haris Memisevic – Journal of Research in Special Educational Needs, 2024

The Teacher Efficacy for Inclusive Practices (TEIP) scale is a widely used instrument for assessing teachers' effectiveness in implementing inclusive practices. The TEIP has not been validated in Bosnia and Herzegovina (BIH). The goal of the present study was to conduct a confirmatory factor analysis (CFA) of the TEIP in a sample of teachers in…

Descriptors: Teacher Effectiveness, Inclusion, Teaching Methods, Foreign Countries

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 45

Educational and Psychological…	38
Journal of Educational…	29
Applied Psychological…	21
Applied Measurement in…	19
ProQuest LLC	14
Online Submission	13
Educational Measurement:…	11
Measurement:…	11
International Journal of…	9
Behavioral Research and…	8
Journal of Educational and…	8
ETS Research Report Series	7
Psychometrika	7
Grantee Submission	6
Practical Assessment,…	6
Studies in Educational…	6
Education and Information…	5
Educational Assessment	5
Journal of Applied Testing…	5
Journal of Chemical Education	5
Journal of Research in…	5
Achieve, Inc.	4
International Journal of…	4
Journal of Psychoeducational…	4
Language Testing	4
More ▼

Tindal, Gerald	9
Alonzo, Julie	8
Hambleton, Ronald K.	8
Lai, Cheng Fei	7
Wang, Wen-Chung	7
Sireci, Stephen G.	6
Nandakumar, Ratna	5
Penfield, Randall D.	5
Reckase, Mark D.	5
van der Linden, Wim J.	5
Abedi, Jamal	4
Davey, Tim	4
Gierl, Mark J.	4
Robitzsch, Alexander	4
Thurlow, Martha L.	4
Eignor, Daniel R.	3
Finch, Holmes	3
Fraillon, Julian	3
Hill, Heather C.	3
Johanson, George A.	3
Kim, Do-Hong	3
Lu, Ying	3
Merz, William R.	3
Rogers, H. Jane	3
More ▼

Journal Articles	452
Reports - Research	305
Reports - Evaluative	160
Reports - Descriptive	102
Speeches/Meeting Papers	71
Tests/Questionnaires	28
Guides - Classroom - Teacher	23
Guides - Non-Classroom	22
Opinion Papers	22
Information Analyses	17
Numerical/Quantitative Data	16
Dissertations/Theses -…	14
Books	10
Guides - Classroom - Learner	5
Guides - General	5
Book/Product Reviews	3
Collected Works - General	3
ERIC Publications	3
Multilingual/Bilingual…	3
Non-Print Media	3
Reference Materials - General	3
Reports - General	3
ERIC Digests in Full Text	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
More ▼

National Assessment of…	13
Program for International…	12
Trends in International…	7
SAT (College Admission Test)	6
Graduate Record Examinations	5
Test of English as a Foreign…	3
California Achievement Tests	2
Dynamic Indicators of Basic…	2
Flesch Kincaid Grade Level…	2
ACT Assessment	1
Armed Services Vocational…	1
Bayley Scales of Infant…	1
Center for Epidemiologic…	1
Florida Comprehensive…	1
Graduate Management Admission…	1
Hidden Figures Test	1
International English…	1
Maslach Burnout Inventory	1
Massachusetts Comprehensive…	1
Mayer Salovey Caruso…	1
Medical College Admission Test	1
North Carolina End of Course…	1
Pennsylvania Educational…	1
Piers Harris Childrens Self…	1
Progress in International…	1
More ▼