ERIC - Search Results

Publication Date

In 2025	17
Since 2024	60
Since 2021 (last 5 years)	129
Since 2016 (last 10 years)	220
Since 2006 (last 20 years)	308

Descriptor

Accuracy	308
Evaluation Methods	308
Foreign Countries	76
Comparative Analysis	66
Student Evaluation	52
Models	48
Scores	46
Correlation	37
Feedback (Response)	32
Simulation	31
Classification	30
Prediction	30
English (Second Language)	27
Statistical Analysis	27
Teaching Methods	27
Artificial Intelligence	26
Second Language Learning	25
Validity	25
Item Response Theory	24
Undergraduate Students	24
Computer Software	23
Decision Making	23
Scoring	23
Task Analysis	23
Bayesian Statistics	22
More ▼

Publication Type

Journal Articles	247
Reports - Research	234
Reports - Descriptive	24
Dissertations/Theses -…	22
Reports - Evaluative	21
Tests/Questionnaires	15
Information Analyses	13
Speeches/Meeting Papers	12
Books	3
Collected Works - General	2
Collected Works - Proceedings	1
Guides - General	1
Opinion Papers	1
More ▼

Education Level

Higher Education	77
Postsecondary Education	73
Elementary Education	36
Secondary Education	21
Early Childhood Education	17
Primary Education	14
Elementary Secondary Education	12
Kindergarten	8
Middle Schools	8
Grade 2	7
High Schools	7
Intermediate Grades	5
Grade 1	4
Junior High Schools	4
Preschool Education	4
Adult Education	3
Grade 3	3
Grade 5	3
Grade 4	2
Grade 7	2
Grade 10	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 8	1
More ▼

Audience

Teachers	4
Practitioners	3
Administrators	2
Policymakers	2

Location

Iran	12
China	10
Australia	5
Netherlands	5
Germany	4
United Kingdom (England)	4
Florida	3
Malaysia	3
Turkey	3
Afghanistan	2
California	2
Colombia	2
Hong Kong	2
Indonesia	2
South Korea	2
Spain	2
Texas	2
Thailand	2
United Kingdom	2
United States	2
Wisconsin	2
Arizona	1
Belgium	1
California (Los Angeles)	1
Colorado	1
More ▼

Laws, Policies, & Programs

Family Educational Rights and…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 308 results Save | Export

An Alternative Prior for Estimation in High-Dimensional Settings

Peer reviewed

Direct link

Michael Nagel; Lukas Fischer; Tim Pawlowski; Augustin Kelava – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Bayesian estimations of complex regression models with high-dimensional parameter spaces require advanced priors, capable of addressing both sparsity and multicollinearity in the data. The Dirichlet-horseshoe, a new prior distribution that combines and expands on the concepts of the regularized horseshoe and the Dirichlet-Laplace priors, is a…

Descriptors: Bayesian Statistics, Regression (Statistics), Computation, Statistical Distributions

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

Investigation of Preknowledge Cheating via Joint Hierarchical Modeling Patterns of Response Accuracy and Response Time

Peer reviewed

Direct link

Ebru Balta; Celal Deha Dogan – SAGE Open, 2024

As computer-based testing becomes more prevalent, the attention paid to response time (RT) in assessment practice and psychometric research correspondingly increases. This study explores the rate of Type I error in detecting preknowledge cheating behaviors, the power of the Kullback-Leibler (KL) divergence measure, and the L person fit statistic…

Descriptors: Cheating, Accuracy, Reaction Time, Computer Assisted Testing

Redefining Item Response Models for Small Samples

Peer reviewed

Direct link

Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025

Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…

Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics

Quantifying Individual Personality Change More Accurately by Regression-Based Change Scores

Peer reviewed

Direct link

Steffen Zitzmann; Lisa Bardach; Kai T. Horstmann; Matthias Ziegler; Martin Hecht – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We investigated three different approaches for quantifying individual change and reporting it back to persons: (a) the common change score, which is obtained by first computing scale scores from two consecutive measurements and then subtract these scores from one another, (b) the ad-hoc approach, which is similar to the former approach but uses…

Descriptors: Personality Change, Personality Measures, Regression (Statistics), Evaluation Methods

An Application of Text Embeddings to Support Alignment of Educational Content Standards

Peer reviewed

Direct link

Reese Butterfuss; Harold Doran – Educational Measurement: Issues and Practice, 2025

Large language models are increasingly used in educational and psychological measurement activities. Their rapidly evolving sophistication and ability to detect language semantics make them viable tools to supplement subject matter experts and their reviews of large amounts of text statements, such as educational content standards. This paper…

Descriptors: Alignment (Education), Academic Standards, Content Analysis, Concept Mapping

Uncertainty in Artificial Neural Network Models: Monte-Carlo Simulations beyond the GUM Boundaries

Peer reviewed

Direct link

A. M. Sadek; Fahad Al-Muhlaki – Measurement: Interdisciplinary Research and Perspectives, 2024

In this study, the accuracy of the artificial neural network (ANN) was assessed considering the uncertainties associated with the randomness of the data and the lack of learning. The Monte-Carlo algorithm was applied to simulate the randomness of the input variables and evaluate the output distribution. It has been shown that under certain…

Descriptors: Monte Carlo Methods, Accuracy, Artificial Intelligence, Guidelines

Adjusting for Misclassification of an Exposure in an Individual Participant Data Meta-Analysis

Peer reviewed

Direct link

de Jong, Valentijn M. T.; Campbell, Harlan; Maxwell, Lauren; Jaenisch, Thomas; Gustafson, Paul; Debray, Thomas P. A. – Research Synthesis Methods, 2023

A common problem in the analysis of multiple data sources, including individual participant data meta-analysis (IPD-MA), is the misclassification of binary variables. Misclassification may lead to biased estimators of model parameters, even when the misclassification is entirely random. We aimed to develop statistical methods that facilitate…

Descriptors: Classification, Meta Analysis, Bayesian Statistics, Evaluation Methods

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Direct link

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

Comparison of the K1 Rule, Parallel Analysis, and the Bass-Ackward Method on Identifying the Number of Factors in Factor Analysis

Peer reviewed

Direct link

Lingbo Tong; Wen Qu; Zhiyong Zhang – Grantee Submission, 2025

Factor analysis is widely utilized to identify latent factors underlying the observed variables. This paper presents a comprehensive comparative study of two widely used methods for determining the optimal number of factors in factor analysis, the K1 rule, and parallel analysis, along with a more recently developed method, the bass-ackward method.…

Descriptors: Factor Analysis, Monte Carlo Methods, Statistical Analysis, Sample Size

Classification Consistency and Accuracy Indices for Simple Structure Multidimensional Item Response Theory Model

Direct link

Huan Liu – ProQuest LLC, 2024

In many large-scale testing programs, examinees are frequently categorized into different performance levels. These classifications are then used to make high-stakes decisions about examinees in contexts such as in licensure, certification, and educational assessments. Numerous approaches to estimating the consistency and accuracy of this…

Descriptors: Classification, Accuracy, Item Response Theory, Decision Making

Self-Assessment Accuracy in Behavior Analytic Contexts

Direct link

Alan J. Kinsella – ProQuest LLC, 2024

An accurate self-assessment repertoire is crucial for maintaining high standards of practice, or a scope of competence, among behavior analysts. However, procedural means to achieve this remain underexplored. Medical communities have investigated these effects and largely found that accuracy in self-assessment is poor, with an inverse relation…

Descriptors: Self Evaluation (Individuals), Accuracy, Behavior, Evaluation Methods

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Estimating Classification Accuracy and Consistency Indices for Multiple Measures with the Simple Structure MIRT Model

Peer reviewed

Direct link

Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023

Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…

Descriptors: Testing, Computation, Classification, Accuracy

Sketching Assessment in Engineering Education: A Systematic Literature Review

Peer reviewed

Direct link

Hillary E. Merzdorf; Donna Jaison; Morgan B. Weaver; Julie Linsey; Tracy Hammond; Kerrie A. Douglas – Journal of Engineering Education, 2024

Background: Sketching exists in many disciplines and varies in how it is assessed, making it challenging to define fundamental sketching skills and the characteristics of a high-quality sketch. For instructors to apply effective strategies for teaching and assessing engineering sketching, a clear summary of the constructs, metrics, and objectives…

Descriptors: Freehand Drawing, Engineering Education, Educational Research, Design

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 21

ProQuest LLC	21
Grantee Submission	13
Education and Information…	8
International Educational…	8
Journal of Educational…	8
Journal of Speech, Language,…	7
ETS Research Report Series	6
Educational and Psychological…	6
Research Synthesis Methods	6
Assessment & Evaluation in…	5
Interactive Learning…	5
Structural Equation Modeling:…	5
Assessment for Effective…	4
IEEE Transactions on Learning…	4
Cognitive Science	3
Developmental Psychology	3
Higher Education Studies	3
Interpreter and Translator…	3
Journal of Autism and…	3
Language Testing in Asia	3
Language, Speech, and Hearing…	3
School Psychology Quarterly	3
American Journal of Evaluation	2
Applied Measurement in…	2
Brain and Cognition	2
More ▼

Chun Wang	5
Gongjun Xu	4
Klingbeil, David A.	3
Van Norman, Ethan R.	3
Birr, Chris	2
Castellano, Katherine E.	2
Cummings, Kelli D.	2
Dockrell, Julie E.	2
Glaspey, Amy M.	2
Grapin, Scott E.	2
Jing Lu	2
Jiwei Zhang	2
Lee, Won-Chan	2
Llosa, Lorena	2
MacLeod, Andrea A. N.	2
Masso, Sarah	2
McCaffrey, Daniel F.	2
Nelson, Peter M.	2
Panadero, Ernesto	2
Resing, Wilma C. M.	2
Revuelta, Javier	2
Smolkowski, Keith	2
Solomon, Benjamin G.	2
Van Dyke, Julie A.	2
Vogelaar, Bart	2
More ▼

Program for International…	4
Autism Diagnostic Observation…	2
Goldman Fristoe Test of…	2
International English…	2
Peabody Picture Vocabulary…	2
Raven Progressive Matrices	2
Dynamic Indicators of Basic…	1
Edinburgh Handedness Inventory	1
Flesch Kincaid Grade Level…	1
Massachusetts Comprehensive…	1
Measures of Academic Progress	1
Mullen Scales of Early…	1
National Assessment of…	1
Preschool Language Scale	1
SAT (College Admission Test)	1
State of Texas Assessments of…	1
Test of English as a Foreign…	1
Vineland Adaptive Behavior…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
Wide Range Achievement Test	1
Woodcock Johnson Tests of…	1
Woodcock Johnson Tests of…	1
More ▼