ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	11
Since 2007 (last 20 years)	27

Descriptor

Accuracy	27
Evaluation Methods	27
Statistical Analysis	27
Comparative Analysis	7
Scores	7
Models	6
Statistical Bias	6
Student Evaluation	6
Classification	5
Error of Measurement	5
Sample Size	5
Correlation	4
Foreign Countries	4
Grammar	4
Item Response Theory	4
Monte Carlo Methods	4
Pretests Posttests	4
Research Methodology	4
Simulation	4
Benchmarking	3
Computation	3
Expertise	3
Hypothesis Testing	3
Interrater Reliability	3
Knowledge Level	3
More ▼

Publication Type

Reports - Research	20
Journal Articles	19
Dissertations/Theses -…	3
Reports - Evaluative	3
Tests/Questionnaires	2
Information Analyses	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	4
Elementary Secondary Education	2
High Schools	2
Postsecondary Education	2
Secondary Education	2
Adult Education	1
Grade 7	1

Audience

Location

Colombia	1
Iran	1
Netherlands	1
New Jersey	1
United Kingdom (England)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

International English…

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Comparison of the K1 Rule, Parallel Analysis, and the Bass-Ackward Method on Identifying the Number of Factors in Factor Analysis

Peer reviewed

Direct link

Lingbo Tong; Wen Qu; Zhiyong Zhang – Grantee Submission, 2025

Factor analysis is widely utilized to identify latent factors underlying the observed variables. This paper presents a comprehensive comparative study of two widely used methods for determining the optimal number of factors in factor analysis, the K1 rule, and parallel analysis, along with a more recently developed method, the bass-ackward method.…

Descriptors: Factor Analysis, Monte Carlo Methods, Statistical Analysis, Sample Size

An Evaluation of Statistical Methods for Aggregate Patterns of Replication Failure

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jacob M. Schauer; Kaitlyn G. Fitzgerald; Sarah Peko-Spicer; Mena C. R. Whalen; Rrita Zejnullahi; Larry V. Hedges – Grantee Submission, 2021

Several programs of research have sought to assess the replicability of scientific findings in different fields, including economics and psychology. These programs attempt to replicate several findings and use the results to say something about large-scale patterns of replicability in a field. However, little work has been done to understand the…

Descriptors: Statistical Analysis, Research Methodology, Evaluation Methods, Replication (Evaluation)

Estimating the Accuracy of Relative Growth Measures Using Empirical Data

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020

The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…

Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Direct link

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Direct link

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

Document Level Assessment of Document Retrieval Systems in a Pairwise System Evaluation

Peer reviewed
PDF on ERIC

Download full text

Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017

Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…

Descriptors: Information Retrieval, Documentation, Scores, Information Systems

Simultaneous Synthesis of Treatment Effects and Mapping to a Common Scale: An Alternative to Standardisation

Peer reviewed

Direct link

Ades, A. E.; Lu, Guobing; Dias, Sofia; Mayo-Wilson, Evan; Kounali, Daphne – Research Synthesis Methods, 2015

Objective: Trials often may report several similar outcomes measured on different test instruments. We explored a method for synthesising treatment effect information both within and between trials and for reporting treatment effects on a common scale as an alternative to standardisation Study design: We applied a procedure that simultaneously…

Descriptors: Research Methodology, Evaluation Methods, Metabolism, Accuracy

Methods to Estimate the Variance of Some Indices of the Signal Detection Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Suero, Manuel; Privado, Jesús; Botella, Juan – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

A simulation study is presented to evaluate and compare three methods to estimate the variance of the estimates of the parameters d and "C" of the signal detection theory (SDT). Several methods have been proposed to calculate the variance of their estimators, "d'" and "c." Those methods have been mostly assessed by…

Descriptors: Evaluation Methods, Theories, Simulation, Statistical Analysis

Pupillometry as a Tool to Study Expertise in Medicine

Peer reviewed
PDF on ERIC

Download full text

Szulewski, Adam; Kelton, Danielle; Howes, Daniel – Frontline Learning Research, 2017

Background: Pupillometry has been studied as a physiological marker for quantifying cognitive load since the early 1960s. It has been established that small changes in pupillary size can provide an index of the cognitive load of an individual as he/she performs a mental task. The utility of pupillometry as a measure of expertise is less well…

Descriptors: Expertise, Medicine, Eye Movements, Diagnostic Tests

Evaluation of Diagnostic Systems: The Selection of Students at Risk of Academic Difficulties

Peer reviewed

Direct link

Smolkowski, Keith; Cummings, Kelli D. – Assessment for Effective Intervention, 2015

Diagnostic tools can help schools more consistently and fairly match instructional resources to the needs of their students. To ensure the best educational outcome for each child, diagnostic decision-making systems seek to balance time, clarity, and accuracy. However, recent research notes that many educational decisions tend to be made using…

Descriptors: At Risk Students, Educational Diagnosis, Decision Making, Statistical Analysis

Peer versus Teacher Assessment: Implications for CAF Triad Language Ability and Critical Reflections

Peer reviewed

Direct link

Ghahari, Shima; Farokhnia, Farzaneh – International Journal of School & Educational Psychology, 2018

Literature on the learning benefits and interpersonal mechanisms of peer assessment (PA) and teacher assessment (TA) has been inconsistent. As part of a large-scale study, the research reported here has addressed the effect of formative PA on language grammar uptake and complexity, accuracy, and fluency triad scale levels, in comparison both to TA…

Descriptors: Reflection, Formative Evaluation, Peer Evaluation, Grammar

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Approaches for Combining Multiple Measures of Teacher Performance: Reliability, Validity, and Implications for Evaluation Policy

Peer reviewed

Direct link

Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016

A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…

Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy

Optimizing Partial Credit Algorithms to Predict Student Performance

Download full text

Ostrow, Korinn; Donnelly, Chistopher; Heffernan, Neil – International Educational Data Mining Society, 2015

As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…

Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits

Bayesian Asymmetric Regression as a Means to Estimate and Evaluate Oral Reading Fluency Slopes

Peer reviewed

Direct link

Solomon, Benjamin G.; Forsberg, Ole J. – School Psychology Quarterly, 2017

Bayesian techniques have become increasingly present in the social sciences, fueled by advances in computer speed and the development of user-friendly software. In this paper, we forward the use of Bayesian Asymmetric Regression (BAR) to monitor intervention responsiveness when using Curriculum-Based Measurement (CBM) to assess oral reading…

Descriptors: Bayesian Statistics, Regression (Statistics), Least Squares Statistics, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2

ProQuest LLC	3
Grantee Submission	2
Applied Measurement in…	1
Assessment for Effective…	1
Cognitive Science	1
College Board	1
Education and Information…	1
Educational Evaluation and…	1
Educational and Psychological…	1
Frontline Learning Research	1
Information Research: An…	1
International Educational…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Experimental…	1
Journal of the Scholarship of…	1
PROFILE: Issues in Teachers'…	1
Psicologica: International…	1
Regional Educational…	1
Research Synthesis Methods	1
School Psychology Quarterly	1
Society for Research on…	1
More ▼

Abayeva, Nella F.	1
Ades, A. E.	1
Ambridge, Ben	1
Angus, Megan Hague	1
Bahreini, Kiavash	1
Bell, Athene Cooper	1
Botella, Juan	1
Caicedo Pereira, Martin Javier	1
Castellano, Katherine E.	1
Cheema, Jehanzeb	1
Cohen, Allan	1
Cousineau, Denis	1
Cummings, Kelli D.	1
DeMars, Christine E.	1
Dias, Sofia	1
Donnelly, Chistopher	1
Engelhard, George, Jr.	1
Farokhnia, Farzaneh	1
Forsberg, Ole J.	1
Ghahari, Shima	1
Goldschmidt, Pete	1
Golovachyova, Viktoriya N.	1
Heffernan, Neil	1
Herrmann, Mariesa	1
Howes, Daniel	1
More ▼