ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	26
Since 2007 (last 20 years)	106

Descriptor

Models	139
Scores	139
Testing	36
Academic Achievement	29
Computer Assisted Testing	28
Hypothesis Testing	28
Comparative Analysis	26
Educational Testing	26
Correlation	24
Evaluation Methods	23
Statistical Analysis	22
Foreign Countries	21
Item Response Theory	21
Achievement Gains	16
Measurement Techniques	16
Teacher Evaluation	16
Standardized Tests	15
Student Evaluation	15
Teacher Effectiveness	15
Test Construction	15
Test Items	15
Prediction	14
Testing Problems	14
Predictor Variables	13
Regression (Statistics)	13
More ▼

Publication Type

Journal Articles	76
Reports - Research	71
Reports - Evaluative	25
Dissertations/Theses -…	15
Reports - Descriptive	15
Speeches/Meeting Papers	11
Opinion Papers	5
Guides - Non-Classroom	4
Collected Works - Proceedings	3
Information Analyses	2
Books	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	19
Elementary Secondary Education	17
Postsecondary Education	14
Secondary Education	14
Elementary Education	13
Middle Schools	10
Junior High Schools	8
High Schools	6
Grade 7	3
Grade 8	3
Adult Education	2
Grade 11	2
Grade 4	2
Grade 6	2
Intermediate Grades	2
Grade 10	1
Grade 2	1
Grade 3	1
Grade 9	1
High School Equivalency…	1
Two Year Colleges	1
More ▼

Audience

Practitioners	3
Researchers	3
Students	1

Location

California	4
Texas	4
New York	3
United States	3
Australia	2
China	2
Denmark	2
Finland	2
Florida	2
France	2
Germany	2
Greece	2
Indonesia	2
Netherlands	2
North Carolina	2
Ohio	2
Pakistan	2
Philippines	2
Singapore	2
Turkey	2
United Kingdom	2
Asia	1
Austria	1
Azerbaijan	1
Belgium	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	3
No Child Left Behind Act 2001	2
Race to the Top	1

Assessments and Surveys

SAT (College Admission Test)	4
Test of English as a Foreign…	3
ACT Assessment	2
Preliminary Scholastic…	2
Program for International…	2
ACTFL Oral Proficiency…	1
California Achievement Tests	1
Early Childhood Longitudinal…	1
Gates MacGinitie Reading Tests	1
General Educational…	1
International English…	1
Kaufman Test of Educational…	1
National Merit Scholarship…	1
Nelson Denny Reading Tests	1
Stanford Achievement Tests	1
Watson Glaser Critical…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 139 results Save | Export

Item Response Theory Models for Difference-in-Difference Estimates (And Whether They Are Worth the Trouble)

Peer reviewed

Direct link

James Soland – Journal of Research on Educational Effectiveness, 2024

When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…

Descriptors: Item Response Theory, Testing, Test Validity, Intervention

Integration of Prediction Scores from Various Automated Essay Scoring Models Using Item Response Theory

Peer reviewed

Direct link

Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023

In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…

Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring

Evaluating Coherence in Writing: Comparing the Capacity of Automated Essay Scoring Technologies

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Journal of Applied Testing Technology, 2022

Automated Essay Scoring (AES) technologies provide innovative solutions to score the written essays with a much shorter time span and at a fraction of the current cost. Traditionally, AES emphasized the importance of capturing the "coherence" of writing because abundant evidence indicated the connection between coherence and the overall…

Descriptors: Computer Assisted Testing, Scoring, Essays, Automation

Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues

Peer reviewed

Direct link

Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024

Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…

Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods

Brief Report: Polynomial Regression Mixture Modeling for Heterogeneous Effects of Informant Congruence

Peer reviewed

Direct link

Eunsook Kim; Nathaniel von der Embse – Journal of Experimental Education, 2024

Using data from multiple informants has long been considered best practice in education. However, multiple informants often disagree on similar constructs, complicating decision-making. Polynomial regression and response-surface analysis (PRA) is often used to test the congruence effect between multiple informants on an outcome. However, PRA…

Descriptors: Congruence (Psychology), Information Sources, Best Practices, Regression (Statistics)

Automated Assessment in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses

Peer reviewed

Direct link

Sami Baral; Eamon Worden; Wen-Chiang Lim; Zhuang Luo; Christopher Santorelli; Ashish Gurung; Neil Heffernan – Grantee Submission, 2024

The effectiveness of feedback in enhancing learning outcomes is well documented within Educational Data Mining (EDM). Various prior research have explored methodologies to enhance the effectiveness of feedback to students in various ways. Recent developments in Large Language Models (LLMs) have extended their utility in enhancing automated…

Descriptors: Automation, Scoring, Computer Assisted Testing, Natural Language Processing

Experiment's Persistent Failure in Education Inquiry, and Why It Keeps Failing

Peer reviewed

Direct link

Thomas, Gary – British Educational Research Journal, 2021

Natural scientists are relaxed about the multiple forms experiment takes in their various fields. Yet in education we have for many years constrained our notion of experiment. This methodological circumscription has been self-imposed on the grounds that experiment of a particular, well-defined form offers the clearest evidence of a link between…

Descriptors: Educational Experiments, Models, Intervention, Context Effect

Using Automated Analysis to Assess Middle School Students' Competence with Scientific Argumentation

Peer reviewed

Direct link

Christopher D. Wilson; Kevin C. Haudek; Jonathan F. Osborne; Zoë E. Buck Bracey; Tina Cheuk; Brian M. Donovan; Molly A. M. Stuhlsatz; Marisol M. Santiago; Xiaoming Zhai – Journal of Research in Science Teaching, 2024

Argumentation is fundamental to science education, both as a prominent feature of scientific reasoning and as an effective mode of learning--a perspective reflected in contemporary frameworks and standards. The successful implementation of argumentation in school science, however, requires a paradigm shift in science assessment from the…

Descriptors: Middle School Students, Competence, Science Process Skills, Persuasive Discourse

Learning Automated Essay Scoring Models Using Item-Response-Theory-Based Scores to Decrease Effects of Rater Biases

Peer reviewed

Direct link

Uto, Masaki; Okano, Masashi – IEEE Transactions on Learning Technologies, 2021

In automated essay scoring (AES), scores are automatically assigned to essays as an alternative to grading by humans. Traditional AES typically relies on handcrafted features, whereas recent studies have proposed AES models based on deep neural networks to obviate the need for feature engineering. Those AES models generally require training on a…

Descriptors: Essays, Scoring, Writing Evaluation, Item Response Theory

Inaccurate Individual Ability Estimates with Three-Parameter Item Response Models in Mixture Settings

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Huber, Chuck – Measurement: Interdisciplinary Research and Perspectives, 2020

It is demonstrated that the popular three-parameter logistic model can lead to markedly inaccurate individual ability level estimates for mixture populations. A theoretically and empirically important setting is initially considered where (a) in one of two subpopulations (latent classes) the two-parameter logistic model holds for each item in a…

Descriptors: Item Response Theory, Models, Measurement Techniques, Item Analysis

Process Mining Combined with Expert Feature Engineering to Predict Efficient Use of Time on High-Stakes Assessments

Peer reviewed
PDF on ERIC

Download full text

Levin, Nathan A. – Journal of Educational Data Mining, 2021

The Big Data for Education Spoke of the NSF Northeast Big Data Innovation Hub and ETS co-sponsored an educational data mining competition in which contestants were asked to predict efficient time use on the NAEP 8th grade mathematics computer-based assessment, based on the log file of a student's actions on a prior portion of the assessment. In…

Descriptors: Learning Analytics, Data Collection, Competition, Prediction

Cramming: Short- and Long-Run Effects. EdWorkingPaper No. 21-444

Download full text

Michael Gilraine; Jeffrey Penney – Annenberg Institute for School Reform at Brown University, 2021

An administrative rule allowed students who failed an exam to retake it shortly after, triggering strong `teach to the test' incentives to raise these students' test scores for the retake. We develop a model that accounts for truncation and find that these students score 0.14 standard deviations higher on the retest. Using a regression…

Descriptors: Tests, Models, Scores, Test Coaching

Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

Direct link

Ayodele, Alicia Nicole – ProQuest LLC, 2017

Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

Descriptors: Statistical Analysis, Test Bias, Test Items, Scores

Simulation of LD Identification Accuracy Using a Pattern of Processing Strengths and Weaknesses Method with Multiple Measures

Peer reviewed

Direct link

Miciak, Jeremy; Taylor, W. Pat; Stuebing, Karla K.; Fletcher, Jack M. – Journal of Psychoeducational Assessment, 2018

We investigated the classification accuracy of learning disability (LD) identification methods premised on the identification of an intraindividual pattern of processing strengths and weaknesses (PSW) method using multiple indicators for all latent constructs. Known LD status was derived from latent scores; values at the observed level identified…

Descriptors: Accuracy, Learning Disabilities, Classification, Identification

Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

Peer reviewed

Direct link

Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018

Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

Descriptors: Error of Measurement, Testing, Scores, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

ProQuest LLC	14
ETS Research Report Series	4
Applied Measurement in…	3
International Educational…	3
International Journal of…	3
Journal of Educational and…	3
Language Testing	3
National Center for Analysis…	3
Carnegie Foundation for the…	2
Education and Information…	2
Educational Leadership	2
Educational Measurement:…	2
Educational Technology &…	2
Educational Testing Service	2
Educational and Psychological…	2
IEEE Transactions on Learning…	2
Intelligence	2
Journal of Educational…	2
Journal of Experimental…	2
Language Assessment Quarterly	2
Online Submission	2
Psychological Assessment	2
ACT, Inc.	1
Annenberg Institute for…	1
Applied Psychological…	1
More ▼

Hutchison-Lupardus, Tammy R.	3
Snyder, Jennifer E.	3
Arendasy, Martin E.	2
Bormuth, John R.	2
Hadfield, Timothy E.	2
Kane, Michael	2
McCaffrey, Daniel F.	2
Raykov, Tenko	2
Rizavi, Saba	2
Sommer, Markus	2
Uto, Masaki	2
Abuya, Benta A.	1
Adesoji, Francis Adewunmi	1
Airola, Denise Tobin	1
Allen, Scott J.	1
Amrein-Beardsley, Audrey	1
Andjelic, Svetlana	1
Anggrianto, Desi	1
Aomi, Itsuki	1
Aouine, Amina	1
Arief, Mohammad	1
Arnold, Samuel R. C.	1
Aryadoust, Vahid	1
Ashish Gurung	1
More ▼