ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	257
Since 2006 (last 20 years)	566

Descriptor

Statistical Analysis	744
Scoring	365
Scoring Rubrics	313
Foreign Countries	192
Comparative Analysis	145
Teaching Methods	140
Scores	116
Correlation	114
English (Second Language)	92
Pretests Posttests	91
Second Language Learning	87
Undergraduate Students	85
Evaluation Methods	82
College Students	81
Questionnaires	74
Student Attitudes	73
Student Evaluation	71
Qualitative Research	69
Scoring Formulas	68
Models	64
Second Language Instruction	64
Test Reliability	63
Language Tests	57
Test Construction	54
Test Items	49

Education Level

Higher Education	272
Postsecondary Education	212
Elementary Education	97
Secondary Education	89
Middle Schools	60
Elementary Secondary Education	39
High Schools	39
Junior High Schools	39
Early Childhood Education	21
Intermediate Grades	21
Grade 7	20
Grade 5	19
Grade 8	15
Grade 4	13
Primary Education	11
Two Year Colleges	11
Grade 6	10
Preschool Education	9
Adult Education	8
Grade 3	8
Grade 11	6
Grade 9	6
Kindergarten	6
Grade 2	4
Grade 1	2

Audience

Researchers	12
Practitioners	7
Teachers	5
Administrators	3
Policymakers	2
Parents	1
Students	1

Location

Turkey	22
California	15
Japan	12
Florida	11
Texas	11
Taiwan	10
New York	9
Spain	9
Australia	8
Canada	8
China	8
Indonesia	8
Illinois	7
Georgia	6
Germany	6
Michigan	6
Virginia	6
Iran	5
New York (New York)	5
New Zealand	5
Ohio	5
United Kingdom	5
United Kingdom (England)	5
Washington	5
Hong Kong	4

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Elementary and Secondary…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1
Does not meet standards	3

Statistical Analysis

Showing 1 to 15 of 744 results

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Deriving Expected Values of Model Parameters When Using Sum Scores in Simulation Research

Peer reviewed

A. R. Georgeson – Structural Equation Modeling: A Multidisciplinary Journal, 2025

There is increasing interest in using factor scores in structural equation models and there have been numerous methodological papers on the topic. Nevertheless, sum scores, which are computed from adding up item responses, continue to be ubiquitous in practice. It is therefore important to compare simulation results involving factor scores to…

Descriptors: Structural Equation Models, Scores, Factor Analysis, Statistical Bias

Exploring Rater Accuracy Using Unfolding Models Combined with Topic Models: Incorporating Supervised Latent Dirichlet Allocation

Peer reviewed

Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022

Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure rater's scoring accuracy by capturing the discrepancy between criterion and operational ratings by placing essays on an…

Descriptors: Accuracy, Scoring, Statistical Analysis, Models

Generalized Linear Factor Score Regression: A Comparison of Four Methods

Peer reviewed

Andersson, Gustaf; Yang-Wallentin, Fan – Educational and Psychological Measurement, 2021

Factor score regression has recently received growing interest as an alternative for structural equation modeling. However, many applications are left without guidance because of the focus on normally distributed outcomes in the literature. We perform a simulation study to examine how a selection of factor scoring methods compare when estimating…

Descriptors: Regression (Statistics), Statistical Analysis, Computation, Scoring

More Power to You: Using Machine Learning to Augment Human Coding for More Efficient Inference in Text-Based Randomized Trials

Peer reviewed

Regan Mozer; Luke Miratrix – Grantee Submission, 2024

For randomized trials that use text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by trained human raters. This process, the current standard, is both time-consuming and limiting: even the largest human coding efforts are typically constrained to…

Descriptors: Artificial Intelligence, Coding, Efficiency, Statistical Inference

How Measurement Affects Causal Inference: Attenuation Bias is (Usually) More Important Than Scoring Weights. EdWorkingPaper No. 23-766

Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2024

When analyzing treatment effects on test scores, researchers face many choices and competing guidance for scoring tests and modeling results. This study examines the impact of scoring choices through simulation and an empirical application. Results show that estimates from multiple methods applied to the same data will vary because two-step models…

Descriptors: Scores, Statistical Bias, Statistical Inference, Scoring

A Comparison of Manual versus Automated Quantitative Production Analysis of Connected Speech

Peer reviewed

Fromm, Davida; Katta, Saketh; Paccione, Mason; Hecht, Sophia; Greenhouse, Joel; MacWhinney, Brian; Schnur, Tatiana T. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: Analysis of connected speech in the field of adult neurogenic communication disorders is essential for research and clinical purposes, yet time and expertise are often cited as limiting factors. The purpose of this project was to create and evaluate an automated program to score and compute the measures from the Quantitative Production…

Descriptors: Speech, Automation, Statistical Analysis, Adults

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

An Investigation of the Comparability of Commission-Approved Teaching Performance Assessment Models. Final Report -- Volume I: Technical Report. No. 120

Sinclair, Andrea L., Ed.; Thacker, Arthur, Ed. – Human Resources Research Organization (HumRRO), 2019

California's Commission on Teacher Credentialing (Commission) requires all programs of preliminary multiple and single subject teacher preparation to use a Commission-approved Teaching Performance Assessment (TPA) as one of the program completion requirements for prospective teacher candidates. Three TPA models were approved by the Commission: (1)…

Descriptors: Preservice Teachers, Performance Based Assessment, Models, Credentials

Validation of Rubric Evaluation for Programming Education

Peer reviewed

Saito, Daisuke; Yajima, Risei; Washizaki, Hironori; Fukazawa, Yoshiaki – Education Sciences, 2021

In evaluating the learning achievement of programming-thinking skills, the method of using a rubric that describes evaluation items and evaluation stages is widely employed. However, few studies have evaluated the reliability, validity, and consistency of the rubrics themselves. In this study, we introduced a statistical method for evaluating the…

Descriptors: Scoring Rubrics, Computer Science Education, Programming, Reliability

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Reliability of Teams' Game-Related Statistics in Basketball: Number of Games Required and Minimal Detectable Change

Peer reviewed

Pérez-Ferreirós, Alexandra; Kalén, Anton; Gómez, Miguel-Ángel; Rey, Ezequiel – Research Quarterly for Exercise and Sport, 2019

In basketball, game-related statistics are the most common measure of performance. However, the literature assessing their reliability is scarce. Purpose: Analyze the number of games required to obtain a good relative and absolute reliability of teams' game-related statistics. Method: A total of 884 games from the 2015-2016 to 2017-2018 seasons of…

Descriptors: Team Sports, Statistics, Reliability, Foreign Countries

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

An Analysis of the Impact of the Change in Scoring System on Home Field Advantage Soccer Leagues

Peer reviewed

Inan, Tugbay – Universal Journal of Educational Research, 2018

The impact of the scoring systems on home field advantage in the highest-level Turkish soccer division, Turkish Super League, between the 1959-1960 and 2016-2017 seasons, was aimed to be examined in this study. 2-point system was used in Turkish Soccer Leagues between 1959 and 1987. Since 1987-1988 season, the 3-point system has been started to be…

Descriptors: Foreign Countries, Team Sports, Athletics, Scoring

Source

ProQuest LLC	46
Online Submission	19
ETS Research Report Series	18
Educational and Psychological…	15
English Language Teaching	14
Language Testing	13
Journal of Educational…	11
Applied Measurement in…	8
Assessment & Evaluation in…	8
CBE - Life Sciences Education	8
Language Teaching Research	8
Journal of Experimental…	7
Grantee Submission	6
Language Assessment Quarterly	6
Psychometrika	6
International Journal of…	5
International Journal of…	5
Journal of Education and…	5
Journal of Educational…	5
Society for Research on…	5
Advances in Language and…	4
College Board	4
Creativity Research Journal	4
Discourse Processes: A…	4
Educational Testing Service	4

Author

Liu, Ou Lydia	4
Livingston, Samuel A.	4
Crossley, Scott A.	3
Frary, Robert B.	3
Lembke, Erica S.	3
Lord, Frederic M.	3
McNamara, Danielle S.	3
Puhan, Gautam	3
Wainer, Howard	3
Wang, Ze	3
Alexander, Patricia A.	2
Allen, Sandra	2
Attali, Yigal	2
Awada, Ghada M.	2
Bays, Cathy L.	2
Belur, Vinetha	2
Bodur, Yasar	2
Boldt, Robert F.	2
Braun, Henry I.	2
Buckenmeyer, Janet	2
Bulunuz, Mizrap	2
Bulunuz, Nermin	2
Chuang, Chi-ching	2
Cross, Lawrence H.	2

Publication Type

Reports - Research	547
Journal Articles	512
Tests/Questionnaires	79
Reports - Evaluative	56
Dissertations/Theses -…	47
Speeches/Meeting Papers	37
Reports - Descriptive	20
Numerical/Quantitative Data	10
Information Analyses	6
Guides - Non-Classroom	5
ERIC Digests in Full Text	4
ERIC Publications	4
Guides - Classroom - Learner	4
Guides - Classroom - Teacher	4
Collected Works - Proceedings	3
Books	2
Guides - General	2
Collected Works - General	1
Collected Works - Serials	1
Dissertations/Theses -…	1
Opinion Papers	1
Reference Materials -…	1
Reports -…	1

Assessments and Surveys

Test of English as a Foreign…	13
SAT (College Admission Test)	12
Graduate Record Examinations	8
National Assessment of…	7
Wechsler Intelligence Scale…	6
ACT Assessment	5
Florida Comprehensive…	3
Advanced Placement…	2
Early Childhood Environment…	2
Flesch Kincaid Grade Level…	2
International English…	2
Medical College Admission Test	2
Modern Language Aptitude Test	2
Torrance Tests of Creative…	2
Wechsler Individual…	2
ACT Interest Inventory	1
Bar Examinations	1
Beery Developmental Test of…	1
Beginning Postsecondary…	1
Bender Visual Motor Gestalt…	1
California Achievement Tests	1
California Critical Thinking…	1
College Board Achievement…	1
Comprehensive Tests of Basic…	1
Differential Aptitude Test	1