Angela G. Cooley – ProQuest LLC, 2024
Educators rely on a conglomeration of assessment instruments to appraise, measure, monitor, and document evidence of students' academic readiness, learning progressions, acquisition of interdisciplinary knowledge, and extensive educational needs. The Mississippi Academic Assessment Program for Mathematics (MAAP-M) and the i-Ready Assessment…
Descriptors: Middle Schools, Grade 6, Grade 7, Grade 8
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023
Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…
Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines
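The parallel-analysis procedure this entry refers to retains a dimension whenever its observed eigenvalue exceeds the corresponding mean eigenvalue from random data of the same shape. A minimal sketch of the traditional (Horn) version, using hypothetical simulated data rather than anything from the study:

```python
import numpy as np

def parallel_analysis(data, n_sims=100, seed=0):
    """Horn's parallel analysis: count components whose observed
    eigenvalues exceed the mean eigenvalues of same-shaped random data."""
    rng = np.random.default_rng(seed)
    n, p = data.shape
    # eigvalsh returns ascending order; reverse to descending
    obs_eig = np.linalg.eigvalsh(np.corrcoef(data, rowvar=False))[::-1]
    rand_eig = np.zeros((n_sims, p))
    for i in range(n_sims):
        rand = rng.standard_normal((n, p))
        rand_eig[i] = np.linalg.eigvalsh(np.corrcoef(rand, rowvar=False))[::-1]
    threshold = rand_eig.mean(axis=0)
    return int(np.sum(obs_eig > threshold))

# Illustration: six items loading on two uncorrelated factors
rng = np.random.default_rng(1)
f = rng.standard_normal((500, 2))
loadings = np.zeros((2, 6))
loadings[0, :3] = 1.0
loadings[1, 3:] = 1.0
items = f @ loadings + 0.5 * rng.standard_normal((500, 6))
print(parallel_analysis(items))  # -> 2 for this two-factor data
```

The abstract's point is that this factor-analytic heuristic may behave differently when the data-generating model is an IRT model; the sketch shows only the classical linear version.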
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of the α, λ2, λ4, λ2, ωT, GLB(MRFA), and GLB(Algebraic) coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
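Two of the coefficients compared in this entry have simple closed forms from the item covariance matrix: Cronbach's α and Guttman's λ2 (which can never fall below α). A minimal sketch with simulated parallel items (function names and data are illustrative, not the authors' code):

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha from an (examinees x items) score matrix."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)
    total_var = scores.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

def guttman_lambda2(scores):
    """Guttman's lambda-2: lambda-1 plus a term built from the
    squared off-diagonal covariances; always >= alpha."""
    k = scores.shape[1]
    c = np.cov(scores, rowvar=False)
    off = c - np.diag(np.diag(c))      # off-diagonal covariances only
    total_var = c.sum()                # variance of the total score
    return (off.sum() + np.sqrt(k / (k - 1) * (off ** 2).sum())) / total_var

# Parallel items: common true score plus independent noise
rng = np.random.default_rng(0)
true = rng.standard_normal((1000, 1))
scores = true + rng.standard_normal((1000, 5))
a, l2 = cronbach_alpha(scores), guttman_lambda2(scores)
print(round(a, 2), round(l2, 2))
```

For parallel items with inter-item correlation 0.5 and k = 5, both coefficients land near the Spearman-Brown value of about 0.83, with λ2 never below α.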
Gioia, Anthony R.; Ahmed, Yusra; Woods, Steven P.; Cirino, Paul T. – Reading and Writing: An Interdisciplinary Journal, 2023
There is significant overlap between reading and writing, but no known standardized measure assesses these jointly. The goal of the present study is to evaluate the properties of a novel measure, the Assessment of Writing, Self-Monitoring, and Reading (AWSM Reader), that simultaneously evaluates both reading comprehension and writing. In doing so,…
Descriptors: Reading Writing Relationship, Writing Evaluation, Self Evaluation (Individuals), Executive Function
Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022
Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings only utilize half of the parameters estimated by the model.…
Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests
Eun, Barohny; Knotek, Steven E. – Research in Education, 2022
A Vygotskian approach to assessment is proposed by invoking the distinction between the development of lower and higher psychological functions. Higher psychological functions are specifically human and develop with the use of cultural tools via mediation. Accordingly, a distinction is made between tests that are based on association, which have…
Descriptors: Evaluation Methods, Sociocultural Patterns, Psychological Patterns, Teaching Methods
D'Urso, E. Damiano; Tijmstra, Jesper; Vermunt, Jeroen K.; De Roover, Kim – Educational and Psychological Measurement, 2023
Assessing the measurement model (MM) of self-report scales is crucial to obtain valid measurements of individuals' latent psychological constructs. This entails evaluating the number of measured constructs and determining which construct is measured by which item. Exploratory factor analysis (EFA) is the most-used method to evaluate these…
Descriptors: Factor Analysis, Measurement Techniques, Self Evaluation (Individuals), Psychological Patterns
Dimova, Slobodanka – Language Teaching Research Quarterly, 2022
Drawing on Glenn Fulcher's extensive work in performance-based language assessment of speaking, this paper explores the assessment of L2 speaking ability in local language testing contexts. For that purpose, I review Fulcher's influential work that highlights the relationship between the speaking construct, the task, the performance, and the…
Descriptors: Language Tests, Speech Communication, Performance Based Assessment, Second Language Learning
Williamson, Joanna – Research Matters, 2022
Providing evidence that can inform awarding is an important application of Comparative Judgement (CJ) methods in high-stakes qualifications. The process of marking scripts is not changed, but CJ methods can assist in the maintenance of standards from one series to another by informing decisions about where to place grade boundaries or cut scores.…
Descriptors: Standards, Grading, Decision Making, Comparative Analysis
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
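Comparative Judgement exercises like those in the two entries above are commonly scaled with a Bradley–Terry model, which recovers a latent quality value for each script from the pairwise judgements. A minimal sketch using Hunter's MM iteration on a made-up win matrix (not data from either study):

```python
import numpy as np

def bradley_terry(wins, n_iter=200):
    """Fit Bradley-Terry strengths from a pairwise win-count matrix
    via an MM iteration. wins[i, j] = times script i beat script j."""
    k = wins.shape[0]
    w = wins.sum(axis=1)               # total wins per script
    n = wins + wins.T                  # comparisons per pair
    p = np.ones(k)
    for _ in range(n_iter):
        denom = n / (p[:, None] + p[None, :])
        np.fill_diagonal(denom, 0.0)
        p = w / denom.sum(axis=1)
        p /= p.sum()                   # fix the arbitrary scale
    return p

# Three scripts: A beats B and C consistently, B edges C
wins = np.array([
    [0, 4, 5],
    [1, 0, 3],
    [0, 2, 0],
])
strengths = bradley_terry(wins)
print(np.argsort(-strengths))          # ranking, strongest first
```

Placing scripts from different examination sessions into one such fitted scale is what lets CJ results inform grade-boundary decisions across series.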
Henrique Mohallem Paiva; Flávia Maria Santoro; Victor Takashi Hayashi; Bianca Cassemiro Lima – IEEE Transactions on Education, 2025
Contribution: This article analyzes student assessment within a computing faculty employing a full project-based learning (PBL) approach. Examining 2078 final grades across 60 classes and periods, the study reveals a significant correlation between graded self-studies, exams, and projects. This result contributes to understanding the reliability…
Descriptors: Student Evaluation, Computer Science Education, College Faculty, Correlation
Malec, Wojciech; Krzeminska-Adamek, Malgorzata – Practical Assessment, Research & Evaluation, 2020
The main objective of the article is to compare several methods of evaluating multiple-choice options through classical item analysis. The methods subjected to examination include the tabulation of choice distribution, the interpretation of trace lines, the point-biserial correlation, the categorical analysis of trace lines, and the investigation…
Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Item Analysis
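One of the classical item-analysis methods this entry compares, the point-biserial correlation, is just Pearson's r between a dichotomous item score and a criterion such as the corrected (rest) total. A minimal sketch on toy response data (the data and function name are illustrative):

```python
import numpy as np

def point_biserial(item, total):
    """Point-biserial correlation between a 0/1 item and a total
    score: (M1 - M0) * sqrt(p*q) / sd(total), using population sds.
    Algebraically identical to Pearson's r for dichotomous data."""
    item = np.asarray(item, dtype=float)
    total = np.asarray(total, dtype=float)
    p = item.mean()
    m1 = total[item == 1].mean()       # mean total among correct
    m0 = total[item == 0].mean()       # mean total among incorrect
    return (m1 - m0) * np.sqrt(p * (1 - p)) / total.std()

# Six examinees x four items; discrimination vs. the rest-score
responses = np.array([
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
])
for j in range(responses.shape[1]):
    rest = responses.sum(axis=1) - responses[:, j]  # corrected total
    print(j, round(point_biserial(responses[:, j], rest), 2))
```

Subtracting the item from the total before correlating avoids the part-whole inflation that a raw-total point-biserial suffers, especially on short tests like this toy example.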
Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019
Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…
Descriptors: Value Added Models, Scores, Sample Size, Correlation
Cai, Yuyang; Chen, Huilin – Language Assessment Quarterly, 2022
Thinking skills play a critical role in determining language performance. Recent advancement in cognitive diagnostic modelling (CDM) provides a powerful tool for obtaining fine-grained information regarding these thinking skills during reading. Studies are scant, however, exploring the relations between thinking skills and language performance,…
Descriptors: Evaluation Methods, Language Proficiency, Second Language Learning, Reading Processes