Showing 1 to 15 of 114 results
Peer reviewed
Direct link
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Peer reviewed
Direct link
Benjamin R. Shear; Derek C. Briggs – Asia Pacific Education Review, 2024
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four…
Descriptors: Measurement Techniques, Inferences, COVID-19, Pandemics
Peer reviewed
Direct link
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Peer reviewed
Direct link
Hopster-den Otter, Dorien; Wools, Saskia; Eggen, Theo J. H. M.; Veldkamp, Bernard P. – Journal of Educational Measurement, 2019
In educational practice, test results are used for several purposes. However, validity research is especially focused on the validity of summative assessment. This article aimed to provide a general framework for validating formative assessment. The authors applied the argument-based approach to validation to the context of formative assessment.…
Descriptors: Formative Evaluation, Test Validity, Scores, Inferences
Peer reviewed
Direct link
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Peer reviewed
Direct link
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Peer reviewed
PDF on ERIC Download full text
Seyedeh Azadeh Ghiasian; Fatemeh Hemmati; Seyyed Mohammad Alavi; Afsar Rouhi – International Journal of Language Testing, 2025
A critical component of cognitive diagnostic models (CDMs) is a Q-matrix that stipulates associations between items of a test and their required attributes. The present study aims to develop and empirically validate a Q-matrix for the listening comprehension section of the International English Language Testing System (IELTS). To this end, a…
Descriptors: Test Items, Listening Comprehension Tests, English (Second Language), Language Tests
Maddox, Bryan – OECD Publishing, 2023
The digital transition in educational testing has introduced many new opportunities for technology to enhance large-scale assessments. These include the potential to collect and use log data on test-taker response processes routinely, and on a large scale. Process data has long been recognised as a valuable source of validation evidence in…
Descriptors: Measurement, Inferences, Test Reliability, Computer Assisted Testing
Peer reviewed
Direct link
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Peer reviewed
Direct link
Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Journal of Educational Measurement, 2021
Eye-tracking technology can create a record of the location and duration of visual fixations as a test-taker reads test questions. Although the cognitive process the test-taker is using cannot be directly observed, eye-tracking data can support inferences about these unobserved cognitive processes. This type of information has the potential to…
Descriptors: Eye Movements, Test Validity, Multiple Choice Tests, Cognitive Processes
Peer reviewed
Direct link
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Peer reviewed
Direct link
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Peer reviewed
PDF on ERIC Download full text
Rodríguez-Vásquez, Flor Monserrat; Ariza-Hernandez, Francisco J. – EURASIA Journal of Mathematics, Science and Technology Education, 2021
The evaluation of learning in mathematics is a worldwide problem; therefore, new methods are required to assess the understanding of mathematical concepts. In this paper, we propose to use Item Response Theory to analyze undergraduate students' level of understanding of the real function concept. The Bayesian approach was…
Descriptors: Bayesian Statistics, Mathematics Education, Item Response Theory, Undergraduate Students
Peer reviewed
Direct link
Jacobson, Erik; Svetina, Dubravka – Applied Measurement in Education, 2019
Contingent argument-based approaches to validity require a unique argument for each use, in contrast to more prescriptive approaches that identify the common kinds of validity evidence researchers should consider for every use. In this article, we evaluate our use of an approach that is both prescriptive and argument-based to develop a…
Descriptors: Test Validity, Test Items, Test Construction, Test Interpretation