ERIC - Search Results

Showing all 12 results

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

The Construction and Validation of a Q-Matrix for Cognitive Diagnostic Analysis: The Case of the Reading Comprehension Section of the IAUEPT

Peer reviewed

Boori, Ali Akbar; Ghazanfari, Mohammad; Ghonsooly, Behzad; Baghaei, Purya – International Journal of Language Testing, 2023

Cognitive diagnostic models (CDMs) have received sustained attention in educational settings because they can be used to operationalize formative assessment to provide diagnostic feedback and inform instruction. A large number of CDMs have been developed over the past few years. An important component of all CDMs is a Q-matrix that specifies a…

Descriptors: Reading Comprehension, Reading Tests, English (Second Language), Islam

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

A Quantitative Analysis of TOEFL iBT Using an Interpretive Model of Test Validity

Peer reviewed

Esfandiari, Mohammad Reza; Riasati, Mohammad Javad; Vaezian, Helia; Rahimi, Forough – Language Testing in Asia, 2018

Background: Validity is a notable concept in language testing which has concerned many researchers and scholars in the field of language testing due to its importance in decision making process. Tests' results always introduce consequences to test takers' lives which emphasizes the need to ensure their validity. Detecting and delineating the…

Descriptors: Computer Assisted Testing, Test Validity, Language Tests, English (Second Language)

Development of a Tool to Assess Inference-Making and Reasoning in Biology

Peer reviewed

Cromley, Jennifer G.; Dai, Ting; Fechter, Tia; Nelson, Frank E.; Van Boekel, Martin; Du, Yang – Grantee Submission, 2021

Making inferences and reasoning with new scientific information is critical for successful performance in biology coursework. Thus, identifying students who are weak in these skills could allow the early provision of additional support and course placement recommendations to help students develop their reasoning abilities, leading to better…

Descriptors: Science Tests, Multiple Choice Tests, Logical Thinking, Inferences

Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

Peer reviewed

Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016

Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation

The Dimensionality of Inference Making: Are Local and Global Inferences Distinguishable?

Peer reviewed

Muijselaar, Marloes M. L. – Scientific Studies of Reading, 2018

We investigated the dimensionality of inference making in samples of 4- to 9-year-olds (Ns = 416-783) to determine if local and global coherence inferences could be distinguished. In addition, we examined the validity of our experimenter-developed inference measure by comparing with three additional measures of listening comprehension. Multitrait,…

Descriptors: Inferences, Thinking Skills, Young Children, Listening Comprehension

Using Corpus Linguistics to Examine the Extrapolation Inference in the Validity Argument for a High-Stakes Speaking Assessment

Peer reviewed

LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017

Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…

Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences

Validating Measures of Algebra Teacher Subject Matter Knowledge and Pedagogical Content Knowledge

Peer reviewed

Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – Educational Assessment, 2012

The purpose of this study was to validate inferences about scores of one task designed to measure subject matter knowledge and three tasks designed to measure aspects of pedagogical content knowledge. Evidence for the validity of inferences was based on two expectations. First, if tasks were sensitive to expertise, we would find group differences.…

Descriptors: Algebra, Mathematics Teachers, Teacher Characteristics, Knowledge Base for Teaching

Building Validity Evidence for Scores on a State-Wide Alternate Assessment: A Contrasting Groups, Multimethod Approach

Peer reviewed

Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007

The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…

Descriptors: Inferences, Disabilities, Rating Scales, Eligibility

Effects of Test Administrator Characteristics on Achievement Test Scores

Schafer, William D.; Papapolydorou, Maria; Rahman, Taslima; Parker, Lori – Online Submission, 2005

Possible relationships between five test examiner characteristics (gender, race, tenure, experience as a test administrator, and experience as a test developer or scorer) and six student achievement scores (reading, writing, language usage, mathematics, science, and social studies) were studied at the school level in a statewide assessment. The…

Descriptors: Intervals, Academic Achievement, Test Validity, Examiners

Validating the Ecological Assumption: The Relationship of Measure Scores to Classroom Teaching and Student Learning

Peer reviewed

Hill, Heather C.; Ball, Deborah Loewenberg; Blunk, Merrie; Goffney, Imani Masters; Rowan, Brian – Measurement: Interdisciplinary Research and Perspectives, 2007

This paper provides a summary of the authors' attempts to uncover links between their measures, classroom mathematics instruction, and student learning. This paper also provides evidence regarding one central critique of their measures: that multiple-choice assessments cannot validly represent the knowledge, skills, and judgment involved in actual…

Descriptors: Teacher Characteristics, Teaching Methods, Correlation, Mathematics Achievement
