NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)2
Since 2007 (last 20 years)9
Audience
Location
Canada1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 13 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Esfandiari, Mohammad Reza; Riasati, Mohammad Javad; Vaezian, Helia; Rahimi, Forough – Language Testing in Asia, 2018
Background: Validity is a notable concept in language testing which has concerned many researchers and scholars in the field of language testing due to its importance in decision making process. Tests' results always introduce consequences to test takers' lives which emphasizes the need to ensure their validity. Detecting and delineating the…
Descriptors: Computer Assisted Testing, Test Validity, Language Tests, English (Second Language)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bejar, Isaac I.; Deane, Paul D.; Flor, Michael; Chen, Jing – ETS Research Report Series, 2017
The report is the first systematic evaluation of the sentence equivalence item type introduced by the "GRE"® revised General Test. We adopt a validity framework to guide our investigation based on Kane's approach to validation whereby a hierarchy of inferences that should be documented to support score meaning and interpretation is…
Descriptors: College Entrance Examinations, Graduate Study, Generalization, Inferences
Peer reviewed Peer reviewed
Direct linkDirect link
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
Peer reviewed Peer reviewed
Direct linkDirect link
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Hendrickson, Amy; Huff, Kristen; Luecht, Richard – Applied Measurement in Education, 2010
Evidence-centered assessment design (ECD) explicates a transparent evidentiary argument to warrant the inferences we make from student test performance. This article describes how the vehicles for gathering student evidence--task models and test specifications--are developed. Task models, which are the basis for item development, flow directly…
Descriptors: Evidence, Test Construction, Measurement, Classification
Rodriguez Jaime, Luis Francisco – ProQuest LLC, 2013
Little is known about students' perceptions of online enrollment processes. Student satisfaction is part of the assessment required for accreditation, but evidence suggests that college administrators are oriented to retention and graduation rates rather than to consumer perception. The purpose of this descriptive quantitative study was to develop…
Descriptors: Enrollment Trends, Enrollment Influences, Higher Education, Student Attitudes
Peer reviewed Peer reviewed
Direct linkDirect link
Graham, Aislin R.; Sherry, Simon B.; Stewart, Sherry H.; Sherry, Dayna L.; McGrath, Daniel S.; Fossum, Kristin M.; Allen, Stephanie L. – Journal of Counseling Psychology, 2010
Perfectionistic concerns (i.e., negative reactions to failures, concerns over others' criticism and expectations, and nagging self-doubts) are a putative risk factor for depressive symptoms. This study proposes and supports the existential model of perfectionism and depressive symptoms (EMPDS), a conceptual model aimed at explaining why…
Descriptors: Foreign Countries, Risk, Depression (Psychology), Models
Peer reviewed Peer reviewed
Direct linkDirect link
Goldschmidt, Pete; Martinez, Jose Felipe; Niemi, David; Baker, Eva L. – Educational Assessment, 2007
In this article we examine empirical evidence on the criterion, predictive, transfer, and fairness aspects of validity of a large-scale language arts performance assessment, referred to as the Performance Assignment (PA). We use multilevel models to avoid biased inferences that might result from the naturally nested data. Specifically, we examine…
Descriptors: Language Arts, Performance Based Assessment, Academic Achievement, Performance Tests
Kane, Michael T. – 1990
The literature on validity provides much more guidance on how to collect various kinds of validity evidence than it does on which kinds of evidence to collect in specific cases. An argument-based approach to validation redresses the balance by linking the kinds of evidence needed to validate a test-score interpretation to the details of the…
Descriptors: Evaluation Methods, Formative Evaluation, Inferences, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Wise, Steven L.; DeMars, Christine E. – Journal of Educational Measurement, 2006
The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account differing levels of examinee effort. This article…
Descriptors: Guessing (Tests), Psychometrics, Inferences, Reaction Time
Sugrue, Brenda – 1993
This report describes a methodology for increasing the validity and reliability of inferences made about the problem-solving ability of science students that is based on performance on different kinds of tests. The generalizable cognitive components of problem solving that might be targeted by assessment are described, and specifications are…
Descriptors: Chemistry, Educational Assessment, Inferences, Models
Peer reviewed Peer reviewed
Kane, Michael T. – Evaluation and the Health Professions, 1992
A proposed model for the validity of measures of professional competence treats validation as the evaluation of inferences drawn from test scores, focusing on evaluation, generalization, and extrapolation. The model is used to indicate strengths and weaknesses of assessments of professional competence: observations of performance, simulations, and…
Descriptors: Competence, Evaluation Methods, Generalization, Inferences