ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	27

Descriptor

Scores	31
Inferences	25
Validity	8
Language Tests	6
Statistical Inference	6
Testing	6
Comparative Analysis	5
Test Items	5
Test Validity	5
Academic Achievement	4
Accountability	4
English (Second Language)	4
Intervention	4
Models	4
Pretests Posttests	4
Regression (Statistics)	4
Second Language Learning	4
Statistical Analysis	4
Test Use	4
Barriers	3
Construct Validity	3
Control Groups	3
Correlation	3
Definitions	3
Educational Testing	3
More ▼

Publication Type

Reports - Descriptive	31
Journal Articles	26
Books	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	4
Elementary Education	2
Grade 5	1
Higher Education	1

Audience

Researchers

Location

Australia	1
Colorado (Boulder)	1
Oregon	1
South Korea	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Race to the Top	1

Assessments and Surveys

Test of English as a Foreign…	2
ACTFL Oral Proficiency…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

The Alternative Factors Leading to Replication Crisis: Prediction and Evaluation

Peer reviewed

Direct link

Gregory Chernov – Evaluation Review, 2025

Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…

Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure

Missing Data: An Update on the State of the Art

Peer reviewed
PDF on ERIC

Download full text

Direct link

Craig K. Enders – Grantee Submission, 2023

The year 2022 is the 20th anniversary of Joseph Schafer and John Graham's paper titled "Missing data: Our view of the state of the art," currently the most highly cited paper in the history of "Psychological Methods." Much has changed since 2002, as missing data methodologies have continually evolved and improved; the range of…

Descriptors: Data, Research, Theories, Regression (Statistics)

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed
PDF on ERIC

Download full text

Direct link

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

Maintaining Access to a Large-Scale Test of Academic Language Proficiency during the Pandemic: The Launch of TOEFL iBT Home Edition

Peer reviewed

Direct link

Papageorgiou, Spiros; Manna, Venessa F. – Language Assessment Quarterly, 2021

The TOEFL iBT test was introduced in 2005 to better reflect the language demands of real-life academic tasks than did previous versions of the test. The task-based design of the test was intended to support the interpretation of its scores as a trustworthy measure of international students' ability to use English in an academic environment. Until…

Descriptors: Academic Language, COVID-19, Pandemics, Scores

Development and Evaluation of Assessments for Counseling Professionals

Peer reviewed

Direct link

Lenz, A. Stephen; Wester, Kelly L. – Measurement and Evaluation in Counseling and Development, 2017

It is imperative that counselors understand how to critically evaluate assessments before using them to make clinical decisions. This evaluation can be conducted through integrating the 5 sources of validity. Each source of validity is discussed, along with methods to appraise psychometric quality, throughout this special issue.

Descriptors: Counseling Techniques, Educational Assessment, Psychological Evaluation, Clinical Diagnosis

Propensity Score Analysis Statistical Methods and Applications. Second Edition. Advanced Quantitative Techniques in the Social Sciences. Volume 11

Direct link

Guo, Shenyang; Fraser, Mark W. – SAGE Publications Ltd (CA), 2014

Fully updated to reflect the most recent changes in the field, the Second Edition of "Propensity Score Analysis" provides an accessible, systematic review of the origins, history, and statistical foundations of propensity score analysis, illustrating how it can be used for solving evaluation and causal-inference problems. With a strong…

Descriptors: Probability, Scores, Statistical Analysis, Causal Models

Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

Peer reviewed

Direct link

Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016

Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation

Learning What Works in Sensory Disabilities: Establishing Causal Inference

Peer reviewed
PDF on ERIC

Download full text

Cooney, John B.; Young, John, III; Luckner, John L.; Ferrell, Kay Alicyn – Journal of Visual Impairment & Blindness, 2015

This article is intended to assist teachers and researchers in designing studies that examine the efficacy of a particular intervention or strategy with students with sensory disabilities. Ten research designs that can establish causal inference (the ability to attribute any effects to the intervention) with and without randomization are discussed.

Descriptors: Intervention, Sensory Integration, Disabilities, Inferences

Rater Cognition: Implications for Validity

Peer reviewed

Direct link

Bejar, Issac I. – Educational Measurement: Issues and Practice, 2012

The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…

Descriptors: Scores, Inferences, Validity, Scoring

Teacher Value Added as a Measure of Program Quality: Interpret with Caution

Peer reviewed

Direct link

Floden, Robert E. – Journal of Teacher Education, 2012

Many states now possess the data and statistical methods that can produce teacher value-added scores and link them to preparation programs. It is important to understand the limitations of these measures and the inferences that they do and do not support. These limitations fall into three categories. First, value-added measures (VAM) provide…

Descriptors: Outcome Measures, Educational Quality, Graduates, Program Content

Regression Discontinuity Design in Gifted and Talented Education Research

Peer reviewed

Direct link

Matthews, Michael S.; Peters, Scott J.; Housand, Angela M. – Gifted Child Quarterly, 2012

This Methodological Brief introduces the reader to the regression discontinuity design (RDD), which is a method that when used correctly can yield estimates of research treatment effects that are equivalent to those obtained through randomized control trials and can therefore be used to infer causality. However, RDD does not require the random…

Descriptors: Control Groups, Gifted, Talent, Intervention

Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

Peer reviewed

Direct link

Kane, Michael – Language Testing, 2012

The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

Descriptors: Testing, Language Tests, Inferences, Test Validity

Clarifying the Consensus Definition of Validity

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012

The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…

Descriptors: Evidence, Validity, Educational Testing, Risk

Language Textbook Selection: Using Materials Analysis from the Perspective of SLA Principles

Peer reviewed

Direct link

Guilloteaux, Marie J. – Asia-Pacific Education Researcher, 2013

This paper outlines a procedure for language textbook analysis from the perspective of second language acquisition (SLA) principles as a preliminary procedure to evaluation for selection. The aim is to provide a tool that allows comparison of the potential of textbooks for supporting students' language learning. To this end, ten general principles…

Descriptors: Textbook Selection, English (Second Language), Second Language Learning, Second Language Instruction

The Role of Vocabulary Size in Predicting Performance on TOEFL Reading Item Types

Peer reviewed

Direct link

Alavi, Seyyed Mohammad; Akbarian, Is'haaq – System: An International Journal of Educational Technology and Applied Linguistics, 2012

This study aims to examine a) whether vocabulary knowledge, captured in the Vocabulary Levels Test (VLT), is related to the performance on the five types of reading comprehension items tested in TOEFL, i.e., Guessing Vocabulary, Main Idea, Inference, Reference, and Stated Detail; and b) whether EFL learners with different levels of vocabulary…

Descriptors: Knowledge Level, Test Items, English (Second Language), Reading Comprehension

Previous Page | Next Page »

Pages: 1 | 2 | 3

Language Assessment Quarterly	4
International Journal of…	2
Journal of Educational and…	2
Asia-Pacific Education…	1
Assessment in Education:…	1
Council of Chief State School…	1
Educational Measurement:…	1
Educational Researcher	1
Educational and Psychological…	1
Evaluation Review	1
Gifted Child Quarterly	1
Grantee Submission	1
Journal of Educational…	1
Journal of Research in…	1
Journal of Statistics…	1
Journal of Teacher Education	1
Journal of Visual Impairment…	1
Language Testing	1
Measurement and Evaluation in…	1
Measurement:…	1
NASSP Bulletin	1
Partnership for Assessment of…	1
SAGE Publications Ltd (CA)	1
Structural Equation Modeling:…	1
System: An International…	1
More ▼

Akbarian, Is'haaq	1
Alavi, Seyyed Mohammad	1
Bachman, Lyle F.	1
Beddow, Peter A.	1
Bejar, Issac I.	1
Blackburn, Marcy	1
Briggs, Derek C.	1
Cizek, Gregory J.	1
Coffman, Donna L.	1
Cooney, John B.	1
Craig K. Enders	1
Doorey, Nancy A.	1
Ferrell, Kay Alicyn	1
Floden, Robert E.	1
Forbes, Sean	1
Fox, Janna	1
Fraser, Mark W.	1
Giamellaro, Michael	1
Graham, Suzanne E.	1
Gregory Chernov	1
Guilloteaux, Marie J.	1
Guo, Shenyang	1
Housand, Angela M.	1
Kane, Michael	1
Kane, Michael T.	1
More ▼