ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	6

Descriptor

Educational Testing	12
Test Theory	12
Educational Assessment	4
Measurement Techniques	4
Psychometrics	4
Scores	4
Academic Achievement	3
Error of Measurement	3
Test Construction	3
Testing Problems	3
College Entrance Examinations	2
Comparative Analysis	2
Construct Validity	2
Correlation	2
Criterion Referenced Tests	2
Educational History	2
Educational Policy	2
Educational Research	2
Generalizability Theory	2
Inferences	2
Item Response Theory	2
Mathematical Models	2
Measurement	2
Methods	2
Models	2
More ▼

Source

Educational Measurement:…	2
Evaluation in Education:…	2
ACT, Inc.	1
Contemporary Educational…	1
Educational Research and…	1
National Center for Analysis…	1
Online Submission	1
Review of Research in…	1

Publication Type

Reports - Evaluative	12
Journal Articles	8
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	4
Higher Education	2
High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

New York	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

ACT Assessment	1
Advanced Placement…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Test Affordances or Test Function? Did We Get Messick's Message Right?

Download full text

Salmani Nodoushan, Mohammad Ali – Online Submission, 2021

This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…

Descriptors: Construct Validity, Test Theory, Test Use, Affordances

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Direct link

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

Using the Theory of Successful Intelligence as a Framework for Developing Assessments in AP Physics

Peer reviewed

Direct link

Stemler, Steven E.; Sternberg, Robert J.; Grigorenko, Elena L.; Jarvin, Linda; Sharpes, Kirsten – Contemporary Educational Psychology, 2009

A new test of Advanced Placement Physics, explicitly designed to balance both content and cognitive-processing skills, was developed using Sternberg's theory of successful intelligence. The test was administered to 281 AP Physics students from 10 schools during the 2006-2007 school year. Six empirically distinguishable profiles of strengths and…

Descriptors: Science Tests, Intelligence, Advanced Placement, Ethnic Groups

A Perspective on the History of Generalizability Theory.

Peer reviewed

Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997

The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)

Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Test Theory Reconceived.

Download full text

Mislevy, Robert J. – 1995

Educational test theory consists of statistical and methodological tools to support inferences about examinees' knowledge, skills, and accomplishments. The evolution of test theory has been shaped by the nature of users' inferences which, until recently, have been framed almost exclusively in terms of trait and behavioral psychology. Progress in…

Descriptors: Cognitive Psychology, Developmental Psychology, Educational Testing, Inferences

Selecting Items for Criterion-Referenced Tests.

Mellenbergh, Gideon J.; van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)

Descriptors: Criterion Referenced Tests, Educational Testing, Item Analysis, Latent Trait Theory

Bayes Nets in Educational Assessment: Where Do the Numbers Come from? CSE Technical Report.

Download full text

Mislevy, Robert J.; Almond, Russell G.; Yan, Duanli; Steinberg, Linda S. – 2000

Educational assessments that exploit advances in technology and cognitive psychology can produce observations and pose student models that outstrip familiar test-theoretic models and analytic methods. Bayesian inference networks (BINs), which include familiar models and techniques as special cases, can be used to manage belief about students'…

Descriptors: Bayesian Statistics, Educational Assessment, Educational Technology, Educational Testing

Passing Score and Length of a Mastery Test.

van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)

Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Measuring Effect Sizes: The Effect of Measurement Error. Working Paper 19

Download full text

Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008

Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Mislevy, Robert J.	2
van der Linden, Wim J.	2
Almond, Russell G.	1
Bos, Wilfried	1
Boyd, Donald	1
Brennan, Robert L.	1
Cui, Zhongmin	1
Fang, Yu	1
Goy, Martin	1
Grigorenko, Elena L.	1
Grossman, Pamela	1
Jarvin, Linda	1
Lankford, Hamilton	1
Loeb, Susanna	1
Mellenbergh, Gideon J.	1
Salmani Nodoushan, Mohammad…	1
Sharpes, Kirsten	1
Steinberg, Linda S.	1
Stemler, Steven E.	1
Sternberg, Robert J.	1
Traub, Ross E.	1
Traynor, Anne	1
Wendt, Heike	1
Wiliam, Dylan	1
Woodruff, David	1
More ▼