Showing 1 to 15 of 48 results
Peer reviewed
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Peer reviewed
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Peer reviewed
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Peer reviewed
Dumas, Denis; McNeish, Daniel; Greene, Jeffrey A. – Educational Psychologist, 2020
Scholars have lamented that current methods of assessing student performance do not align with contemporary views of learning as situated within students, contexts, and time. Here, we introduce and describe one theoretical--psychometric paradigm--termed "dynamic measurement"--designed to provide a valid representation of the way students…
Descriptors: Alternative Assessment, Psychometrics, Educational Psychology, Student Evaluation
Peer reviewed
Fitzpatrick, Tess; Clenton, Jon – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2017
This article offers a solution to a significant problem for teachers and researchers of language learning that confounds their interpretations and expectations of test data: The apparent simplicity of tests of vocabulary knowledge masks the complexity of the constructs they claim to measure. The authors first scrutinise task elements in two widely…
Descriptors: Language Tests, Vocabulary Development, Difficulty Level, Performance Factors
Peer reviewed
Berliner, David C. – Teachers College Record, 2015
Trying to understand PISA is analogous to the parable of the blind men and the elephant. There are many facets of the PISA program, and thus many ways to both applaud and critique this ambitious international program of assessment that has gained enormous importance in the crafting of contemporary educational policy. One of the facets discussed in…
Descriptors: Achievement Tests, Standardized Tests, Educational Assessment, Educational Indicators
Choi, Ick Kyu – ProQuest LLC, 2013
At the University of California, Los Angeles, the Test of Oral Proficiency (TOP), an internally developed oral proficiency test, is administered to international teaching assistant (ITA) candidates to ensure an appropriate level of academic oral English proficiency. Test taker performances are rated live by two raters according to four subscales.…
Descriptors: Screening Tests, Profiles, Oral Language, English
Peer reviewed
PDF on ERIC
Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012
Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…
Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation
Klesch, Heather S. – ProQuest LLC, 2010
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing
Peer reviewed
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Gray, B. Thomas – 1997
Validity is a critically important issue with far-reaching implications for testing. The history of conceptualizations of validity over the past 50 years is reviewed, and 3 important areas of controversy are examined. First, the question of whether the three traditionally recognized types of validity should be integrated as a unitary entity of…
Descriptors: Educational Testing, Evaluation Methods, Reliability, Scores
Russell, Michael – 2000
This Digest introduces the advantages and disadvantages of three commonly used methods of reporting test score changes: (1) change in percentile rank; (2) scale or raw score change; and (3) percent change. The change in percentile rank method focuses on the increase or decrease of the mean percentile ranking for a group of students. This method…
Descriptors: Achievement Gains, Change, Evaluation Methods, Scores
Braun, Henry I.; Mislevy, Robert J. – US Department of Education, 2004
Psychologist Andrea diSessa coined the term "phenomenological primitives", or p-prims, to talk about nonexperts' reasoning about physical situations. P-prims are primitive in the sense that they stand without significant explanatory substructure or explanation. Examples are "Heavy objects fall faster than light objects" and "Continuing force is…
Descriptors: Test Theory, Testing, Evaluation Methods, Scores
Harris, Deborah J. – 2003
Tests and assessments are generally administered to gather data to aid in decision making, either at an individual student level or at an aggregated level. In order to incorporate assessment data into informed decision making, test users need to understand the test results. This chapter highlights the types of test scores and test score…
Descriptors: Decision Making, Educational Assessment, Educational Testing, Evaluation Methods
Russell, Michael – 2000
An earlier Digest described the shortcomings of three methods commonly used to summarize changes in test scores. This Digest describes two less commonly used approaches for examining changes in test scores, those of Standardized Growth Estimates and Effect Sizes. Aspects of these two approaches are combined and applied to the Iowa Test of Basic…
Descriptors: Achievement Gains, Change, Effect Size, Evaluation Methods