ERIC - Search Results



Showing 1 to 15 of 71 results

Best Practices for Constructed-Response Scoring. Research Report. ETS RR-22-17

Peer reviewed

McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022

This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…

Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing

Establishing Survey Validity: A Practical Guide

Peer reviewed

Cobern, William W.; Adams, Betty A. J. – International Journal of Assessment Tools in Education, 2020

What follows is a practical guide for establishing the validity of a survey for research purposes. The motivation for providing this guide is our observation that researchers, not necessarily being survey researchers per se, but wanting to use a survey method, lack a concise resource on validity. There is far more to know about surveys and survey…

Descriptors: Surveys, Test Validity, Test Construction, Test Items

English MAP Reading Fluency Technical Report: Based on Assessments Administered during the 2020-2021 School Year

NWEA, 2022

This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…

Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency

L2 Speaking Assessment in Secondary School Classrooms in Japan

Peer reviewed

Koizumi, Rie – Language Assessment Quarterly, 2022

In Japanese secondary schools, speaking assessment in English classrooms is designed, conducted, and scored by teachers. Although the assessment is intended to be used for summative and formative purposes, it is not regularly or adequately practiced. This paper reports the problems (i.e., lack of continuous speaking assessment, limited speaking…

Descriptors: Secondary Schools, English (Second Language), Second Language Instruction, Language Tests

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

High-Stakes Examinations and Large-Scale Learning Assessments in Times of Emergencies and Crises. NEQMAP 2020 Thematic Review

Magno, Carlo – UNESCO Bangkok, 2020

The COVID-19 pandemic has disrupted education across the globe leading countries to adapt how they administer and manage high-stakes examinations and large-scale learning assessments. This thematic review describes the measures that countries have taken, in terms of policies and practices, when learning assessments are disrupted by emergencies and…

Descriptors: High Stakes Tests, COVID-19, Pandemics, Cross Cultural Studies

Five Things Not to Do in Developing Surveys for Assessment in Student Affairs. NASPA Research and Policy Institute Issue Brief

Sriram, Rishi – NASPA - Student Affairs Administrators in Higher Education, 2014

When student affairs professionals assess their work, they often employ some type of survey. The use of surveys stems from a desire to objectively measure outcomes, a demand from someone else (e.g., supervisor, accreditation committee) for data, or the feeling that numbers can provide an aura of competence. Although surveys are effective tools for…

Descriptors: Surveys, Test Construction, Student Personnel Services, Test Use

Adding Rigor to Classroom Assessment Techniques for Non-Traditional Adult Programs: A Lifecycle Improvement Approach

Peer reviewed

Thomas, Jason E.; Hornsey, Philip E. – Journal of Instructional Research, 2014

Formative Classroom Assessment Techniques (CAT) have been well-established instructional tools in higher education since their exposition in the late 1980s (Angelo & Cross, 1993). A large body of literature exists surrounding the strengths and weaknesses of formative CATs. Simpson-Beck (2011) suggested insufficient quantitative evidence exists…

Descriptors: Classroom Techniques, Nontraditional Education, Adult Education, Formative Evaluation

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Data Driven Decision Making in the Social Studies

Peer reviewed

Ediger, Marlow – Education, 2010

Data driven decision making emphasizes the importance of the teacher using objective sources of information in developing the social studies curriculum. Too frequently, decisions of teachers have been made based on routine and outdated methods of teaching. Valid and reliable tests used to secure results from pupil learning make for better…

Descriptors: Data, Decision Making, Social Studies, Standardized Tests

Reliability: A Rasch Perspective

Peer reviewed

Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007

Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…

Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory

A Method for Estimating Classification Consistency Indices for Two Equated Forms

Peer reviewed

Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007

Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…

Descriptors: Classification, Reliability, Indexes, Computation

Detecting Intrajudge Inconsistency in Standard Setting Using Test Items with a Selected-Response Format. Research Report.

van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000

In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…

Descriptors: Interrater Reliability, Judges, Probability, Standard Setting

Sacrificing Reliability and Exalting Sampling Error at the Altar of Parsimony: Some Cautions Concerning Short-Form Test Development.

Henson, Robin K. – 2000

The purpose of this paper is to highlight some psychometric cautions that should be observed when seeking to develop short form versions of tests. Several points are made: (1) score reliability is impacted directly by the characteristics of the sample and testing conditions; (2) sampling error has a direct influence on reliability and factor…

Descriptors: Factor Structure, Psychometrics, Reliability, Sampling

A Mexican Version of the Peabody Picture Vocabulary Test.

Peer reviewed

Simon, Alan J.; Joiner, Lee M. – Journal of Educational Measurement, 1976

The purpose of this study was to determine whether a Mexican version of the Peabody Picture Vocabulary Test could be improved by directly translating both forms of the American test, then using decision procedures to select the better item of each pair. The reliability of the simple translations suffered. (Author/BW)

Descriptors: Early Childhood Education, Spanish, Test Construction, Test Format

