ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	41

Descriptor

Test Reliability	65
Scores	44
Test Validity	39
Test Construction	24
Scoring	15
Error of Measurement	14
Item Response Theory	13
Student Evaluation	13
Evaluation Methods	12
Cutting Scores	11
Elementary Secondary Education	11
Psychometrics	11
Testing	11
Achievement Tests	10
Test Items	10
Equated Scores	9
Scaling	9
Test Bias	9
Interrater Reliability	8
Standardized Tests	8
English	7
Foreign Countries	7
Language Tests	7
Screening Tests	7
Test Interpretation	7
More ▼

Publication Type

Reports - Descriptive	65
Journal Articles	38
Numerical/Quantitative Data	8
Tests/Questionnaires	3
Guides - Non-Classroom	2
Guides - General	1
Information Analyses	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	11
Elementary Secondary Education	9
Early Childhood Education	7
Middle Schools	7
Primary Education	7
Secondary Education	7
Grade 3	6
Grade 4	6
Junior High Schools	6
Grade 5	5
Grade 6	5
Grade 7	5
Intermediate Grades	5
Grade 8	4
High Schools	4
Higher Education	4
Postsecondary Education	3
Grade 9	2
Adult Basic Education	1
Adult Education	1
Grade 1	1
Grade 2	1
Kindergarten	1
More ▼

Audience

Researchers	5
Practitioners	3
Administrators	2
Policymakers	1
Teachers	1

Location

New York	4
Canada	2
New Mexico	2
Australia	1
China	1
Ireland (Dublin)	1
New York (New York)	1
Texas	1
United Kingdom	1
United Kingdom (England)	1
Vermont	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Showing 1 to 15 of 65 results Save | Export

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Reliability. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.

Descriptors: Scores, Measurement, Test Reliability, Error Patterns

Responsibilities of Users of Standardized Tests (Rust-4E)

Peer reviewed

Direct link

Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022

In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…

Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics

Validity. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.

Descriptors: Screening Tests, Scores, Test Validity, Test Reliability

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Direct link

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

Conditional Precision of Measurement for Test Scores: Are Conditional Standard Errors Sufficient?

Peer reviewed

Direct link

Nicewander, W. Alan – Educational and Psychological Measurement, 2019

This inquiry is focused on three indicators of the precision of measurement--conditional on fixed values of ?, the latent variable of item response theory (IRT). The indicators that are compared are (1) The traditional, conditional standard errors, s(eX|?) = CSEM; (2) the IRT-based conditional standard errors, s[subscript irt](eX|?)=C[subscript…

Descriptors: Measurement, Accuracy, Scores, Error of Measurement

The Early Development Instrument -- Creation of a Fine Motor/Visual Motor Index

Peer reviewed

Direct link

Skelton, Heather; Leclair, Leanne – Journal of Occupational Therapy, Schools & Early Intervention, 2019

Research suggests that kindergarten fine motor (FM) and visual motor (VM) skills predict later school performance. Being able to identify if gaps exist in FM/VM readiness could inform FM/VM programming in the early years. The Early Development Instrument is used to assess school readiness in Canada and other countries. Through a Delphi method, a…

Descriptors: Psychomotor Skills, Kindergarten, School Readiness, Foreign Countries

English MAP Reading Fluency Technical Report: Based on Assessments Administered during the 2020-2021 School Year

Download full text

NWEA, 2022

This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…

Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency

Valid and Reliable Assessments. CSAI Update

Download full text

Center on Standards and Assessments Implementation, 2018

Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…

Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias

Making Sense of Elementary School Reading Scores. Literacy Leadership Brief

Direct link

Fitzgerald, Jill; Shanahan, Timothy E. – International Literacy Association, 2020

Reading scores exist for a continuum of purposes, from informal assessment to formal standardized tests. This brief aims to answer the question: What matters most for elementary-grade teachers when thinking about reading scores, and what could policymakers do to help teachers? Three positions worth pursuing in this regard are shared: (1) every…

Descriptors: Reading Achievement, Scores, Elementary School Students, Elementary School Teachers

Revealing Hidden Talents: The Development, Use, and Benefit of VESPARCH

Peer reviewed

Direct link

Badger, Julia R.; Mellanby, Jane – British Journal of Educational Psychology, 2018

Background: School attainment tests and Cognitive Abilities Tests are used in the United Kingdom to set targets for educational outcome. Whilst these are good predictors, they depend not only on basic ability but also on learnt knowledge and skills, such as reading. Method and Aims: VESPARCH is an online group test of verbal and spatial reasoning,…

Descriptors: Foreign Countries, Intelligence Tests, Verbal Ability, Spatial Ability

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Stepping Outside the Normed Sample: Implications for Validity

Peer reviewed

Direct link

Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017

We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…

Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction

Assessments 101: A Policymaker's Guide to K-12 Assessments

Download full text

Woods, Julie – Education Commission of the States, 2017

Assessments come in many forms in part because they serve many purposes, and those purposes often vary by the stakeholders they support. Students, parents, teachers, and school, district and state leaders may all be end users of the information provided by various assessments. This brief supports state leaders' understanding of assessments by…

Descriptors: Elementary Secondary Education, Educational Assessment, Student Evaluation, Guides

Designing and Assessing a Digital, Discipline-Specific Literacy Assessment Tool

Peer reviewed
PDF on ERIC

Download full text

Kebble, Paul Graham – The EUROCALL Review, 2016

The C-Test as a tool for assessing language competence has been in existence for nearly 40 years, having been designed by Professors Klein-Braley and Raatz for implementation in German and English. Much research has been conducted over the ensuing years, particularly in regards to reliability and construct validity, for which it is reported to…

Descriptors: Language Tests, Computer Software, Test Construction, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Measurement and Evaluation in…	4
Diagnostique	3
Educational Measurement:…	3
Evaluation and the Health…	3
New York State Education…	3
Assessment for Effective…	2
Educational and Psychological…	2
National Center on Improving…	2
New Meridian Corporation	2
New Mexico Public Education…	2
ACT, Inc.	1
Academic Medicine	1
Applied Measurement in…	1
Applied Psychological…	1
Black Issues in Higher…	1
British Journal of…	1
Center on Standards and…	1
ETS Research Report Series	1
Education Commission of the…	1
GED Testing Service	1
International Journal of…	1
International Journal of…	1
International Literacy…	1
Journal of Autism and…	1
Journal of Occupational…	1
More ▼

Erford, Bradley T.	3
Hays, Danica G.	2
Pentimonti, J.	2
Petscher, Y.	2
Stanley, C.	2
Abedi, Jamal	1
Allalouf, Avi	1
Ault, Haley	1
Badger, Julia R.	1
Baird, Jo-Anne	1
Balkin, Richard S.	1
Bardhoshi, Gerta	1
Bardos, Achilles N.	1
Barrio Minton, Casey	1
Beddow, Peter A.	1
Biddison, Amanda R.	1
Black, Paul	1
Boller, Kimberly	1
Booker, Kevin	1
Brolin, Donn E.	1
Brown, Jonathan R.	1
Bruch, Julie	1
Bucher, Dale E.	1
Buxbaum, Joseph D.	1
More ▼

ACT Assessment	2
General Educational…	2
Iowa Tests of Basic Skills	2
SAT (College Admission Test)	2
Autism Diagnostic Observation…	1
Bracken Basic Concept Scale	1
College Level Examination…	1
Collegiate Assessment of…	1
Dynamic Indicators of Basic…	1
Graduate Management Admission…	1
International English…	1
Iowa Tests of Educational…	1
Measures of Academic Progress	1
National Assessment of Adult…	1
National Assessment of…	1
North Carolina End of Course…	1
Preliminary Scholastic…	1
Program for International…	1
Stanford Achievement Tests	1
Test of Standard Written…	1
Texas Essential Knowledge and…	1
More ▼