ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	27

Publication Type

Reports - Descriptive	44
Journal Articles	28
Numerical/Quantitative Data	3
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Education	7
Elementary Secondary Education	4
High Schools	4
Middle Schools	4
Secondary Education	4
Early Childhood Education	3
Grade 3	3
Grade 4	3
Higher Education	3
Junior High Schools	3
Primary Education	3
Grade 5	2
Grade 6	2
Grade 7	2
Grade 9	2
Intermediate Grades	2
Postsecondary Education	2
Adult Basic Education	1
Adult Education	1
Grade 1	1
Grade 2	1
Grade 8	1
More ▼

Audience

Researchers	5
Practitioners	3
Administrators	2
Teachers	1

Location

Australia	1
Canada	1
China	1
United Kingdom	1
United Kingdom (England)	1
Vermont	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	2
General Educational…	2
Bracken Basic Concept Scale	1
College Level Examination…	1
Collegiate Assessment of…	1
Dynamic Indicators of Basic…	1
Graduate Management Admission…	1
International English…	1
Iowa Tests of Basic Skills	1
National Assessment of Adult…	1
North Carolina End of Course…	1
Preliminary Scholastic…	1
Program for International…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 44 results Save | Export

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Reliability. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.

Descriptors: Scores, Measurement, Test Reliability, Error Patterns

Responsibilities of Users of Standardized Tests (Rust-4E)

Peer reviewed

Direct link

Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022

In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…

Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics

Validity. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.

Descriptors: Screening Tests, Scores, Test Validity, Test Reliability

Conditional Precision of Measurement for Test Scores: Are Conditional Standard Errors Sufficient?

Peer reviewed

Direct link

Nicewander, W. Alan – Educational and Psychological Measurement, 2019

This inquiry is focused on three indicators of the precision of measurement--conditional on fixed values of ?, the latent variable of item response theory (IRT). The indicators that are compared are (1) The traditional, conditional standard errors, s(eX|?) = CSEM; (2) the IRT-based conditional standard errors, s[subscript irt](eX|?)=C[subscript…

Descriptors: Measurement, Accuracy, Scores, Error of Measurement

Valid and Reliable Assessments. CSAI Update

Download full text

Center on Standards and Assessments Implementation, 2018

Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…

Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias

Making Sense of Elementary School Reading Scores. Literacy Leadership Brief

Direct link

Fitzgerald, Jill; Shanahan, Timothy E. – International Literacy Association, 2020

Reading scores exist for a continuum of purposes, from informal assessment to formal standardized tests. This brief aims to answer the question: What matters most for elementary-grade teachers when thinking about reading scores, and what could policymakers do to help teachers? Three positions worth pursuing in this regard are shared: (1) every…

Descriptors: Reading Achievement, Scores, Elementary School Students, Elementary School Teachers

Revealing Hidden Talents: The Development, Use, and Benefit of VESPARCH

Peer reviewed

Direct link

Badger, Julia R.; Mellanby, Jane – British Journal of Educational Psychology, 2018

Background: School attainment tests and Cognitive Abilities Tests are used in the United Kingdom to set targets for educational outcome. Whilst these are good predictors, they depend not only on basic ability but also on learnt knowledge and skills, such as reading. Method and Aims: VESPARCH is an online group test of verbal and spatial reasoning,…

Descriptors: Foreign Countries, Intelligence Tests, Verbal Ability, Spatial Ability

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Stepping Outside the Normed Sample: Implications for Validity

Peer reviewed

Direct link

Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017

We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…

Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction

Designing and Assessing a Digital, Discipline-Specific Literacy Assessment Tool

Peer reviewed
PDF on ERIC

Download full text

Kebble, Paul Graham – The EUROCALL Review, 2016

The C-Test as a tool for assessing language competence has been in existence for nearly 40 years, having been designed by Professors Klein-Braley and Raatz for implementation in German and English. Much research has been conducted over the ensuing years, particularly in regards to reliability and construct validity, for which it is reported to…

Descriptors: Language Tests, Computer Software, Test Construction, Test Reliability

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

New Meridian Technical Report 2018-2019

Download full text

New Meridian Corporation, 2020

The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…

Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation

New Meridian Technical Report 2018-2019: Alternate Blueprint

Download full text

New Meridian Corporation, 2020

The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…

Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation

Test Theories, Educational Priorities and Reliability of Public Examinations in England

Peer reviewed

Direct link

Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013

Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…

Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3

Measurement and Evaluation in…	4
Assessment for Effective…	2
Diagnostique	2
Educational and Psychological…	2
National Center on Improving…	2
New Meridian Corporation	2
ACT, Inc.	1
Academic Medicine	1
Applied Measurement in…	1
Applied Psychological…	1
Black Issues in Higher…	1
British Journal of…	1
Center on Standards and…	1
ETS Research Report Series	1
Educational Measurement:…	1
GED Testing Service	1
International Journal of…	1
International Journal of…	1
International Literacy…	1
Language Testing	1
Language, Speech, and Hearing…	1
Multivariate Behavioral…	1
NASSP Bulletin	1
Northwest Evaluation…	1
Phi Delta Kappan	1
More ▼

Scores	44
Test Reliability	44
Test Validity	25
Evaluation Methods	11
Test Construction	10
Student Evaluation	9
Error of Measurement	8
Psychometrics	8
Elementary Secondary Education	7
Test Items	7
Testing	7
Item Response Theory	6
Scoring	6
Screening Tests	6
Test Bias	6
Test Interpretation	6
Test Results	6
Achievement Tests	5
Computer Assisted Testing	5
Foreign Countries	5
Higher Education	5
Item Analysis	5
Standardized Tests	5
Statistical Analysis	5
Achievement Gains	4
More ▼

Erford, Bradley T.	3
Hays, Danica G.	2
Pentimonti, J.	2
Petscher, Y.	2
Stanley, C.	2
Abedi, Jamal	1
Ault, Haley	1
Badger, Julia R.	1
Baird, Jo-Anne	1
Balkin, Richard S.	1
Bardhoshi, Gerta	1
Bardos, Achilles N.	1
Barrio Minton, Casey	1
Beddow, Peter A.	1
Biddison, Amanda R.	1
Black, Paul	1
Boller, Kimberly	1
Booker, Kevin	1
Brown, Jonathan R.	1
Bruch, Julie	1
Chenoweth, Karin	1
Cizek, Gregory J.	1
Crocker, Linda	1
Dallape, Aprille	1
More ▼