ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	50

Descriptor

Scores	98
Test Validity	98
Testing	98
Test Reliability	40
Test Construction	27
Test Interpretation	21
Standardized Tests	18
Language Tests	17
Scoring	16
Achievement Tests	15
Test Bias	15
Test Items	13
Academic Achievement	12
English (Second Language)	11
Foreign Countries	11
Test Use	11
Comparative Analysis	10
Elementary Secondary Education	10
Item Analysis	10
Second Language Learning	10
Statistical Analysis	10
Student Evaluation	10
Testing Problems	10
Evaluation Methods	9
High Stakes Tests	9

Publication Type

Journal Articles	58
Reports - Research	35
Reports - Evaluative	19
Opinion Papers	16
Reports - Descriptive	12
Numerical/Quantitative Data	8
Speeches/Meeting Papers	8
Guides - Non-Classroom	6
Tests/Questionnaires	5
Information Analyses	4
Collected Works - Serials	1
ERIC Publications	1
Guides - Classroom - Teacher	1
Reference Materials - General	1
Reports - General	1

Education Level

Higher Education	10
Elementary Education	8
Postsecondary Education	7
Secondary Education	7
Grade 5	6
Grade 3	5
High Schools	5
Early Childhood Education	4
Grade 4	4
Grade 6	4
Grade 7	4
Grade 9	4
Intermediate Grades	4
Junior High Schools	4
Middle Schools	4
Primary Education	4
Elementary Secondary Education	3
Grade 8	3
Grade 10	2
Grade 11	2
Grade 12	2
Adult Basic Education	1
Adult Education	1

Audience

Practitioners	4
Researchers	4
Teachers	3
Community	1
Parents	1
Policymakers	1

Location

United Kingdom	2
Canada	1
China	1
Cyprus	1
Georgia (Atlanta)	1
Iran	1
Israel	1
Massachusetts	1
Nebraska	1
Sweden	1
Turkey	1
United Kingdom (England)	1
United Kingdom (London)	1
United States	1
Utah	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Elementary and Secondary…	1
Every Student Succeeds Act…	1

Showing 1 to 15 of 98 results

Item Response Theory Models for Difference-in-Difference Estimates (And Whether They Are Worth the Trouble)

Peer reviewed

Soland, James – Journal of Research on Educational Effectiveness, 2024

When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…

Descriptors: Item Response Theory, Testing, Test Validity, Intervention

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Hu, Shun-Fu; Wu, Amery D.; Stone, Jake – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

How Administration Stakes and Settings Affect Student Behavior and Performance on a Biology Concept Assessment

Peer reviewed

Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023

Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…

Descriptors: Biology, Science Tests, High Stakes Tests, Scores

Impact of Superscoring on Subgroup Differences. Issue Brief

Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019

When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use policies better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…

Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods

Making Sense of Elementary School Reading Scores. Literacy Leadership Brief

Fitzgerald, Jill; Shanahan, Timothy E. – International Literacy Association, 2020

Reading scores exist for a continuum of purposes, from informal assessment to formal standardized tests. This brief aims to answer the question: What matters most for elementary-grade teachers when thinking about reading scores, and what could policymakers do to help teachers? Three positions worth pursuing in this regard are shared: (1) every…

Descriptors: Reading Achievement, Scores, Elementary School Students, Elementary School Teachers

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Test Review: Current Options in At-Home Language Proficiency Tests for Making High-Stakes Decisions

Peer reviewed

Isbell, Daniel R.; Kremmel, Benjamin – Language Testing, 2020

Administration of high-stakes language proficiency tests has been disrupted in many parts of the world as a result of the 2019 novel coronavirus pandemic. Institutions that rely on test scores have been forced to adapt, and in many cases this means using scores from a different test, or a new online version of an existing test, that can be taken…

Descriptors: Language Tests, High Stakes Tests, Language Proficiency, Second Language Learning

Tests, Test Scores, and Constructs

Peer reviewed

Haertel, Edward H. – Educational Psychologist, 2018

In the service of educational accountability, student achievement tests are being used to measure constructs quite unlike those envisioned by test developers. Scores are compared to cut points to create classifications like "proficient"; scores are combined over time to measure growth; student scores are aggregated to measure the…

Descriptors: Achievement Tests, Scores, Test Validity, Test Interpretation

Comparing the Impact of Online and Paper-and-Pencil Administration of the Self-Determination Inventory: Student Report

Peer reviewed

Raley, Sheida K.; Shogren, Karrie A.; Rifenbark, Graham G.; Anderson, Mark H.; Shaw, Leslie A. – Journal of Special Education Technology, 2020

The Self-Determination Inventory: Student Report (SDI: SR) was developed to measure the self-determination of adolescents and was recently validated for students aged 13-22 with and without disabilities across diverse racial/ethnic backgrounds. The SDI: SR is aligned with Causal Agency Theory and its theoretical conceptualizations of self-determined…

Descriptors: Testing, Self Determination, Scores, Students with Disabilities

Test Review: TestDaF

Peer reviewed

Norris, John; Drackert, Anastasia – Language Testing, 2018

The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…

Descriptors: German, Second Language Learning, Language Tests, Language Proficiency

The Importance of Assessment Literacy for Language Faculty

Peer reviewed

DiBiase-Lubrano, Mary Jo – Unterrichtspraxis/Teaching German, 2018

Language testing is an integral part of teaching and learning, yet most language faculty do not receive adequate training for developing tests (Taylor, 2009). Most have advanced degrees in literary and cultural studies in the target language but often have insufficient training in pedagogy and assessment. This shortcoming is alarming…

Descriptors: German, Second Language Learning, Second Language Instruction, Language Tests

National Reference Test Results Digest, 2021

Burge, Bethan; Benson, Louise – National Foundation for Educational Research, 2021

Since the introduction of the National Reference Test (NRT) in English and maths in 2017, NFER has been contracted by Ofqual to develop, deliver and analyse its results. The NRT is administered annually and shows if student performance in English and maths at GCSE level has changed from year to year. The NRT results are based on analysis of data…

Descriptors: National Competency Tests, Test Results, English, Mathematics Tests

Test Review: Wagner, R. K., Torgesen, J. K., Rashotte, C. A., & Pearson, N. A., "Comprehensive Test of Phonological Processing-2nd Ed. (CTOPP-2)." Austin, Texas: Pro-Ed

Peer reviewed

Dickens, Rachel H.; Meisinger, Elizabeth B.; Tarar, Jessica M. – Canadian Journal of School Psychology, 2015

The Comprehensive Test of Phonological Processing-Second Edition (CTOPP-2; Wagner, Torgesen, Rashotte, & Pearson, 2013) is a norm-referenced test that measures phonological processing skills related to reading for individuals aged 4 to 24. According to its authors, the CTOPP-2 may be used to identify individuals who are markedly below their…

Descriptors: Norm Referenced Tests, Phonology, Test Format, Testing

Involving Diverse Communities of Practice to Minimize Unintended Consequences of Test-Based Accountability Systems

Peer reviewed

Behizadeh, Nadia; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2015

In his focus article, Koretz (this issue) argues that accountability has become the primary function of large-scale testing in the United States. He then points out that tests being used for accountability purposes are flawed and that the high-stakes nature of these tests creates a context that encourages score inflation. Koretz is concerned about…

Descriptors: Communities of Practice, High Stakes Tests, Testing, Test Validity

Conceptions of Validity: The Private and the Public

Peer reviewed

Braun, Henry – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton is to be commended for addressing as challenging a topic as the clarification of the concept of validity. The impetus for this foray is Newton's judgment that, despite decades of development, the definition and elaboration of the term test validity in the 1999 "Standards" retains sufficient ambiguity to permit, if not invite, both…

Descriptors: Educational Improvement, Test Validity, Validity, Tests


Source

Language Testing	8
Measurement:…	5
Partnership for Assessment of…	4
Assessment in Education:…	2
Educational Psychologist	2
Educational Researcher	2
Journal of Psychoeducational…	2
New Meridian Corporation	2
ACT, Inc.	1
AEDS Journal	1
American Psychologist	1
Applied Psychological…	1
Assessment in Education…	1
Australian Journal of…	1
CBE - Life Sciences Education	1
Canadian Journal of School…	1
Catalyst, The Journal of the…	1
Center for Assessment and…	1
Communique	1
Diagnostique	1
ESL Magazine	1
Edinburgh Working Papers in…	1
Educational Measurement:…	1
Educational Testing Service	1
Illinois School Research and…	1

Author

Kane, Michael	4
Davies, Alan	2
Kapes, Jerome T.	2
Aiken, Lewis R.	1
Allen, Thomas E.	1
Amery D. Wu	1
Anderson, Mark H.	1
Anderson, Paul S.	1
Baker, Beverly A.	1
Bardo, John W.	1
Barker, Larry L.	1
Behizadeh, Nadia	1
Benson, Louise	1
Bornstein, Robert F.	1
Botting, Nicola	1
Braun, Henry	1
Bridgham, Robert G.	1
Burge, Bethan	1
Butler, Frances A.	1
Camara, Wayne J.	1
Chu, Yiting	1
Couch, Brian A.	1
Cziko, Gary A.	1
Darling-Hammond, Linda	1

Assessments and Surveys

Test of English as a Foreign…	3
ACT Assessment	2
International English…	2
National Assessment of…	2
Wechsler Adult Intelligence…	2
Clinical Evaluation of…	1
General Aptitude Test Battery	1
Measures of Academic Progress	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Strengths and Difficulties…	1
Test of Adult Basic Education	1
Torrance Tests of Creative…	1
Vineland Adaptive Behavior…	1
Wechsler Intelligence Scale…	1
Wechsler Memory Scale	1
Wide Range Achievement Test	1
Woodcock Johnson Tests of…	1