ERIC - Search Results

Showing 1 to 15 of 54 results

Explaining Performance Decline over the Course of Taking Comprehensive Proficiency Tests: The Roles of Effort and Omission Propensity

Peer reviewed

Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024

In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…

Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries

Population Invariance in Composite-Score Equating with the Random Groups Design

Chang, Kuo-Feng – ProQuest LLC, 2022

This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…

Descriptors: Test Items, Equated Scores, Methods, Design

The Development of a Standardized Effect Size for the SIBTEST Procedure

Peer reviewed

James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024

In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size

Estimating Learning When Test Scores Are Missing: The Problem and Two Solutions. EdWorkingPaper No. 23-864

Paul T. von Hippel – Annenberg Institute for School Reform at Brown University, 2023

Longitudinal studies can produce biased estimates of learning if children miss tests. In an application to summer learning, we illustrate how missing test scores can create an illusion of large summer learning gaps when true gaps are close to zero. We demonstrate two methods that reduce bias by exploiting the correlations between missing and…

Descriptors: Testing Problems, Scores, Educational Research, Longitudinal Studies

Perceptions of Test Score Pollution Stemming from COVID-19 and State Testing: An Exploratory Case Study

Kalemdaroglu-Wheeler, Elif – ProQuest LLC, 2023

The purpose of this qualitative exploratory case study was to explore teachers' and administrators' perceptions of test score pollution deriving from COVID-19-related issues that may affect students' test scores on state-mandated standardized tests for grades six through 12 in a state along the Atlantic Coast of the United States. Four research…

Descriptors: Testing Problems, Scores, COVID-19, Pandemics

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

To What Degree Does Rapid Guessing Distort Aggregated Test Scores? A Meta-Analytic Investigation

Peer reviewed

Rios, Joseph A.; Deng, Jiayi; Ihlenfeldt, Samuel D. – Educational Assessment, 2022

The present meta-analysis sought to quantify the average degree of aggregated test score distortion due to rapid guessing (RG). Included studies group-administered a low-stakes cognitive assessment, identified RG via response times, and reported the rate of examinees engaging in RG, the percentage of RG responses observed, and/or the degree of…

Descriptors: Guessing (Tests), Testing Problems, Scores, Item Response Theory

Investigating Repeater Effects on Small Sample Equating: Include or Exclude?

Peer reviewed

Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020

Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…

Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems

IRTrees for Skipping Items in PIRLS

Peer reviewed

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

Which Assessment Is Harder? Some Limits of Statistical Linking

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

Measurement Invariance of Scores on the Teacher Stress Scale: International Sample of PreK-12 Teachers

Peer reviewed

Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025

Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…

Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
