Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 17 |
Since 2016 (last 10 years) | 40 |
Since 2006 (last 20 years) | 61 |
Descriptor
Scores | 387 |
Testing Problems | 387 |
Elementary Secondary Education | 104 |
Standardized Tests | 102 |
Achievement Tests | 86 |
Test Validity | 81 |
Test Interpretation | 80 |
College Entrance Examinations | 63 |
Test Reliability | 60 |
Test Results | 60 |
Higher Education | 57 |
Author
Jackson, Rex | 6 |
Hambleton, Ronald K. | 5 |
Sinharay, Sandip | 4 |
Alderman, Donald L. | 3 |
Breland, Hunter M. | 3 |
Cannell, John Jacob | 3 |
Koretz, Daniel | 3 |
Arter, Judith A. | 2 |
Barker, Pierce | 2 |
Beaton, Albert E. | 2 |
Beck, Michael D. | 2 |
Audience
Researchers | 30 |
Practitioners | 13 |
Administrators | 3 |
Teachers | 3 |
Parents | 2 |
Policymakers | 2 |
Community | 1 |
Counselors | 1 |
Location
Canada | 5 |
China | 5 |
Japan | 3 |
New Jersey | 3 |
Ohio | 3 |
United Kingdom | 3 |
Georgia | 2 |
New Hampshire | 2 |
Taiwan | 2 |
Thailand | 2 |
Arizona | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 6 |
No Child Left Behind Act 2001 | 6 |
Education Consolidation… | 1 |
Every Student Succeeds Act… | 1 |
Hawkins Stafford Act 1988 | 1 |
Individuals with Disabilities… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards with or without Reservations | 1 |
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Paul T. von Hippel – Annenberg Institute for School Reform at Brown University, 2023
Longitudinal studies can produce biased estimates of learning if children miss tests. In an application to summer learning, we illustrate how missing test scores can create an illusion of large summer learning gaps when true gaps are close to zero. We demonstrate two methods that reduce bias by exploiting the correlations between missing and…
Descriptors: Testing Problems, Scores, Educational Research, Longitudinal Studies
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Laird, Robert D. – Developmental Psychology, 2020
Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Kalemdaroglu-Wheeler, Elif – ProQuest LLC, 2023
The purpose of this qualitative exploratory case study was to explore teachers' and administrators' perceptions of test score pollution deriving from COVID-19-related issues that may affect students' test scores on state-mandated standardized tests for grades six through 12 in a state along the Atlantic Coast of the United States. Four research…
Descriptors: Testing Problems, Scores, COVID-19, Pandemics
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Rios, Joseph A.; Deng, Jiayi; Ihlenfeldt, Samuel D. – Educational Assessment, 2022
The present meta-analysis sought to quantify the average degree of aggregated test score distortion due to rapid guessing (RG). Included studies group-administered a low-stakes cognitive assessment, identified RG via response times, and reported the rate of examinees engaging in RG, the percentage of RG responses observed, and/or the degree of…
Descriptors: Guessing (Tests), Testing Problems, Scores, Item Response Theory
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020
The use of difference scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinuing the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. Continued research and practical efforts require measures that assess teachers' stress briefly and accurately. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Goedl, Patricia A.; Malla, Ganesh B. – American Journal of Distance Education, 2020
The purpose of this study was to empirically examine the grade distributions of proctored and unproctored exams in an online learning environment. The authors statistically compared exam scores and time to complete exams for proctored and unproctored exams in two online courses. Student data were collected from an online section of introductory…
Descriptors: Supervision, Computer Assisted Testing, Testing Problems, Grade Inflation
Schneider, W. Joel; Roman, Zachary – Journal of Psychoeducational Assessment, 2018
We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
Descriptors: Statistical Data, Simulation, Testing, Scores