ERIC - Search Results

Publication Date

In 2025	2
Since 2024	10
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	28

Descriptor

Error of Measurement	109
Testing Problems	109
Test Reliability	39
Scores	22
Test Interpretation	21
Test Validity	18
Higher Education	16
Test Construction	16
Test Items	16
Scoring	15
Item Analysis	14
Mathematical Models	14
Evaluation Methods	12
Statistical Analysis	12
Sampling	11
Achievement Tests	10
Elementary Secondary Education	10
Estimation (Mathematics)	10
Measurement Techniques	10
Test Bias	10
True Scores	10
Latent Trait Theory	9
Standardized Tests	9
Test Results	9
Item Response Theory	8
More ▼

Publication Type

Reports - Research	63
Journal Articles	55
Speeches/Meeting Papers	25
Reports - Evaluative	15
Reports - Descriptive	13
Opinion Papers	9
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
ERIC Publications	2
Tests/Questionnaires	2
Collected Works - Serials	1
Guides - Non-Classroom	1
Information Analyses	1
Legal/Legislative/Regulatory…	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Higher Education	5
Postsecondary Education	4
Elementary Secondary Education	3
Secondary Education	2
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
High Schools	1

Audience

Researchers	10
Practitioners	2

Location

Canada	1
Chile	1
Colombia	1
Ethiopia	1
Maine	1
Michigan	1
New Hampshire	1
Oregon	1
Thailand	1

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 109 results Save | Export

A Crash Course in Good and Bad Controls

Peer reviewed

Direct link

Carlos Cinelli; Andrew Forney; Judea Pearl – Sociological Methods & Research, 2024

Many students of statistics and econometrics express frustration with the way a problem known as "bad control" is treated in the traditional literature. The issue arises when the addition of a variable to a regression equation produces an unintended discrepancy between the regression coefficient and the effect that the coefficient is…

Descriptors: Regression (Statistics), Robustness (Statistics), Error of Measurement, Testing Problems

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

The Analysis of Marking Reliability through the Approach of Gauge Repeatability and Reproducibility (GR&R) Study: A Case of English-Speaking Test

Peer reviewed

Direct link

Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024

Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…

Descriptors: Foreign Countries, Students, English Language Learners, Speech

Evaluating Measurement Invariance of Students' Practices Regarding Online Information Questionnaire in PISA 2022: A Comparative Study Using MGCFA and Alignment Method

Peer reviewed

Direct link

Esra Sözer Boz – Education and Information Technologies, 2025

International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…

Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students

Preventing Satisficing: A Narrative Review

Peer reviewed

Direct link

Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024

Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…

Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation

The Effect of Student Examiner Errors on WAIS-IV and WISC-V Composite Scores

Direct link

Atehortua, Laura – ProQuest LLC, 2022

Intelligence tests are used in a variety of settings such as schools, clinics, and courts to assess the intellectual capacity of individuals of all ages. Intelligence tests are used to make high-stakes decisions such as special education placement, employment, eligibility for social security services, and determination of the death penalty.…

Descriptors: Adults, Intelligence Tests, Children, Error of Measurement

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Download full text

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Hurdles to Learning Assessment Quality: Their Detrimental Effects on Student Learning

Peer reviewed
PDF on ERIC

Download full text

Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024

The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…

Descriptors: Foreign Countries, College Faculty, College Students, Test Construction

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Direct link

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Review of the Use of Standardized Achievement Tests for Accountability Purposes in Education: The Colombia and Chile Cases

Direct link

Jose Antonio Mola Avila – ProQuest LLC, 2023

Accountability in education was implemented to improve poor learning outcomes by documenting and monitoring learning achievement results. In this process, external standardized achievement tests have played a central role, being the mechanism most frequently used to measure learning outcomes. However, several decades after its initial…

Descriptors: Foreign Countries, Standardized Tests, Achievement Tests, Accountability

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelist's ability to perform well the Bookmark method, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Exploring Student Sensemaking When Engaging with Anomalous Data

Peer reviewed

Direct link

Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024

In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…

Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments

Does the Effect of a Time Limit for Testing Impair Structural Investigations by Means of Confirmatory Factor Models?

Peer reviewed

Direct link

Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019

The article reports three simulation studies conducted to find out whether the effect of a time limit for testing impairs model fit in investigations of structural validity, whether the representation of the assumed source of the effect prevents impairment of model fit and whether it is possible to identify and discriminate this method effect from…

Descriptors: Timed Tests, Testing, Barriers, Testing Problems

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Direct link

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Educational and Psychological…	9
Journal of Educational…	5
Educational Measurement:…	4
International Journal of…	3
Journal of Experimental…	3
Applied Measurement in…	2
NCME Measurement in Education	2
Perceptual and Motor Skills	2
ProQuest LLC	2
Psychology in the Schools	2
School Psychology Review	2
American Educational Research…	1
Annenberg Institute for…	1
CBE - Life Sciences Education	1
Canadian Journal of School…	1
Child Abuse and Neglect: The…	1
Education and Information…	1
Educational Testing Service	1
Evaluation Review	1
Evaluation and Program…	1
Evaluation and Program…	1
Grantee Submission	1
International Association for…	1
International Journal of…	1
Intersection: A Journal at…	1
More ▼

Alderman, Donald L.	3
Gardner, Eric F.	2
Hambleton, Ronald K.	2
Livingston, Samuel A.	2
Lord, Frederic M.	2
Slate, John R.	2
Smith, Richard M.	2
Wright, Benjamin D.	2
Zimmerman, Donald W.	2
Adrian Adams	1
Altepeter, Tom	1
Alwin, Duane F.	1
Andrew Forney	1
Angela Johnson	1
Angoff, William H.	1
Atehortua, Laura	1
Atkinson, Leslie	1
Babcock, Ben	1
Backhoff, Eduardo	1
Barcikowski, Robert S.	1
Barford, Sean W.	1
Barker, Pierce	1
Bauer, Ernest A.	1
Belcher, Marcia	1
More ▼

Wechsler Intelligence Scale…	5
National Assessment of…	3
SAT (College Admission Test)	3
Wechsler Adult Intelligence…	2
Alabama High School…	1
Armed Services Vocational…	1
College Level Academic Skills…	1
Cornell Critical Thinking Test	1
Expressive One Word Picture…	1
Graduate Management Admission…	1
Graduate Record Examinations	1
Metropolitan Achievement Tests	1
New Jersey College Basic…	1
Program for International…	1
Rod and Frame Test	1
Sequential Tests of…	1
Stanford Achievement Tests	1
Stanford Binet Intelligence…	1
Vineland Adaptive Behavior…	1
Watson Glaser Critical…	1
More ▼