Showing 1 to 15 of 387 results
Peer reviewed
Direct link
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2024
Rapid guessing (RG) is a form of non-effortful responding that is characterized by short response latencies. This construct-irrelevant behavior has been shown in previous research to bias inferences concerning measurement properties and scores. To mitigate these deleterious effects, a number of response time threshold scoring procedures have been…
Descriptors: Reaction Time, Scores, Item Response Theory, Guessing (Tests)
Peer reviewed
Direct link
Minghui Yao; Yunjie Xu – Sociological Methods & Research, 2024
As a crucial method in organizational and social behavior research, self-report surveys must manage method bias. Method biases are distorted scores in survey response, distorted variance in variables, and distorted relational estimates between variables caused by method designs. Studies on method bias have focused on "post hoc"…
Descriptors: Statistical Bias, Social Science Research, Questionnaires, Test Bias
Peer reviewed
Direct link
Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024
I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…
Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education
Peer reviewed
Direct link
Corinne Huggins-Manley; Anthony W. Raborn; Peggy K. Jones; Ted Myers – Journal of Educational Measurement, 2024
The purpose of this study is to develop a nonparametric DIF method that (a) compares focal groups directly to the composite group that will be used to develop the reported test score scale, and (b) allows practitioners to explore for DIF related to focal groups stemming from multicategorical variables that constitute a small proportion of the…
Descriptors: Nonparametric Statistics, Test Bias, Scores, Statistical Significance
Peer reviewed
Direct link
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
Peer reviewed
PDF on ERIC – Download full text
Celen, Umit – International Journal of Assessment Tools in Education, 2021
This study examined the calculation methods of P121 and P10 scores used in teacher appointments. The statistics regarding the Public Personnel Selection Examination (PPSE) subtests used by Measurement, Selection and Placement Center (MSPC) in 2018, 2019 and 2020 were accessed from the website of the institution. The parameters not published on…
Descriptors: Teacher Placement, Scores, Teacher Competency Testing, Foreign Countries
Peer reviewed
Direct link
Okim Kang; Xun Yan; Maria Kostromitina; Ron Thomson; Talia Isaacs – Language Testing, 2024
This study aimed to answer an ongoing validity question related to the use of nonstandard English accents in international tests of English proficiency and associated issues of test fairness. More specifically, we examined (1) the extent to which different or shared English accents had an impact on listeners' performances on the Duolingo listening…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Nonstandard Dialects
Peer reviewed
Direct link
Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023
Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…
Descriptors: Scores, Test Validity, Test Items, Prior Learning
Peer reviewed
Direct link
Rios, Joseph A. – Applied Measurement in Education, 2022
Testing programs are confronted with the decision of whether to report individual scores for examinees that have engaged in rapid guessing (RG). As noted by the "Standards for Educational and Psychological Testing," this decision should be based on a documented criterion that determines score exclusion. To this end, a number of heuristic…
Descriptors: Testing, Guessing (Tests), Academic Ability, Scores
Peer reviewed
Direct link
Quinn, David M.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
The estimation of test score "gaps" and gap trends plays an important role in monitoring educational inequality. Researchers decompose gaps and gap changes into within- and between-school portions to generate evidence on the role schools play in shaping these inequalities. However, existing decomposition methods assume an equal-interval…
Descriptors: Scores, Tests, Achievement Gap, Equal Education
Gorney, Kylie – ProQuest LLC, 2023
Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…
Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
Peer reviewed
Direct link
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest–posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
Peer reviewed
Direct link
Christopher D. Wilson; Kevin C. Haudek; Jonathan F. Osborne; Zoë E. Buck Bracey; Tina Cheuk; Brian M. Donovan; Molly A. M. Stuhlsatz; Marisol M. Santiago; Xiaoming Zhai – Journal of Research in Science Teaching, 2024
Argumentation is fundamental to science education, both as a prominent feature of scientific reasoning and as an effective mode of learning, a perspective reflected in contemporary frameworks and standards. The successful implementation of argumentation in school science, however, requires a paradigm shift in science assessment from the…
Descriptors: Middle School Students, Competence, Science Process Skills, Persuasive Discourse
Peer reviewed
PDF on ERIC – Download full text
Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024
Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choices. Differential item functioning [DIF] analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…
Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory