ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	7
Since 2007 (last 20 years)	18

Descriptor

Item Analysis	38
Test Format	38
Test Validity	38
Test Items	27
Test Reliability	22
Test Construction	16
Multiple Choice Tests	10
Difficulty Level	7
Foreign Countries	7
Higher Education	7
Language Tests	7
Scoring	6
Mathematics Tests	5
Psychometrics	5
Science Tests	5
Secondary Education	5
Alternative Assessment	4
Correlation	4
Grade 8	4
Guessing (Tests)	4
Item Response Theory	4
Second Language Learning	4
Student Evaluation	4
Test Bias	4
Test Interpretation	4
More ▼

Publication Type

Reports - Research	25
Journal Articles	21
Speeches/Meeting Papers	7
Reports - Descriptive	4
Reports - Evaluative	4
Tests/Questionnaires	4
Guides - Non-Classroom	3
Information Analyses	2
Opinion Papers	2
Numerical/Quantitative Data	1

Education Level

Secondary Education	8
Higher Education	7
Postsecondary Education	7
Elementary Education	4
Junior High Schools	4
Middle Schools	4
Elementary Secondary Education	3
Grade 8	3
High Schools	3
Early Childhood Education	2
Grade 3	2
Grade 4	2
Grade 5	2
Grade 6	2
Grade 7	2
Intermediate Grades	2
Primary Education	2
More ▼

Audience

Practitioners	5
Researchers	3
Teachers	3
Administrators	1

Location

Canada	1
Georgia	1
Japan	1
Mexico	1
Nebraska	1
New York	1
New York (Albany)	1
New York (Buffalo)	1
New York (New York)	1
New York (Rochester)	1
New York (Syracuse)	1
North Dakota	1
Turkey	1
United Kingdom	1
United Kingdom (Belfast)	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

ACT Assessment	1
Beck Depression Inventory	1
Cornell Critical Thinking Test	1
Embedded Figures Test	1
Graduate Management Admission…	1
Program for International…	1
Rosenberg Self Esteem Scale	1
Sequential Tests of…	1
Test of English for…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 38 results Save | Export

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Direct link

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Effects of Data-Collection Designs in the Comparison of Computer-Based and Paper-Based Tests

Peer reviewed

Direct link

Arce-Ferrer, Alvaro J.; Bulut, Okan – Journal of Experimental Education, 2019

This study investigated the performance of four widely used data-collection designs in detecting test-mode effects (i.e., computer-based versus paper-based testing). The experimental conditions included four data-collection designs, two test-administration modes, and the availability of an anchor assessment. The test-level and item-level results…

Descriptors: Data Collection, Test Construction, Test Format, Computer Assisted Testing

The Development and Validation of a Lemma-Based Yes/No Vocabulary Size Test

Peer reviewed

Direct link

Masrai, Ahmed – SAGE Open, 2022

Vocabulary size measures serve important functions, not only with respect to placing learners at appropriate levels on language courses but also with a view to examining the progress of learners. One of the widely reported formats suitable for these purposes is the Yes/No vocabulary test. The primary aim of this study was to introduce and provide…

Descriptors: Vocabulary Development, Language Tests, English (Second Language), Second Language Learning

Preliminary Evaluation of the Psychometric Quality of HEIghten Quantitative Literacy

Peer reviewed
PDF on ERIC

Download full text

Katrina C. Roohr; HyeSun Lee; Jun Xu; Ou Lydia Liu; Zhen Wang – Numeracy, 2017

Quantitative literacy has been identified as an important student learning outcome (SLO) by both the higher education and workforce communities. This paper aims to provide preliminary evidence of the psychometric quality of the pilot forms for "HEIghten" quantitative literacy, a next-generation SLO assessment for students in higher…

Descriptors: Psychometrics, Numeracy, Test Items, Item Analysis

Validity and Reliability of Scores Obtained on Multiple-Choice Questions: Why Functioning Distractors Matter

Peer reviewed
PDF on ERIC

Download full text

Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016

Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Freeresponse (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…

Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Improving the Factor Structure of Psychological Scales: The Expanded Format as an Alternative to the Likert Scale Format

Peer reviewed

Direct link

Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016

Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…

Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items

Development of a Test of Scientific Argumentation

Peer reviewed
PDF on ERIC

Download full text

Frey, Bruce B.; Ellis, James D.; Bulgreen, Janis A.; Hare, Jana Craig; Ault, Marilyn – Electronic Journal of Science Education, 2015

"Scientific argumentation," defined as the ability to develop and analyze scientific claims, support claims with evidence from investigations of the natural world, and explain and evaluate the reasoning that connects the evidence to the claim, is a critical component of current science standards and is consistent with "Common Core…

Descriptors: Test Construction, Science Tests, Persuasive Discourse, Science Process Skills

The Creation and Validation of a Listening Vocabulary Levels Test

Peer reviewed

Direct link

McLean, Stuart; Kramer, Brandon; Beglar, David – Language Teaching Research, 2015

An important gap in the field of second language vocabulary assessment concerns the lack of validated tests measuring aural vocabulary knowledge. The primary purpose of this study is to introduce and provide preliminary validity evidence for the Listening Vocabulary Levels Test (LVLT), which has been designed as a diagnostic tool to measure…

Descriptors: Test Construction, Test Validity, English (Second Language), Second Language Learning

Rater Perceptions of Bias Using the Multiple Mini-Interview Format: A Qualitative Study

Peer reviewed
PDF on ERIC

Download full text

Alweis, Richard L.; Fitzpatrick, Caroline; Donato, Anthony A. – Journal of Education and Training Studies, 2015

Introduction: The Multiple Mini-Interview (MMI) format appears to mitigate individual rater biases. However, the format itself may introduce structural systematic bias, favoring extroverted personality types. This study aimed to gain a better understanding of these biases from the perspective of the interviewer. Methods: A sample of MMI…

Descriptors: Interviews, Interrater Reliability, Qualitative Research, Semi Structured Interviews

New York State Alternate Assessment Technical Report, 2014-15

Download full text

New York State Education Department, 2015

This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…

Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	2
Journal of Educational…	2
New York State Education…	2
Educational Research and…	1
Educational Sciences: Theory…	1
Electronic Journal of Science…	1
Journal of Dental Education	1
Journal of Economic Education	1
Journal of Education and…	1
Journal of Experimental…	1
Journal of Psychoeducational…	1
Journal of the Scholarship of…	1
Language Assessment Quarterly	1
Language Teaching Research	1
Language Testing	1
Nebraska Department of…	1
Numeracy	1
Practitioner Research in…	1
Psychometrika	1
Review of Educational Research	1
SAGE Open	1
More ▼

Benson, Jeri	2
Huntley, Renee M.	2
Abramzon, Andrea	1
Ali, Syed Haris	1
Allalouf, Avi	1
Alweis, Richard L.	1
Arce-Ferrer, Alvaro J.	1
Ault, Marilyn	1
Austin, Joe Dan	1
Beglar, David	1
Berberoglu, Giray	1
Bulgreen, Janis A.	1
Bulut, Okan	1
Camilli, Gregory	1
Carlson, James E.	1
Carr, Patrick A.	1
David Bell	1
Donato, Anthony A.	1
Dorans, Neil J.	1
Ellis, James D.	1
Fitzpatrick, Caroline	1
Frey, Bruce B.	1
Hare, Jana Craig	1
Henk, William A.	1
More ▼