ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Descriptor

Item Analysis	83
Test Items	83
Testing Problems	83
Test Construction	41
Test Validity	23
Multiple Choice Tests	22
Test Bias	19
Difficulty Level	18
Higher Education	16
Latent Trait Theory	16
Test Reliability	16
Mathematical Models	14
Achievement Tests	13
Scores	13
Test Format	13
Item Banks	9
Response Style (Tests)	9
College Entrance Examinations	8
Test Interpretation	8
Criterion Referenced Tests	7
Standardized Tests	7
Test Length	7
Black Students	6
Elementary Secondary Education	6
Guessing (Tests)	6

Publication Type

Reports - Research	54
Journal Articles	29
Speeches/Meeting Papers	28
Reports - Evaluative	11
Reports - Descriptive	6
Opinion Papers	4
Guides - Classroom - Teacher	2
Guides - Non-Classroom	2
Information Analyses	2
Dissertations/Theses -…	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Adult Education	1
Secondary Education	1

Audience

Researchers	15
Practitioners	1

Location

California	1
Colombia	1
New Zealand	1
Russia	1
South Africa	1
Turkey	1
Virginia	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

SAT (College Admission Test)	4
ACT Assessment	3
Sequential Tests of…	2
Stanford Achievement Tests	2
California Achievement Tests	1
Expressive One Word Picture…	1
Gates MacGinitie Reading Tests	1
Graduate Record Examinations	1
Iowa Tests of Basic Skills	1
Wechsler Adult Intelligence…	1

Showing 1 to 15 of 83 results

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Local Placement Test Retrofit and Building Language Assessment Literacy with Teacher Stakeholders: A Case Study from Colombia

Peer reviewed

Janssen, Gerriet – Language Testing, 2022

This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…

Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty

Disruption of the Relational and Item-Specific Processing Supports the Negative Outcomes of Multiple-Choice Testing with Additional Lures

Paneerselvam, Bavani – ProQuest LLC, 2017

Multiple-choice retrieval practice with additional lures reduces retention on a later test (Roediger & Marsh, 2005). However, the mechanism underlying the negative outcomes with additional lures is poorly understood. Given that the positive outcomes of retrieval practice are associated with enhanced relational and item-specific processing…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Recall (Psychology)

Proficiency Exams in Teaching Turkish as a Foreign Language in TÖMER (Turkish and Foreign Languages Research and Application Centers)

Peer reviewed

Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020

Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…

Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

The Russian Uniform State Examination in Mathematics: The Latest Version

Peer reviewed

Marushina, Albina – Journal of Mathematics Education at Teachers College, 2012

This paper aims to tell how the Russian national examination in mathematics (the Uniform State Examination or USE) has been conducted most recently. The author must say at once that the history of the system of secondary school graduation examinations or even the history of the USE will be covered only to the small degree that is necessary for…

Descriptors: Foreign Countries, Mathematics Tests, National Competency Tests, Secondary School Mathematics

Open-Ended Test Items Pose Challenges

Sawchuk, Stephen – Education Week, 2010

Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…

Descriptors: Test Items, Federal Legislation, Scoring, Accountability

Ongoing Issues in Test Fairness

Peer reviewed

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Detecting and Interpreting Local Item Dependence Using a Family of Rasch Models.

Peer reviewed

Wilson, Mark – Applied Psychological Measurement, 1988

A method for detecting and interpreting disturbances of the local-independence assumption among items that share common stimulus material or other features is presented. Dichotomous and polytomous Rasch models are used to analyze structure of the learning outcome superitems. (SLD)

Descriptors: Item Analysis, Latent Trait Theory, Mathematical Models, Test Interpretation

Objective-Referenced-Test Rescore Decisions and Item Statistics: A Matter of Congruence.

Shannon, Gregory A. – 1983

Rescoring of Center for Occupational and Professional Assessment objective-referenced tests is decided largely by content experts selected by client organizations. A few of the test items, statistically flagged for review, are not rescored. Some of this incongruence could be due to the use of the biserial correlation (r-biserial) as an…

Descriptors: Adults, Criterion Referenced Tests, Item Analysis, Occupational Tests

A Note on the Potential for Bias in the Structure and Wording of Questionnaire Items.

Cantwell, Zita M. – Evaluation News, 1985

The wording and structure of questionnaire items can interact with specified sample categories based on evaluation goals and respondent characteristics. The effects of the interactions can restructure samples and introduce bias into the data analysis. These effects, and suggestions for avoiding them, are demonstrated for five types of…

Descriptors: Higher Education, Item Analysis, Questionnaires, Statistical Bias

Quantitative Methods Used in the Study of Item Bias.

Hills, John R. – 1984

The literature on item bias, i.e., the question of whether some items in tests favor one cultural group over another cultural group due to irrelevant factors, is reviewed and evaluated. All known references through 1981 are described including a large number of unpublished reports. Each method is described and the criticisms that have appeared in…

Descriptors: Evaluation Methods, Item Analysis, Racial Differences, Test Bias

Was There One Distractor Too Many?

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Statistics, 1984

A mathematics item on the Scholastic Aptitude Test (SAT) was found to be faulty and received wide publicity. A detailed investigation into its mathematical and psychometric properties is presented. It was found that the problem could be considered ambiguous but that almost no one noticed the ambiguity. (Author/JKS)

Descriptors: Classification, College Entrance Examinations, Geometry, High Schools

Problems in Scoring, Agreement among Raters, and Internal Consistency of Selected Marker Tests.

Peer reviewed

Rusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979

The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)

Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving

Source

Educational Measurement:…	5
Educational and Psychological…	3
Journal of Educational…	3
Journal of Economic Education	2
Applied Psychological…	1
Education Week	1
Educational Research and…	1
Educational Technology	1
Evaluation News	1
Evaluation in Education:…	1
Instructional Science	1
International Journal for the…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Language and…	1
Journal of Mathematics…	1
Journal of Research and…	1
Language Testing	1
Nursing Outlook	1
Online Submission	1
ProQuest LLC	1
School Psychology Review	1
Spectrum	1
TESOL Quarterly	1

Author

Jaeger, Richard M.	2
Lord, Frederic M.	2
Reckase, Mark D.	2
Sarvela, Paul D.	2
Secolsky, Charles	2
Wainer, Howard	2
Altepeter, Tom	1
Anderson, Lorin W.	1
Bezirhan, Ummugul	1
Bhaskar, R.	1
Bolus, Roger	1
Bond, Lloyd	1
Bower, Ruth	1
Braswell, James	1
Bresnock, Anne E.	1
Broussard, Rolland L.	1
Cahen, Leonard S.	1
Camenares, Devin	1
Camilli, Gregory	1
Cantwell, Zita M.	1
Carter, Kathy	1
Chastain, Kenneth D.	1
Choppin, Bruce	1
Craig, Robert	1