ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	14

Descriptor

Item Analysis	178
Testing Problems	178
Test Items	83
Test Construction	67
Test Validity	55
Test Reliability	40
Test Bias	37
Latent Trait Theory	35
Multiple Choice Tests	32
Achievement Tests	31
Higher Education	26
Response Style (Tests)	26
Test Interpretation	26
Mathematical Models	24
Scores	22
Difficulty Level	20
Elementary Secondary Education	20
Statistical Analysis	20
Scoring	19
Standardized Tests	16
Criterion Referenced Tests	15
Test Format	15
Testing	15
Error of Measurement	14
Guessing (Tests)	14
More ▼

Publication Type

Reports - Research	105
Journal Articles	52
Speeches/Meeting Papers	48
Reports - Evaluative	18
Reports - Descriptive	11
Opinion Papers	6
Guides - Non-Classroom	5
Information Analyses	3
Tests/Questionnaires	3
Books	2
Dissertations/Theses -…	2
Guides - Classroom - Teacher	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Guides - General	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Elementary Secondary Education	3
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Adult Education	1

Audience

Researchers	24
Practitioners	2
Teachers	1

Location

California	2
Colombia	1
Florida	1
Israel	1
Netherlands	1
New Hampshire	1
New Zealand	1
Ohio	1
Russia	1
South Africa	1
Turkey	1
Virginia	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Elementary and Secondary…	1
Emergency School Aid Act 1972	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 178 results Save | Export

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed
PDF on ERIC

Download full text

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Local Placement Test Retrofit and Building Language Assessment Literacy with Teacher Stakeholders: A Case Study from Colombia

Peer reviewed

Direct link

Janssen, Gerriet – Language Testing, 2022

This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…

Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty

Disruption of the Relational and Item-Specific Processing Supports the Negative Outcomes of Multiple-Choice Testing with Additional Lures

Direct link

Paneerselvam, Bavani – ProQuest LLC, 2017

Multiple-choice retrieval practice with additional lures reduces retention on a later test (Roediger & Marsh, 2005). However, the mechanism underlying the negative outcomes with additional lures is poorly understood. Given that the positive outcomes of retrieval practice are associated with enhanced relational and item-specific processing…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Recall (Psychology)

Proficiency Exams in Teaching Turkish as a Foreign Language in TÖMER (Turkish and Foreign Languages Research and Application Centers)

Peer reviewed
PDF on ERIC

Download full text

Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020

Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…

Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

The Russian Uniform State Examination in Mathematics: The Latest Version

Peer reviewed

Direct link

Marushina, Albina – Journal of Mathematics Education at Teachers College, 2012

This paper aims to tell how the Russian national examination in mathematics (the Uniform State Examination or USE) has been conducted most recently. The author must say at once that the history of the system of secondary school graduation examinations or even the history of the USE will be covered only to the small degree that is necessary for…

Descriptors: Foreign Countries, Mathematics Tests, National Competency Tests, Secondary School Mathematics

Open-Ended Test Items Pose Challenges

Direct link

Sawchuk, Stephen – Education Week, 2010

Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…

Descriptors: Test Items, Federal Legislation, Scoring, Accountability

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

A Review of Academic Achievement Tests: Recommendations for Age Appropriate Administration

Direct link

Kozloff, Allison Burstein – ProQuest LLC, 2009

Comprehensive academic achievement tests are routinely used by school psychologists in psycho-educational assessment batteries to identify learning disabled students. A variety of assessment measures are used across age groups to determine if a discrepancy exists between academic achievement and intellectual functioning; however, among the most…

Descriptors: Intelligence, Educational Assessment, Academic Achievement, Achievement Tests

A Procedure for Sample-Free Item Analysis

Wright, Benjamin; Panchapakesan, Nargis – Educ Psychol Meas, 1969

Descriptors: Item Analysis, Psychometrics, Test Validity, Testing Problems

Do Item-Discrimination Indices Really Help Us To Improve Our Tests?

Peer reviewed

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2001

Item-discrimination indices are numbers calculated from test data that are used in assessing the effectiveness of individual test questions. This article asserts that the indices are so unreliable as to suggest that countless good questions may have been discarded over the years. It considers how the indices, and hence overall test reliability,…

Descriptors: Guessing (Tests), Item Analysis, Test Reliability, Testing Problems

Item Discrimination: When More Is Worse.

Peer reviewed

Masters, Geofferey N. – Journal of Educational Measurement, 1988

High item discrimination can indicate a special kind of measurement disturbance via an item that gives high-ability persons a special advantage. The measurement disturbance is described, which occurs when an item is sensitive to individual differences on a second, undesired dimension that is correlated with the variable intended to be measured.…

Descriptors: Academically Gifted, Item Analysis, Test Bias, Test Wiseness

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Journal of Educational…	8
Educational Measurement:…	5
Educational and Psychological…	5
Psychometrika	4
Applied Measurement in…	3
Applied Psychological…	3
Educational Technology	2
Evaluation in Education:…	2
Journal of Economic Education	2
Journal of Educational…	2
Journal of Experimental…	2
ProQuest LLC	2
Assessment & Evaluation in…	1
Assessment in Education:…	1
Educ Psychol Meas	1
Education Week	1
Educational Research Quarterly	1
Educational Research and…	1
Evaluation News	1
Evaluation and the Health…	1
Hispanic Journal of…	1
Instructional Science	1
International Journal for the…	1
J Educ Meas	1
Journal of Consulting and…	1
More ▼

Lord, Frederic M.	5
Jaeger, Richard M.	3
Wainer, Howard	3
Wright, Benjamin D.	3
Choppin, Bruce	2
Doolittle, Allen E.	2
Green, Donald Ross	2
Hoover, H. D.	2
Jolly, S. Jean	2
Klein, Stephen P.	2
Loyd, Brenda H.	2
Reckase, Mark D.	2
Sarvela, Paul D.	2
Scheuneman, Janice	2
Secolsky, Charles	2
Waller, Michael I.	2
Wilson, Mark	2
Adkins, Dorothy C.	1
Altepeter, Tom	1
Anderson, Frances E.	1
Anderson, Lorin W.	1
Andrich, David	1
Andrulis, Richard S.	1
Angoff, William H.	1
More ▼

SAT (College Admission Test)	6
Stanford Achievement Tests	5
ACT Assessment	4
Iowa Tests of Basic Skills	4
California Achievement Tests	3
Graduate Record Examinations	3
Metropolitan Achievement Tests	3
Sequential Tests of…	2
Alabama High School…	1
Brazelton Neonatal Assessment…	1
Comprehensive Tests of Basic…	1
Expressive One Word Picture…	1
Gates MacGinitie Reading Tests	1
Kaufman Test of Educational…	1
Law School Admission Test	1
Lorge Thorndike Intelligence…	1
Metropolitan Readiness Tests	1
Minnesota Multiphasic…	1
Peabody Picture Vocabulary…	1
Slosson Intelligence Test	1
Stanford Binet Intelligence…	1
Tennessee Self Concept Scale	1
Texas Assessment of Academic…	1
Wechsler Adult Intelligence…	1
Wechsler Individual…	1
More ▼