ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	18
Since 2017 (last 10 years)	44
Since 2007 (last 20 years)	74

Descriptor

Test Items	358
Testing Problems	358
Test Construction	162
Item Analysis	84
Test Validity	72
Higher Education	67
Test Bias	65
Multiple Choice Tests	61
Test Format	60
Elementary Secondary Education	55
Difficulty Level	53
Test Reliability	53
Foreign Countries	50
Computer Assisted Testing	47
Achievement Tests	46
Latent Trait Theory	43
Scores	40
Item Response Theory	38
Mathematical Models	37
Adaptive Testing	36
Test Interpretation	31
Testing	30
Scoring	29
Psychometrics	27
Guessing (Tests)	26
More ▼

Publication Type

Reports - Research	196
Journal Articles	146
Speeches/Meeting Papers	100
Reports - Evaluative	78
Reports - Descriptive	23
Opinion Papers	21
Guides - Non-Classroom	18
Information Analyses	16
Guides - Classroom - Teacher	10
Tests/Questionnaires	10
Books	7
Collected Works - General	5
Numerical/Quantitative Data	5
Dissertations/Theses -…	3
Collected Works - Proceedings	2
Guides - Classroom - Learner	2
Collected Works - Serials	1
ERIC Publications	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	13
Secondary Education	13
Elementary Secondary Education	9
Elementary Education	4
High Schools	3
Adult Education	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Grade 3	1
Grade 4	1
Grade 6	1
Grade 9	1
Primary Education	1
More ▼

Audience

Researchers	37
Practitioners	18
Teachers	11
Students	3
Counselors	2
Administrators	1

Location

Netherlands	6
Canada	4
United Kingdom (Great Britain)	4
Germany	3
Sweden	3
United Kingdom	3
United States	3
Colombia	2
Japan	2
Latin America	2
South Africa	2
Arizona	1
Brazil	1
Burma	1
California	1
China	1
Hawaii	1
Hong Kong	1
Indonesia	1
Kentucky	1
Massachusetts	1
New Jersey	1
New Zealand	1
Russia	1
South Korea	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Individuals with Disabilities…	2
Education for All Handicapped…	1
Elementary and Secondary…	1
Immigration Reform and…	1
Perkins Loan Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 358 results Save | Export

Hybrid Threshold-Based Sequential Procedures for Detecting Compromised Items in a Computerized Adaptive Testing Licensure Exam

Peer reviewed

Direct link

Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022

Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…

Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory

Assessing Mode Effects of At-Home Testing without a Randomized Trial. Research Report. ETS RR-21-10

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021

In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…

Descriptors: Testing, Distance Education, Comparative Analysis, Test Items

Population Invariance in Composite-Score Equating with the Random Groups Design

Direct link

Chang, Kuo-Feng – ProQuest LLC, 2022

This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…

Descriptors: Test Items, Equated Scores, Methods, Design

The Development of a Standardized Effect Size for the SIBTEST Procedure

Peer reviewed

Direct link

James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024

In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size

Evaluating Research Reports on the Qualities of Tests of English Language Skills in Indonesian Schools: A Systematic Review

Peer reviewed
PDF on ERIC

Download full text

Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025

The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Simultaneous Detection of Cheaters and Compromised Items Using a Biclustering Approach

Peer reviewed

Direct link

Hyeryung Lee; Walter P. Vispoel – Journal of Educational Measurement, 2025

Traditional methods for detecting cheating on assessments tend to focus on either identifying cheaters or compromised items in isolation, overlooking their interconnection. In this study, we present a novel biclustering approach that simultaneously detects both cheaters and compromised items by identifying coherent subgroups of examinees and items…

Descriptors: Identification, Cheating, Test Wiseness, Test Items

Item Calibration Methods with Multiple Subscale Multistage Testing

Peer reviewed

Direct link

Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020

Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…

Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Spoilt for Choice? Issues around the Use and Comparability of Optional Exam Questions

Peer reviewed

Direct link

Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019

For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…

Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries

Low Stakes, High Risks: The Problem of Intertemporal Validity of PISA in Latin America

Peer reviewed

Direct link

Rivas, Axel; Scasso, Martín Guillermo – Journal of Education Policy, 2021

Since 2000, the PISA test implemented by OECD has become the prime benchmark for international comparisons in education. The 2015 PISA edition introduced methodological changes that altered the nature of its results. PISA made no longer valid non-reached items of the final part of the test, assuming that those unanswered questions were more a…

Descriptors: Test Validity, Computer Assisted Testing, Foreign Countries, Achievement Tests

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

IRTrees for Skipping Items in PIRLS

Peer reviewed

Direct link

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

Problematizing the Measurement of Gender Identity in K-12 Education Survey Research: A Systematic Review

Peer reviewed

Direct link

Mario I. Suárez – Educational Studies: Journal of the American Educational Studies Association, 2024

The increase in youth's self-identification as trans in the United States and Canada has created new urgency in schools to meet the needs of these students, yet education survey researchers have yet to find ways to assess their educational outcomes based on sex and gender. In this critical systematic review, I provide an overview of surveys from…

Descriptors: Measures (Individuals), Sexual Identity, Identification (Psychology), LGBTQ People

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 24

Journal of Educational…	23
Educational Measurement:…	12
Educational and Psychological…	9
Applied Measurement in…	5
Journal of Educational and…	5
Journal of Experimental…	4
Applied Psychological…	3
ETS Research Report Series	3
Economics	3
Language Testing in Asia	3
ProQuest LLC	3
Assessment in Education:…	2
International Journal of…	2
International Journal of…	2
Journal of Economic Education	2
Journal of Educational…	2
Writing Program Administration	2
AERA Online Paper Repository	1
Alberta Journal of…	1
Annual Review of Applied…	1
Arts Education Policy Review	1
Biochemical Education	1
British Journal of Language…	1
Business and Professional…	1
Canadian Journal of Education	1
More ▼

Hambleton, Ronald K.	7
Stocking, Martha L.	7
Wainer, Howard	7
Lord, Frederic M.	5
Plake, Barbara S.	4
Wilcox, Rand R.	4
Wise, Steven L.	4
Davey, Tim	3
Jaeger, Richard M.	3
Kelderman, Henk	3
Mills, Craig N.	3
Parshall, Cynthia G.	3
Sarvela, Paul D.	3
Secolsky, Charles	3
Sinharay, Sandip	3
Smith, Richard M.	3
van der Linden, Wim J.	3
Boekkooi-Timminga, Ellen	2
Childs, Ruth A.	2
Debeer, Dries	2
Diamond, Esther E.	2
Ebel, Robert L.	2
Frary, Robert B.	2
Gilmer, Jerry S.	2
More ▼

National Assessment of…	9
SAT (College Admission Test)	9
Program for International…	7
Graduate Record Examinations	4
ACT Assessment	3
Stanford Achievement Tests	3
State Trait Anxiety Inventory	3
Comprehensive Tests of Basic…	2
Graduate Management Admission…	2
Iowa Tests of Basic Skills	2
New Jersey College Basic…	2
Sequential Tests of…	2
Wechsler Adult Intelligence…	2
Wechsler Intelligence Scale…	2
Advanced Placement…	1
Armed Services Vocational…	1
California Achievement Tests	1
Expressive One Word Picture…	1
Eysenck Personality Inventory	1
Gates MacGinitie Reading Tests	1
High School Longitudinal…	1
Massachusetts Comprehensive…	1
Medical College Admission Test	1
National Teacher Examinations	1
Peabody Picture Vocabulary…	1
More ▼