ERIC - Search Results

Publication Date

In 2026	0
Since 2025	215
Since 2022 (last 5 years)	1084
Since 2017 (last 10 years)	2594
Since 2007 (last 20 years)	4955

Descriptor

Test Items	9547
Test Construction	2723
Foreign Countries	2184
Item Response Theory	1872
Difficulty Level	1623
Item Analysis	1502
Test Validity	1416
Test Reliability	1187
Multiple Choice Tests	1158
Scores	1137
Computer Assisted Testing	1058
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	855
Statistical Analysis	852
Mathematics Tests	845
Psychometrics	833
Test Bias	772
Models	754
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1314
Postsecondary Education	1064
Secondary Education	927
Elementary Education	716
Middle Schools	420
High Schools	363
Elementary Secondary Education	359
Junior High Schools	320
Grade 8	256
Intermediate Grades	209
Grade 4	183
Early Childhood Education	178
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	69
Grade 2	56
Grade 10	53
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	38
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	226
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	66
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
New York	46
Missouri	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Sweden	38
Israel	37
Singapore	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 226 to 240 of 9,547 results Save | Export

Item Type and Survey Mode Comparability: An Analysis of Measurement Invariance between Item Response Types and Survey Modes

Direct link

Jackson, Kayla – ProQuest LLC, 2023

Prior research highlights the benefits of multimode surveys and best practices for item-by-item (IBI) and matrix-type survey items. Some researchers have explored whether mode differences for online and paper surveys persist for these survey item types. However, no studies discuss measurement invariance when both item types and online modes are…

Descriptors: Test Items, Surveys, Error of Measurement, Item Response Theory

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Direct link

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

Effect of Missing Data on Test Equating Methods Under NEAT Design

Peer reviewed
PDF on ERIC

Download full text

Semih Asiret; Seçil Ömür Sünbül – International Journal of Psychology and Educational Studies, 2023

In this study, it was aimed to examine the effect of missing data in different patterns and sizes on test equating methods under the NEAT design for different factors. For this purpose, as part of this study, factors such as sample size, average difficulty level difference between the test forms, difference between the ability distribution,…

Descriptors: Research Problems, Data, Test Items, Equated Scores

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

Visualizing Depth of Student Conceptual Understanding Using Subquestions and Alluvial Diagrams

Peer reviewed

Direct link

Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2023

We aim to graphically analyze the depth of conceptual understanding behind the Force Concept Inventory (FCI) responses of students, focusing on three questions (questions 1, 15, and 28). In our study, we created and implemented subquestions to clarify and quantify the students' reasoning steps in reaching their responses to the original FCI…

Descriptors: Scientific Concepts, Concept Formation, Misconceptions, Visual Aids

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Is It Possible to Assess the Two-Domain Definition of the Broad Autism Phenotype Using the Available Measurement Tools?

Peer reviewed

Direct link

Godoy-Giménez, M.; González-Rodríguez, A.; Cañadas, F.; Estévez, A. F.; Sayans-Jiménez, P. – Journal of Autism and Developmental Disorders, 2022

Although, the operationalization of the autism spectrum disorder has been updated around two domains, the broad autism phenotype (BAP) one has not. Additionally, the items of the three common BAP measures, the Broad Autism Phenotype Questionnaire (BAPQ), the Autism Quotient, and the Social Responsiveness Scale (SRS), remain organized around a…

Descriptors: Autism, Pervasive Developmental Disorders, Measurement, Screening Tests

A Residual-Based Differential Item Functioning Detection Framework in Item Response Theory

Peer reviewed

Direct link

Lim, Hwanggyu; Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

Differential item functioning (DIF) of test items should be evaluated using practical methods that can produce accurate and useful results. Among a plethora of DIF detection techniques, we introduce the new "Residual DIF" (RDIF) framework, which stands out for its accessibility without sacrificing efficacy. This framework consists of…

Descriptors: Test Items, Item Response Theory, Identification, Robustness (Statistics)

On the Relationship between Item Stem Formulation and Criterion Validity of Multiple-Component Measuring Instruments

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022

The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…

Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level

Use of the Lagrange Multiplier Test for Assessing Measurement Invariance under Model Misspecification

Peer reviewed

Direct link

Guastadisegni, Lucia; Cagnone, Silvia; Moustaki, Irini; Vasdekis, Vassilis – Educational and Psychological Measurement, 2022

This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach,…

Descriptors: Measurement, Statistical Analysis, Item Response Theory, Test Items

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Direct link

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

Generating Multiple Choice Questions with a Multi-Angle Question Answering Model

Peer reviewed
PDF on ERIC

Download full text

Direct link

Olney, Andrew M. – Grantee Submission, 2022

Multi-angle question answering models have recently been proposed that promise to perform related tasks like question generation. However, performance on related tasks has not been thoroughly studied. We investigate a leading model called Macaw on the task of multiple choice question generation and evaluate its performance on three angles that…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Models

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

The Development of Epistemological Understanding Revisited: Enhancing Reliability of the Tool by Using Only Abstract Items

Peer reviewed

Direct link

Zyluk, Natalia; Karpe, Karolina; Urbanski, Mariusz – SAGE Open, 2022

The aim of this paper is to describe the process of modification of the research tool designed for measuring the development of personal epistemology--"Standardized Epistemological Understanding Assessment" (SEUA). SEUA was constructed as an improved version of the instrument initially proposed by Kuhn et al. SEUA was proved to be a more…

Descriptors: Epistemology, Research Tools, Beliefs, Test Items

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

« Previous Page | Next Page »

Pages: 1 | ... | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | ... | 637

Educational and Psychological…	416
Journal of Educational…	367
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	80
Journal of Psychoeducational…	72
Educational Assessment	70
Practical Assessment,…	60
Measurement:…	57
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5882
Reports - Research	5592
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	179
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼