Showing 46 to 60 of 9,400 results
Peer reviewed
Raykov, Tenko; Huber, Chuck; Marcoulides, George A.; Pusic, Martin; Menold, Natalja – Measurement: Interdisciplinary Research and Perspectives, 2021
A readily and widely applicable procedure is discussed that can be used to point and interval estimate the probabilities of particular responses on polytomous items at pre-specified points along underlying latent continua. The items are thereby assumed to be part of unidimensional multi-component measuring instruments that may also contain binary…
Descriptors: Probability, Computation, Test Items, Responses
Peer reviewed
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
Peer reviewed
Sherry Everett Jones; Nancy D. Brener; Barbara Queen; Molly Hershey-Arista; William Harris; J. Michael Underwood – Journal of School Health, 2024
Background: School Health Profiles assesses school health policies and practices among US secondary schools. Methods: The 2020 School Health Profiles principal and teacher questionnaires were used for a test-retest reliability study. Cohen's kappa coefficients tested the agreement in dichotomous responses to each questionnaire variable at 2 time…
Descriptors: Administrator Surveys, Teacher Surveys, Questionnaires, Pretests Posttests
Peer reviewed
Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024
Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…
Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)
Peer reviewed
Yue Liu; Zhen Li; Hongyun Liu; Xiaofeng You – Applied Measurement in Education, 2024
Low test-taking effort of examinees has been considered a source of construct-irrelevant variance in item response modeling, leading to serious consequences on parameter estimation. This study aims to investigate how non-effortful response (NER) influences the estimation of item and person parameters in item-pool scale linking (IPSL) and whether…
Descriptors: Item Response Theory, Computation, Simulation, Responses
Peer reviewed
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Deven Carlson; Adam Shepardson – Annenberg Institute for School Reform at Brown University, 2024
As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a…
Descriptors: Weather, Climate, Heat, Academic Achievement
Peer reviewed
Weese, James D.; Turner, Ronna C.; Ames, Allison; Crawford, Brandon; Liang, Xinya – Educational and Psychological Measurement, 2022
A simulation study was conducted to investigate the heuristics of the SIBTEST procedure and how it compares with ETS classification guidelines used with the Mantel-Haenszel procedure. Prior heuristics have been used for nearly 25 years, but they are based on a simulation study that was restricted due to computer limitations and that modeled item…
Descriptors: Test Bias, Heuristics, Classification, Statistical Analysis
Peer reviewed
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022
This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…
Descriptors: Test Bias, Methods, Test Items, Scoring
Chang, Kuo-Feng – ProQuest LLC, 2022
This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…
Descriptors: Test Items, Equated Scores, Methods, Design
Peer reviewed
Fu, Yanyan; Choe, Edison M.; Lim, Hwanggyu; Choi, Jaehwa – Educational Measurement: Issues and Practice, 2022
This case study applied the "weak theory" of Automatic Item Generation (AIG) to generate isomorphic item instances (i.e., unique but psychometrically equivalent items) for a large-scale assessment. Three representative instances were selected from each item template (i.e., model) and pilot-tested. In addition, a new analytical framework,…
Descriptors: Test Items, Measurement, Psychometrics, Test Construction
Peer reviewed
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023
In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l_z and l*_z person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…
Descriptors: Test Items, Scores, Goodness of Fit, Statistics
Peer reviewed
PDF on ERIC
Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
Peer reviewed
Randall, Jennifer – Educational Assessment, 2023
In a justice-oriented antiracist assessment process, attention to the disruption of white supremacy must occur at every stage--from construct articulation to score reporting. An important step in the assessment development process is the item review stage often referred to as Bias/Fairness and Sensitivity Review. I argue that typical approaches to…
Descriptors: Social Justice, Racism, Test Bias, Test Items
Peer reviewed
Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023
The portion of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be treated as a planned missing data scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…
Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence