ERIC - Search Results

Showing 1 to 15 of 125 results

A Comparison of Anchor Selection Strategies for DIF Analysis

Peer reviewed

Haeju Lee; Kyung Yong Kim – Journal of Educational Measurement, 2025

When no prior information of differential item functioning (DIF) exists for items in a test, either the rank-based or iterative purification procedure might be preferred. The rank-based purification selects anchor items based on a preliminary DIF test. For a preliminary DIF test, likelihood ratio test (LRT) based approaches (e.g.,…

Descriptors: Test Items, Equated Scores, Test Bias, Accuracy

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

Another Look at Yen's Q3: Is 0.2 an Appropriate Cut-Off?

Peer reviewed

Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025

This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…

Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms

Peer reviewed

Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…

Descriptors: Generalizability Theory, Automation, Test Items, Students

The Frequency, Type, and Function of Visual Displays in Upper Elementary Standardized Science Tests

Peer reviewed

Daibao Guo; Katherine Landau Wright; Lianne Josbacher; Eun Hye Son – Elementary School Journal, 2025

Limited research has explored the use of visual displays (ViDis) in science tests, making it challenging to know how these tests align with classroom instruction and what skills students need to be successful on these tests. Therefore, the current study aims to describe the use of ViDis in upper elementary grade standardized science tests. We…

Descriptors: Standardized Tests, Science Tests, Elementary Education, Science Education

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

Investigating Approaches to Controlling Item Position Effects in Computerized Adaptive Tests

Peer reviewed

Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025

Item position effect (IPE) refers to situations where an item performs differently when it is administered in different positions on a test. The majority of previous research studies have focused on investigating IPE under linear testing. There is a lack of IPE research under adaptive testing. In addition, the existence of IPE might violate Item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Civic and Citizenship Education, Global Citizenship Education, and Education for Sustainable Development: An Analysis of Their Integrated Conceptualization and Measurement in the International Civic and Citizenship Education Study (ICCS) 2016 and 2022

Peer reviewed

Valeria Damiani; Julian Fraillon – Large-scale Assessments in Education, 2025

Globalization and its impact on contemporary societies have gained new impetus with the notions of global citizenship education (GCED) and education for sustainable development (ESD), considered, together with civic and citizenship education (CCE), as a means for promoting students' engagement in global/local issues and providing them with the…

Descriptors: Civics, Citizenship Education, Global Approach, Sustainable Development

Autism Knowledge Assessments: A Closer Examination of Validity by Autism Experts

Peer reviewed

Camilla M. McMahon; Maryellen Brunson McClain; Savannah Wells; Sophia Thompson; Jeffrey D. Shahidullah – Journal of Autism and Developmental Disorders, 2025

Purpose: The goal of the current study was to conduct a substantive validity review of four autism knowledge assessments with prior psychometric support (Gillespie-Lynch in J Autism and Dev Disord 45(8):2553-2566, 2015; Harrison in J Autism and Dev Disord 47(10):3281-3295, 2017; McClain in J Autism and Dev Disord 50(3):998-1006, 2020; McMahon…

Descriptors: Measures (Individuals), Psychometrics, Test Items, Accuracy

Influences of Carry-Over Effects across Scales on Mediation Analyses

Peer reviewed

Kuan-Yu Jin; Yi-Jhen Wu; Ming Ming Chiu – Measurement: Interdisciplinary Research and Perspectives, 2025

Many education tests and psychological surveys elicit respondent views of similar constructs across scenarios (e.g., story followed by multiple choice questions) by repeating common statements across scales (one-statement-multiple-scale, OSMS). However, a respondent's earlier responses to the common statement can affect later responses to it…

Descriptors: Administrator Surveys, Teacher Surveys, Responses, Test Items

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Examining the Wording Effect: What Are We Measuring?

Peer reviewed

Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025

In this study, the extent to which wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that are semantically reversed, explains 21.5% more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…

Descriptors: Test Items, Factor Structure, Test Reliability, Semantics

Measuring Austrian Students' Procedural Knowledge at the End of Upper Secondary Level

Peer reviewed

Christoph Ableitinger; Christian Dorner – International Journal of Mathematical Education in Science and Technology, 2025

The number of complaints university lecturers make about a lack of knowledge, especially first-year students' procedural knowledge, has increased recently. Due to a lack of adequate empirical evidence, a survey of procedural knowledge among students of Austrian high schools in their final year was conducted. For this purpose, test items for…

Descriptors: Knowledge Level, Cognitive Processes, High School Seniors, Foreign Countries
