ERIC - Search Results

Showing 1 to 15 of 46 results

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Under the Weather? The Effects of Temperature on Student Test Performance. EdWorkingPaper No. 24-910

Deven Carlson; Adam Shepardson – Annenberg Institute for School Reform at Brown University, 2024

As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a…

Descriptors: Weather, Climate, Heat, Academic Achievement

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Latent Variable Forests for Latent Variable Score Estimation

Peer reviewed

Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024

We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…

Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixiong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

Artificial Intelligence and Educational Measurement: Opportunities and Threats

Peer reviewed

Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024

I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…

Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

A Three-Step DIF Analysis of a Reading Comprehension Test across Regional Dialects to Improve Test Score Validity

Peer reviewed

Paula Elosua – Language Assessment Quarterly, 2024

In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…

Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity

A Historic Review and Empirical Revitalization of the Stages of Concern Questionnaire

Peer reviewed

Kent Anderson Seidel – School Leadership Review, 2025

This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…

Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention

Psychometric Properties of the Academic Procrastination Scale in an Iranian Sample

Peer reviewed

Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025

Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…

Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries

Standards, Accountability, and Provincial Testing: Shaping Homework and Teaching

Peer reviewed

Carolyn Clarke – in education, 2024

This ethnographic case study, situated in Newfoundland and Labrador, Canada, examined the effects of full-scale provincial testing on families, its influences on homework, and familial accountability for teaching and learning. Data were drawn from family interviews, as well as letters and documents regarding homework. Teachers sensed a significant…

Descriptors: Academic Standards, Accountability, Testing, Homework
