Showing all 6 results
Peer reviewed
Direct link
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID), residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to produce different levels of item dependence due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Peer reviewed
Direct link
Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025
Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time-consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…
Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment
Peer reviewed
Direct link
Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025
Knowledge is an important predictor and outcome of learning and development. Its measurement is complicated by the fact that knowledge can be integrated and homogeneous or fragmented and heterogeneous, and that this can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…
Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level
Peer reviewed
Direct link
Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025
Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to how precisely a prompt specifies the number of responses requested. Classic…
Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis
Peer reviewed
Direct link
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Peer reviewed
PDF on ERIC
Rümeysa Kaya; Bayram Çetin – International Journal of Assessment Tools in Education, 2025
In this study, the cut-off scores obtained from the Angoff, Angoff Y/N, Nedelsky, and Ebel standard-setting methods were compared with a T score of 50 and with the current cut-off score in several respects. Data were collected from 448 students who took the Module B1+ English Exit Exam IV and from 14 experts. While the Nedelsky method gave the lowest…
Descriptors: Standard Setting, Cutting Scores, Exit Examinations, Academic Achievement