Showing 1 to 15 of 78 results
Peer reviewed
Direct link
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. IRT allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
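The item characteristics the abstract refers to are typically expressed through a model such as the two-parameter logistic (2PL). A minimal sketch, with invented parameter values, of how difficulty and discrimination jointly determine the probability of a correct response:

    import math

    def p_correct(theta, a, b):
        """2PL IRT: probability that an examinee with ability theta answers
        an item with discrimination a and difficulty b correctly."""
        return 1.0 / (1.0 + math.exp(-a * (theta - b)))

    # Hypothetical item: discrimination a = 1.2, difficulty b = 0.5.
    for theta in (-2.0, 0.0, 2.0):
        print(f"theta = {theta:+.1f}  P(correct) = {p_correct(theta, 1.2, 0.5):.3f}")

The excerpt does not say which IRT model the authors fit; the 2PL is shown only because it makes both parameters explicit.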
Peer reviewed
Direct link
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
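The correlation analysis the abstract describes can be sketched as follows; the paired values below are invented for illustration, not data from the study.

    import statistics

    # Hypothetical paired observations: modified EI test scores and
    # corresponding TOPIK levels (invented values).
    ei_scores = [45, 52, 61, 70, 78, 85]
    topik_levels = [1, 2, 3, 4, 5, 6]

    # Pearson correlation (statistics.correlation requires Python 3.10+).
    r = statistics.correlation(ei_scores, topik_levels)
    print(f"Pearson r = {r:.3f}")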
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and the precision of item parameter estimation when applying Masters' Partial Credit Model to polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
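Masters' Partial Credit Model, whose parameters the study estimates, gives the probability of each score category from an examinee's ability and the item's step difficulties. A self-contained sketch with invented step values:

    import math

    def pcm_probs(theta, deltas):
        """Partial Credit Model: probabilities of score categories 0..m for
        one item, given ability theta and step difficulties deltas."""
        # Cumulative sums of (theta - delta_j); category 0 contributes 0.
        sums = [0.0]
        for d in deltas:
            sums.append(sums[-1] + (theta - d))
        exps = [math.exp(s) for s in sums]
        total = sum(exps)
        return [e / total for e in exps]

    # Hypothetical three-category item with step difficulties -0.5 and 0.8.
    print([round(p, 3) for p in pcm_probs(theta=0.0, deltas=[-0.5, 0.8])])

One reason sample size matters more for polytomous items than for dichotomous ones is that each step difficulty is informed mainly by responses falling in its adjacent categories, so sparse categories are estimated imprecisely.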
Peer reviewed
Direct link
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
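A toy illustration of the abstract's point, using classical proportion-correct as a stand-in for IRT difficulty: the same response vector yields different item statistics depending on whether skipped items are scored as wrong or ignored. The data are invented.

    # Hypothetical responses to one item: 1 = correct, 0 = wrong, None = skipped.
    responses = [1, 0, None, 1, None, 0, 1, 1]

    # Treatment A: score missing responses as incorrect.
    as_wrong = [0 if r is None else r for r in responses]
    p_as_wrong = sum(as_wrong) / len(as_wrong)

    # Treatment B: ignore missing responses (item treated as not administered).
    observed = [r for r in responses if r is not None]
    p_ignored = sum(observed) / len(observed)

    print(f"P+ (missing scored wrong): {p_as_wrong:.2f}")  # 0.50
    print(f"P+ (missing ignored):      {p_ignored:.2f}")   # 0.67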
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct-irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Peer reviewed
PDF on ERIC Download full text
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
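The article does not say which curve its instructors applied; a square-root curve is shown below purely as a familiar example of how curving lifts low scores more than high ones, which is one way grades can become inflated.

    import math

    def sqrt_curve(raw_percent):
        """A common grading curve: map raw percent p to 100 * sqrt(p / 100),
        which raises low scores proportionally more than high ones."""
        return 100 * math.sqrt(raw_percent / 100)

    for raw in (36, 64, 81, 100):
        print(f"raw {raw:3d}% -> curved {sqrt_curve(raw):.0f}%")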
Peer reviewed
Direct link
Liu, Jinghua; Becker, Kirk – Journal of Educational Measurement, 2022
For any testing programs that administer multiple forms across multiple years, maintaining score comparability via equating is essential. With continuous testing and high-stakes results, especially with less secure online administrations, testing programs must consider the potential for cheating on their exams. This study used empirical and…
Descriptors: Cheating, Item Response Theory, Scores, High Stakes Tests
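Score comparability across forms, which the abstract names as the motivation, is maintained by equating. A minimal linear (mean-sigma) equating sketch with invented score distributions; the study's actual equating design is not described in the excerpt.

    import statistics

    def linear_equate(x, form_x_scores, form_y_scores):
        """Linear (mean-sigma) equating: map score x on Form X onto Form Y's
        scale by matching the two forms' means and standard deviations."""
        mu_x, sd_x = statistics.mean(form_x_scores), statistics.stdev(form_x_scores)
        mu_y, sd_y = statistics.mean(form_y_scores), statistics.stdev(form_y_scores)
        return sd_y / sd_x * (x - mu_x) + mu_y

    # Hypothetical score samples from two administrations.
    form_x = [52, 60, 47, 71, 58, 63, 55]
    form_y = [49, 57, 45, 68, 54, 61, 50]
    print(f"Form X score 60 ~ Form Y score {linear_equate(60, form_x, form_y):.1f}")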
Peer reviewed
PDF on ERIC Download full text
Guo, Hongwen; Rios, Joseph A.; Ling, Guangming; Wang, Zhen; Gu, Lin; Yang, Zhitong; Liu, Lydia O. – ETS Research Report Series, 2022
Different variants of the selected-response (SR) item type have been developed for various reasons (e.g., simulating realistic situations, examining critical-thinking and/or problem-solving skills). Generally, variants of the SR item format are more complex than traditional multiple-choice (MC) items, which may be more challenging to test…
Descriptors: Test Format, Test Wiseness, Test Items, Item Response Theory
Peer reviewed
PDF on ERIC Download full text
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2022
As implementation of the "Next Generation Science Standards" moves forward, there is a need for new assessments that can measure students' integrated three-dimensional science learning. The National Research Council has suggested that these assessments be multicomponent tasks that utilize a combination of item formats including…
Descriptors: Multiple Choice Tests, Conditioning, Test Items, Item Response Theory
Peer reviewed
Direct link
Rodriguez, Rebekah M.; Silvia, Paul J.; Kaufman, James C.; Reiter-Palmon, Roni; Puryear, Jeb S. – Creativity Research Journal, 2023
The original 90-item Creative Behavior Inventory (CBI) was a landmark self-report scale in creativity research, and the 28-item brief form developed nearly 20 years ago continues to be a popular measure of everyday creativity. Relatively little is known, however, about the psychometric properties of this widely used scale. In the current research,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Psychometrics
Peer reviewed
Direct link
Peabody, Michael R.; Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2019
Differential Item Functioning (DIF) detection procedures provide validity evidence for proposed interpretations of test scores that can help researchers and practitioners ensure that test scores are free from potential bias, and that individual items do not create an advantage for any subgroup of examinees over another. In this study, we use the…
Descriptors: Item Response Theory, Test Items, Scores, Testing
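The excerpt truncates before naming the study's detection procedure, so the sketch below uses the Mantel-Haenszel statistic, one standard DIF method, with invented counts.

    import math

    def mantel_haenszel_dif(strata):
        """Mantel-Haenszel DIF for one item. Each stratum (usually a
        total-score level) is (ref_correct, ref_wrong, focal_correct,
        focal_wrong)."""
        num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
        den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
        alpha = num / den                # common odds ratio across strata
        delta = -2.35 * math.log(alpha)  # ETS delta metric
        return alpha, delta

    # Hypothetical counts for three score strata.
    strata = [(40, 10, 30, 20), (30, 20, 25, 25), (20, 30, 10, 40)]
    alpha, delta = mantel_haenszel_dif(strata)
    print(f"MH odds ratio = {alpha:.2f}, ETS delta = {delta:.2f}")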
Peer reviewed
Direct link
Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023
We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by introducing two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…
Descriptors: Computer Assisted Testing, Essays, Scoring, Scores
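One of the two procedures, the logistic transformation of AES raw scores, can be expressed as a log-odds mapping; the exact form used in the study is not given in the excerpt, so this is an assumed Rasch-style version with invented scores.

    import math

    def raw_to_logit(raw, max_score):
        """Log-odds transformation of a raw score into a logit measure:
        ln(p / (1 - p)) where p is the proportion of the maximum score."""
        p = raw / max_score
        return math.log(p / (1 - p))

    # Hypothetical essay scores on a 0-100 raw scale.
    for raw in (25, 50, 80):
        print(f"raw = {raw:3d}  logit = {raw_to_logit(raw, 100):+.2f}")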
Peer reviewed
PDF on ERIC Download full text
Guo, Hongwen; Ercikan, Kadriye – ETS Research Report Series, 2021
In this report, we demonstrate use of differential response time (DRT) methodology, an extension of differential item functioning methodology, for examining differences in how students from different backgrounds engage with assessment tasks. We analyze response time data from a digitally delivered mathematics assessment to examine timing…
Descriptors: Test Wiseness, English Language Learners, Reaction Time, Mathematics Tests
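A simplified stand-in for the DRT contrast the report describes: compare a robust timing summary between two examinee groups on the same item. Times are invented, and the full DRT methodology conditions on more than this simple difference.

    import math
    import statistics

    # Hypothetical per-examinee response times (seconds) on one item for
    # two groups (e.g., English learners vs. non-learners).
    group_a = [42.0, 55.5, 38.2, 61.0, 47.3]
    group_b = [58.1, 72.4, 49.9, 80.2, 66.0]

    def median_log_time(times):
        """Median of log response times, a robust summary of item timing."""
        return statistics.median(math.log(t) for t in times)

    print(f"Difference in median log time: "
          f"{median_log_time(group_b) - median_log_time(group_a):+.3f}")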
Peer reviewed
PDF on ERIC Download full text
Hussein, Rasha Abed; Sabit, Shaker Holh; Alwan, Merriam Ghadhanfar; Wafqan, Hussam Mohammed; Baqer, Abeer Ameen; Ali, Muneam Hussein; Hachim, Safa K.; Sahi, Zahraa Tariq; AlSalami, Huda Takleef; Sulaiman, Bahaa Aldin Fawzi – International Journal of Language Testing, 2022
Dictation is a traditional technique for both teaching and testing overall language ability and listening comprehension. In a dictation, a passage is read aloud by the teacher and examinees write down what they hear. Because of this peculiar format, the psychometric analysis of dictations is challenging. In a dictation, there is no clear…
Descriptors: Psychometrics, Verbal Communication, Teaching Methods, Language Skills
Peer reviewed
Direct link
Wolkowitz, Amanda A.; Foley, Brett P.; Zurn, Jared – Journal of Applied Testing Technology, 2021
As assessments move from traditional paper-pencil administration to computer-based administration, many testing programs are incorporating alternative item types (AITs) into assessments with the goals of measuring higher-order thinking, offering insight into problem-solving, and representing authentic real-world tasks. This paper explores multiple…
Descriptors: Psychometrics, Alternative Assessment, Computer Assisted Testing, Test Items