ERIC - Search Results

Publication Date

In 2025	12
Since 2024	15
Since 2021 (last 5 years)	59
Since 2016 (last 10 years)	139
Since 2006 (last 20 years)	193

Descriptor

Item Response Theory	216
Test Items	216
Test Reliability	216
Test Validity	109
Foreign Countries	74
Difficulty Level	71
Psychometrics	68
Test Construction	64
Scores	45
Goodness of Fit	35
Item Analysis	35
Scoring	31
Multiple Choice Tests	27
Test Bias	27
Mathematics Tests	25
Science Tests	25
High School Students	24
Computer Assisted Testing	23
Factor Analysis	23
Models	22
Comparative Analysis	21
Measures (Individuals)	21
Statistical Analysis	21
Undergraduate Students	21
Correlation	20
More ▼

Publication Type

Journal Articles	169
Reports - Research	162
Reports - Evaluative	26
Reports - Descriptive	13
Speeches/Meeting Papers	13
Dissertations/Theses -…	12
Tests/Questionnaires	11
Numerical/Quantitative Data	7
Guides - General	1
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Education Level

Secondary Education	48
Higher Education	46
Postsecondary Education	38
Elementary Education	35
High Schools	29
Middle Schools	26
Junior High Schools	20
Early Childhood Education	13
Intermediate Grades	11
Elementary Secondary Education	10
Primary Education	10
Grade 8	9
Grade 5	8
Grade 6	7
Grade 7	6
Grade 1	5
Grade 2	5
Grade 3	5
Grade 4	5
Kindergarten	5
Grade 9	4
Grade 12	2
Adult Education	1
Preschool Education	1
More ▼

Audience

Location

Indonesia	15
Florida	8
Turkey	7
Germany	6
United States	6
Taiwan	5
Australia	4
Iran	4
California	3
Canada	3
China	3
Malaysia	3
New Mexico	3
Nigeria	3
Singapore	3
South Korea	3
Alabama	2
France	2
Japan	2
Oregon	2
South Africa	2
Texas	2
Utah	2
Arizona	1
Asia	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…

What Works Clearinghouse Rating

Showing 1 to 15 of 216 results Save | Export

Another Look at Yen's Q3: Is 0.2 an Appropriate Cut-Off?

Peer reviewed

Direct link

Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025

This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…

Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Direct link

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Psychometric Properties of the Academic Procrastination Scale in an Iranian Sample

Peer reviewed

Direct link

Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025

Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…

Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries

How Many Response Categories Are Sufficient for Likert Type Scales? An Empirical Study Based on the Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Aybek, Eren Can; Toraman, Cetin – International Journal of Assessment Tools in Education, 2022

The current study investigates the optimum number of response categories for the Likert type of scales under the item response theory (IRT). The data was collected from university students attend to mainly the faculty of medicine and the faculty of education. A form of the "Social Gender Equity Scale" developed by Gozutok et al. (2017)…

Descriptors: Likert Scales, Item Response Theory, College Students, Test Reliability

Psychometric Analysis of the Resonance Concept Inventory

Peer reviewed

Direct link

Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025

Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…

Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Direct link

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Application of the Rasch Model in Streamlining an Instrument Measuring Depression among College Students

Peer reviewed
PDF on ERIC

Download full text

Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023

Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…

Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability

Validity and Reliability Analysis of a Socioscientific Issues-Based Critical Thinking Self-Assessment Instrument Using the Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025

Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool is still questionable, so a more objective evaluation is needed. Objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…

Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Direct link

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Applying Alternative Method to Evaluate Online Problem-Solving Skill Inventory (OPSI) Using Rasch Model Analysis

Peer reviewed

Direct link

Che Lah, Noor Hidayah; Tasir, Zaidatun; Jumaat, Nurul Farhana – Educational Studies, 2023

The aim of the study was to evaluate the extended version of the Problem-Solving Inventory (PSI) via an online learning setting known as the Online Problem-Solving Inventory (OPSI) through the lens of Rasch Model analysis. To date, there is no extended version of the PSI for online settings even though many researchers have used it; thus, this…

Descriptors: Problem Solving, Measures (Individuals), Electronic Learning, Item Response Theory

Modeling Local Item Dependence in Cloze Tests with the Rasch Model: Applying a New Strategy

Peer reviewed
PDF on ERIC

Download full text

Barno S. Abdullaeva; Diyorjon Abdullaev; Nurislom I. Khursanov; Khurshida B. Kadirova; Laylo Djuraeva – International Journal of Language Testing, 2024

Cloze tests are commonly used in language testing as a quick measure of overall language ability or reading comprehension. A problem for the analysis of cloze tests with item response theory models is that cloze test items are locally dependent. This leads to the violation of the conditional or local independence assumption of IRT models. In this…

Descriptors: Cloze Procedure, Language Tests, Test Items, Correlation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 15

ProQuest LLC	12
Educational and Psychological…	8
Grantee Submission	7
Journal of Educational…	7
Online Submission	7
SAGE Open	6
ETS Research Report Series	5
International Journal of…	5
Applied Psychological…	4
Journal of Baltic Science…	4
Journal of Psychoeducational…	4
International Journal of…	3
International Journal of…	3
International Journal of…	3
Journal of Intelligence	3
Journal of Speech, Language,…	3
Language Testing	3
Psychometrika	3
Applied Measurement in…	2
Assessment for Effective…	2
Chemistry Education Research…	2
College Board	2
Cypriot Journal of…	2
EURASIA Journal of…	2
Educational Measurement:…	2
More ▼

Schoen, Robert C.	6
Anderson, Daniel	4
Petscher, Yaacov	4
Bauduin, Charity	3
Boone, William J.	3
Paek, Insu	3
Yang, Xiaotong	3
Zhang, Jinming	3
Bichi, Ado Abdu	2
Brown, Ted	2
Dogan, Nuri	2
Edwards, Michael C.	2
Guo, Hongwen	2
Hartig, Johannes	2
Istiyono, Edi	2
Lee, Yi-Hsuan	2
Lee, Young-Sun	2
Liu, Sicong	2
Meijer, Rob R.	2
Mike Stieff	2
Myszkowski, Nils	2
Nicewander, W. Alan	2
Retnawati, Heri	2
Segall, Daniel O.	2
More ▼

Graduate Record Examinations	5
SAT (College Admission Test)	4
ACT Assessment	2
Iowa Tests of Basic Skills	2
Stanford Achievement Tests	2
Test of English as a Foreign…	2
Trends in International…	2
Armed Forces Qualification…	1
Bruininks Oseretsky Test of…	1
Center for Epidemiologic…	1
Child Behavior Checklist	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Hidden Figures Test	1
Kaufman Test of Educational…	1
MacArthur Communicative…	1
Measures of Academic Progress	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Preliminary Scholastic…	1
Raven Progressive Matrices	1
Student Teacher Relationship…	1
Wechsler Adult Intelligence…	1
More ▼