ERIC - Search Results

Publication Date

In 2025	9
Since 2024	19
Since 2021 (last 5 years)	67
Since 2016 (last 10 years)	133
Since 2006 (last 20 years)	207

Descriptor

Difficulty Level	215
Test Items	215
Foreign Countries	104
Undergraduate Students	81
Item Response Theory	64
Test Reliability	53
Item Analysis	52
Multiple Choice Tests	52
Test Construction	51
Test Validity	43
College Students	42
Scores	37
Test Format	35
Comparative Analysis	34
College Entrance Examinations	33
Science Tests	31
Correlation	30
Statistical Analysis	27
Language Tests	26
English (Second Language)	24
Preservice Teachers	24
Mathematics Tests	22
Psychometrics	22
Second Language Learning	21
Student Attitudes	20
More ▼

Publication Type

Journal Articles	192
Reports - Research	182
Tests/Questionnaires	12
Reports - Descriptive	11
Reports - Evaluative	11
Dissertations/Theses -…	7
Speeches/Meeting Papers	7
Numerical/Quantitative Data	2
Collected Works - Proceedings	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
Reports - General	1
More ▼

Education Level

Postsecondary Education	215
Higher Education	214
Secondary Education	23
Elementary Education	10
High Schools	10
Junior High Schools	4
Middle Schools	4
Elementary Secondary Education	3
Adult Education	1
Early Childhood Education	1
Grade 12	1
Grade 3	1
Primary Education	1
Two Year Colleges	1
More ▼

Audience

Teachers

Location

Turkey	11
Canada	10
Australia	9
Indonesia	8
Iran	8
China	6
Germany	6
United Kingdom	6
Japan	4
United States	4
Colorado	3
Nigeria	3
South Africa	3
Thailand	3
United Kingdom (England)	3
Croatia	2
India	2
Ireland	2
Jordan	2
New York	2
Oman	2
Russia	2
Saudi Arabia	2
Singapore	2
Africa	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	12
Graduate Record Examinations	7
International English…	2
Remote Associates Test	2
Test of English as a Foreign…	2
ACT Assessment	1
Advanced Placement…	1
Big Five Inventory	1
Defining Issues Test	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
Praxis Series	1
Program for International…	1
Raven Progressive Matrices	1
Test of English for…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 215 results Save | Export

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Application of Two-Parameter Item Response Theory for Determining Form-Dependent Items on Exams Using Different Item Orders

Peer reviewed
PDF on ERIC

Download full text

Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023

Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…

Descriptors: Item Response Theory, Test Items, Test Format, Science Tests

Beyond Item Analysis: Connecting Student Behaviour and Performance Using E-Assessment Logs

Peer reviewed

Direct link

Lahza, Hatim; Smith, Tammy G.; Khosravi, Hassan – British Journal of Educational Technology, 2023

Traditional item analyses such as classical test theory (CTT) use exam-taker responses to assessment items to approximate their difficulty and discrimination. The increased adoption by educational institutions of electronic assessment platforms (EAPs) provides new avenues for assessment analytics by capturing detailed logs of an exam-taker's…

Descriptors: Medical Students, Evaluation, Computer Assisted Testing, Time Factors (Learning)

Argument-Based Validation of Chulalongkorn University Language Institute (CULI) Test: A Rasch-Based Evidence Investigation

Peer reviewed

Direct link

Apichat Khamboonruang – Language Testing in Asia, 2025

Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…

Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests

Utilizing Linear Logistic Test Models to Explore Item Characteristics of Medical Subspecialty Certification Examinations

Peer reviewed

Direct link

Emily K. Toutkoushian; Huaping Sun; Mark T. Keegan; Ann E. Harman – Measurement: Interdisciplinary Research and Perspectives, 2024

Linear logistic test models (LLTMs), leveraging item response theory and linear regression, offer an elegant method for learning about item characteristics in complex content areas. This study used LLTMs to model single-best-answer, multiple-choice-question response data from two medical subspecialty certification examinations in multiple years…

Descriptors: Licensing Examinations (Professions), Certification, Medical Students, Test Items

The Nature and Prevalence of Diagnostic Testing in Mathematics at Tertiary-Level in Ireland

Peer reviewed

Direct link

Hyland, Diarmaid; O'Shea, Ann – Teaching Mathematics and Its Applications, 2022

In this study, we conducted a survey of all tertiary level institutions in Ireland to find out how many of them use diagnostic tests, and what kind of mathematical content areas and topics appear on these tests. The information gathered provides an insight into what instructors expect students to know on entry to university and what they expect…

Descriptors: Foreign Countries, Diagnostic Tests, Mathematics Tests, College Freshmen

Meeting Students Where They Are: Using Rasch Modeling for Improving the Measurement of Active Research in Higher Education

Peer reviewed

Direct link

Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023

Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…

Descriptors: Surveys, Rating Scales, Higher Education, Educational Research

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed
PDF on ERIC

Download full text

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Does Question Order Matter on Online Math Assessments? A Big Data Analysis of Undergraduate Mathematics Final Exams

Peer reviewed

Direct link

Gruss, Richard; Clemons, Josh – Journal of Computer Assisted Learning, 2023

Background: The sudden growth in online instruction due to COVID-19 restrictions has given renewed urgency to questions about remote learning that have remained unresolved. Web-based assessment software provides instructors an array of options for varying testing parameters, but the pedagogical impacts of some of these variations has yet to be…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Mathematics Tests

From Investigating the Alignment of a Priori Item Characteristics Based on the CTT and Four-Parameter Logistic (4-PL) IRT Models to Further Exploring the Comparability of the Two Models

Peer reviewed
PDF on ERIC

Download full text

Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024

The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…

Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Direct link

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Evaluating the Effectiveness of a Computerized Achievement Test Using Learn Smart for Psychometric Assessment under Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025

This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…

Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory

Influence of Selected-Response Format Variants on Test Characteristics and Test-Taking Effort: An Empirical Study. Research Report. ETS RR-22-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Rios, Joseph A.; Ling, Guangming; Wang, Zhen; Gu, Lin; Yang, Zhitong; Liu, Lydia O. – ETS Research Report Series, 2022

Different variants of the selected-response (SR) item type have been developed for various reasons (i.e., simulating realistic situations, examining critical-thinking and/or problem-solving skills). Generally, the variants of SR item format are more complex than the traditional multiple-choice (MC) items, which may be more challenging to test…

Descriptors: Test Format, Test Wiseness, Test Items, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 15

ETS Research Report Series	11
CBE - Life Sciences Education	8
ProQuest LLC	7
Anatomical Sciences Education	5
Assessment & Evaluation in…	5
Educational and Psychological…	5
Online Submission	5
Practical Assessment,…	5
International Journal of…	4
Language Testing	4
SAGE Open	4
College Board	3
College Entrance Examination…	3
Journal of Chemical Education	3
Journal of Education and…	3
Journal of Experimental…	3
Physical Review Physics…	3
Physical Review Special…	3
Teaching of Psychology	3
Applied Measurement in…	2
Creativity Research Journal	2
Eurasian Journal of…	2
European Journal of…	2
IEEE Transactions on Education	2
International Association for…	2
More ▼

Baghaei, Purya	3
Ahmadi, Alireza	2
Alexander, Patricia A.	2
Attali, Yigal	2
Bichi, Ado Abdu	2
DiBattista, David	2
Dorans, Neil J.	2
Gierl, Mark J.	2
Gu, Lin	2
Guo, Hongwen	2
Herbert, Sandra	2
Karadag, Nejdet	2
Khoshdel, Fahimeh	2
Ling, Guangming	2
Liu, Jinghua	2
Liu, Ou Lydia	2
Livy, Sharyn	2
Maryani, Ika	2
Murphy, Kristen L.	2
O'Keeffe, Lisa	2
Perez, Kathryn E.	2
Planinic, Maja	2
Pollock, Steven J.	2
Prasetyo, Zuhdan Kun	2
Price, Rebecca M.	2
More ▼