Publication Date
  In 2025: 1
  Since 2024: 2
  Since 2021 (last 5 years): 30
  Since 2016 (last 10 years): 78
  Since 2006 (last 20 years): 145
Descriptor
  Comparative Analysis: 214
  Difficulty Level: 214
  Test Items: 214
  Item Response Theory: 71
  Item Analysis: 64
  Foreign Countries: 54
  Correlation: 38
  Test Format: 37
  Scores: 36
  Test Construction: 36
  Multiple Choice Tests: 32
Author
  DeBoer, George E.: 3
  Herrmann-Abell, Cari F.: 3
  Hsu, Tse-Chi: 3
  Kim, Sooyeon: 3
  Sinharay, Sandip: 3
  Benson, Jeri: 2
  Benton, Tom: 2
  Beretvas, S. Natasha: 2
  Brutten, Sheila R.: 2
  Cai, Li: 2
  Cohen, Allan S.: 2
Audience
  Researchers: 3
Location
  Australia: 5
  Germany: 5
  Indonesia: 5
  United States: 5
  South Korea: 4
  Turkey: 4
  Japan: 3
  Nigeria: 3
  United Kingdom (England): 3
  Belgium: 2
  District of Columbia: 2
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. IRT also allows the difficulty and discrimination of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
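The abstract above rests on IRT's core idea: each item carries a difficulty and a discrimination parameter. As a minimal sketch (my own illustration, not the authors' model or code), the two-parameter logistic (2PL) item response function in Python:

```python
import math

def p_correct(theta: float, b: float, a: float = 1.0) -> float:
    """2PL item response function: probability that a person with
    ability theta answers correctly an item with difficulty b and
    discrimination a."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# When ability equals item difficulty, the probability is exactly 0.5.
print(p_correct(theta=0.0, b=0.0))  # 0.5
# A more discriminating item separates abilities more sharply around b.
print(p_correct(1.0, 0.0, a=2.0) > p_correct(1.0, 0.0, a=0.5))  # True
```

In practice the parameters are estimated from response data (e.g., by marginal maximum likelihood in packages such as R's mirt); the function above only evaluates a fitted curve.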
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research on test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone for interpreting PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper uses the Many-Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Liu, Jinghua; Becker, Kirk – Journal of Educational Measurement, 2022
For any testing programs that administer multiple forms across multiple years, maintaining score comparability via equating is essential. With continuous testing and high-stakes results, especially with less secure online administrations, testing programs must consider the potential for cheating on their exams. This study used empirical and…
Descriptors: Cheating, Item Response Theory, Scores, High Stakes Tests
Benton, Tom – Research Matters, 2020
This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…
Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis
Hayat, Bahrul – Cogent Education, 2022
This study aims to (1) calibrate the Basic Statistics Test for Indonesian undergraduate psychology students using the Rasch model, (2) test the impact of adjusting for guessing on item parameters, person parameters, test reliability, and the distribution of item difficulty and person ability, and (3) compare person scores…
Descriptors: Guessing (Tests), Statistics Education, Undergraduate Students, Psychology
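The entry above compares Rasch calibration with and without an adjustment for guessing. A hypothetical sketch of the two response curves (the fixed pseudo-guessing floor c is my assumption for illustration, not the paper's correction method):

```python
import math

def rasch(theta: float, b: float) -> float:
    """Rasch (one-parameter logistic) probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def rasch_with_guessing(theta: float, b: float, c: float = 0.25) -> float:
    """Rasch probability with a fixed pseudo-guessing floor c,
    e.g. c = 0.25 for four-option multiple-choice items."""
    return c + (1.0 - c) * rasch(theta, b)

# Guessing mainly lifts the curve for low-ability examinees:
print(round(rasch(-3.0, 0.0), 3), round(rasch_with_guessing(-3.0, 0.0), 3))
# 0.047 0.286
```

Ignoring a real guessing floor inflates apparent ability at the low end, which is why the study examines its impact on person and item parameters.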
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2022
As implementation of the "Next Generation Science Standards" moves forward, there is a need for new assessments that can measure students' integrated three-dimensional science learning. The National Research Council has suggested that these assessments be multicomponent tasks that utilize a combination of item formats including…
Descriptors: Multiple Choice Tests, Conditioning, Test Items, Item Response Theory
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have often been reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Bacon, Terrence E. – ProQuest LLC, 2023
The purpose of this study was to investigate developmental music aptitude with a broader sample in order to propose national norms. Research questions were: 1) To what extent are published Primary Measures of Music Aptitude (PMMA) norms different from those established using a current sample? 2) Are there comparative differences in PMMA item…
Descriptors: Psychometrics, Music, Aptitude Tests, Test Items
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents, and the number of ability levels, this study aims to provide a closed formula for adaptive tests of medium difficulty (probability of solution p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
Thacker, Nathan L. – ProQuest LLC, 2023
Organic chemistry is a class well known to be difficult and necessary for many careers in the sciences, and as a result, has garnered interest in researching ways to improve student learning and comprehension. One potential way involves using eye tracking techniques to understand how students visually examine questions. Organic chemistry involves…
Descriptors: Science Instruction, Multiple Choice Tests, Organic Chemistry, Science Tests
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
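Score equating of the kind this entry describes can take several forms; the simplest is linear (mean-sigma) equating, sketched below under a randomly-equivalent-groups assumption (the score data are made up for illustration, not from WAEC or NABTEB):

```python
import statistics

def mean_sigma_equate(x_scores, y_scores):
    """Linear (mean-sigma) equating: map raw scores on form X onto
    the scale of form Y by matching means and standard deviations."""
    mx, sx = statistics.mean(x_scores), statistics.pstdev(x_scores)
    my, sy = statistics.mean(y_scores), statistics.pstdev(y_scores)
    slope = sy / sx
    intercept = my - slope * mx
    return lambda x: slope * x + intercept

form_x = [10, 12, 14, 16, 18]  # hypothetical raw scores on form X
form_y = [20, 24, 28, 32, 36]  # hypothetical raw scores on form Y
to_y_scale = mean_sigma_equate(form_x, form_y)
print(to_y_scale(14))  # the form X mean maps onto the form Y mean: 28.0
```

IRT-based equating, as studied in several of the entries here, instead links the item parameter scales of the two forms, but the goal is the same: scores that are comparable across forms and administrations.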
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al. (2020) describe a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023
Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…
Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time