ERIC - Search Results

Publication Date

In 2025	7
Since 2024	9
Since 2021 (last 5 years)	33
Since 2016 (last 10 years)	65
Since 2006 (last 20 years)	111

Descriptor

Difficulty Level	155
Psychometrics	155
Test Items	155
Item Response Theory	76
Test Construction	49
Foreign Countries	46
Test Reliability	46
Test Validity	38
Item Analysis	36
Multiple Choice Tests	27
Scores	26
Models	25
Comparative Analysis	23
Correlation	19
Scoring	18
Computer Assisted Testing	17
Elementary School Students	16
Mathematics Tests	15
Statistical Analysis	15
Test Format	15
High School Students	14
Goodness of Fit	13
Guessing (Tests)	12
Cognitive Processes	11
Measurement Techniques	11
More ▼

Publication Type

Reports - Research	113
Journal Articles	110
Reports - Evaluative	15
Speeches/Meeting Papers	12
Reports - Descriptive	11
Dissertations/Theses -…	9
Numerical/Quantitative Data	4
Tests/Questionnaires	4
Information Analyses	2
Guides - Non-Classroom	1

Education Level

Higher Education	26
Elementary Education	24
Secondary Education	24
Postsecondary Education	22
High Schools	13
Junior High Schools	10
Middle Schools	10
Grade 2	7
Primary Education	7
Early Childhood Education	6
Elementary Secondary Education	4
Grade 1	4
Grade 3	4
Intermediate Grades	4
Kindergarten	4
Grade 4	3
Grade 5	3
Grade 8	3
Grade 12	2
Grade 6	2
Grade 7	1
More ▼

Audience

Researchers	3
Teachers	1

Location

Nigeria	5
Canada	4
Turkey	4
Germany	3
Greece	3
Indonesia	3
New York	3
South Korea	3
United States	3
Australia	2
Jordan	2
Taiwan	2
United Kingdom	2
Asia	1
Bulgaria	1
Canada (Montreal)	1
China	1
Cyprus	1
Dominica	1
Florida	1
France	1
Georgia	1
Grenada	1
Hong Kong	1
India	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 155 results Save | Export

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

Do Subject Matter Experts' Judgments of Multiple-Choice Format Suitability Predict Item Quality?

Peer reviewed

Direct link

Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023

To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…

Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Investigating Construct Validity of Cognitive Load Measurement Using Single-Item Subjective Rating Scales

Peer reviewed

Direct link

Katrin Schuessler; Vanessa Fischer; Maik Walpuski – Instructional Science: An International Journal of the Learning Sciences, 2025

Cognitive load studies are mostly centered on information on perceived cognitive load. Single-item subjective rating scales are the dominant measurement practice to investigate overall cognitive load. Usually, either invested mental effort or perceived task difficulty is used as an overall cognitive load measure. However, the extent to which the…

Descriptors: Cognitive Processes, Difficulty Level, Rating Scales, Construct Validity

Argument-Based Validation of Chulalongkorn University Language Institute (CULI) Test: A Rasch-Based Evidence Investigation

Peer reviewed

Direct link

Apichat Khamboonruang – Language Testing in Asia, 2025

Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…

Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests

Meeting Students Where They Are: Using Rasch Modeling for Improving the Measurement of Active Research in Higher Education

Peer reviewed

Direct link

Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023

Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…

Descriptors: Surveys, Rating Scales, Higher Education, Educational Research

The Knowledge of Autism Questionnaire-UK: Development and Initial Psychometric Evaluation

Peer reviewed

Direct link

Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025

Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…

Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics

Evaluating the Effectiveness of a Computerized Achievement Test Using Learn Smart for Psychometric Assessment under Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025

This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…

Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory

Assessing Mode Effects of At-Home Testing without a Randomized Trial. Research Report. ETS RR-21-10

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021

In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…

Descriptors: Testing, Distance Education, Comparative Analysis, Test Items

Reliability and Validity Evidence of Diagnostic Methods: Comparison of Diagnostic Classification Models and Item Response Theory-Based Methods

Direct link

Yoo Jeong Jang – ProQuest LLC, 2022

Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…

Descriptors: Classification, Accuracy, Item Response Theory, Correlation

Evaluating Gordon's Primary Measures of Music Audiation with a National Sample: An Examination of Its Psychometric Properties and Usefulness

Direct link

Bacon, Terrence E. – ProQuest LLC, 2023

The purpose of this study was to investigate developmental music aptitude with a broader sample in order to propose national norms. Research questions were: 1) To what extent are published Primary Measures of Music Aptitude (PMMA) norms different from those established using a current sample? 2) Are there comparative differences in PMMA item…

Descriptors: Psychometrics, Music, Aptitude Tests, Test Items

Taking Inventory of the Creative Behavior Inventory: An Item Response Theory Analysis of the CBI

Peer reviewed

Direct link

Rodriguez, Rebekah M.; Silvia, Paul J.; Kaufman, James C.; Reiter-Palmon, Roni; Puryear, Jeb S. – Creativity Research Journal, 2023

The original 90-item Creative Behavior Inventory (CBI) was a landmark self-report scale in creativity research, and the 28-item brief form developed nearly 20 years ago continues to be a popular measure of everyday creativity. Relatively little is known, however, about the psychometric properties of this widely used scale. In the current research,…

Descriptors: Creativity Tests, Creativity, Creative Thinking, Psychometrics

Disentangling Person-Dependent and Item-Dependent Causal Effects: Applications of Item Response Theory to the Estimation of Treatment Effect Heterogeneity

Peer reviewed

Direct link

Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025

Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…

Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics

Examining the Psychometric Properties of the Chemistry Self-Concept Inventory Using Rasch Modeling

Peer reviewed

Direct link

Stephanie M. Werner; Ying Chen; Mike Stieff – Journal of Chemical Education, 2021

The Chemistry Self-Concept Inventory (CSCI) is a widely used instrument within chemistry education research. Yet, agreement on its overall reliability and validity is lacking, and psychometric analyses of the instrument remain outstanding. This study examined the psychometric properties of the subscale and item function of the CSCI on 1140 high…

Descriptors: Self Concept Measures, Chemistry, Psychometrics, Item Response Theory

Test Score Equating of Multiple-Choice Mathematics Items: Techniques from Characteristic Curve of Modern Psychometric Theory

Peer reviewed

Direct link

Musa Adekunle Ayanwale – Discover Education, 2023

Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…

Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

ETS Research Report Series	9
ProQuest LLC	9
Educational and Psychological…	8
Journal of Educational…	6
Applied Measurement in…	5
Grantee Submission	5
Applied Psychological…	4
International Journal of…	3
Journal of Psychoeducational…	3
Psychometrika	3
Advances in Health Sciences…	2
College Board	2
International Journal of…	2
Journal of Chemical Education	2
Journal of Education and…	2
Journal of Educational and…	2
Language Testing	2
Malaysian Journal of Learning…	2
Online Submission	2
SAGE Open	2
American Journal of…	1
Annenberg Institute for…	1
Applied Cognitive Psychology	1
Assessment & Evaluation in…	1
Assessment for Effective…	1
More ▼

Lord, Frederic M.	4
Paek, Insu	4
Bejar, Isaac I.	3
Schoen, Robert C.	3
Yang, Xiaotong	3
Benjamin W. Domingue	2
Dorans, Neil J.	2
Gierl, Mark J.	2
Gorin, Joanna S.	2
Joshua B. Gilbert	2
Katz, Irvin R.	2
Liu, Sicong	2
Luke W. Miratrix	2
Martinez, Michael E.	2
Mike Stieff	2
Mridul Joshi	2
Revuelta, Javier	2
Smith, Richard M.	2
Stephanie M. Werner	2
Ying Chen	2
Yocom, Peter	2
Aborisade, Olatunbosun James	1
Adeleke, A. A.	1
Ahmed Al - Badri	1
More ▼

Graduate Record Examinations	4
SAT (College Admission Test)	3
Hidden Figures Test	2
Raven Progressive Matrices	2
Adult Attachment Interview	1
Armed Services Vocational…	1
Bender Visual Motor Gestalt…	1
Childrens Manifest Anxiety…	1
Comprehensive Tests of Basic…	1
Flesch Kincaid Grade Level…	1
Goodenough Harris Drawing Test	1
Medical College Admission Test	1
Metropolitan Achievement Tests	1
Peabody Developmental Motor…	1
Peabody Picture Vocabulary…	1
Program for International…	1
Sentence Completion Test	1
Stanford Achievement Tests	1
Test of English as a Foreign…	1
Test of English for…	1
Trends in International…	1
Wechsler Intelligence Scale…	1
More ▼