Publication Date
  In 2025: 17
  Since 2024: 46
  Since 2021 (last 5 years): 143
  Since 2016 (last 10 years): 301
  Since 2006 (last 20 years): 621
Descriptor
  Psychometrics: 819
  Test Items: 819
  Test Construction: 299
  Item Response Theory: 291
  Test Reliability: 224
  Test Validity: 223
  Foreign Countries: 206
  Item Analysis: 160
  Difficulty Level: 154
  Scores: 137
  Models: 113
Author
  Gierl, Mark J.: 13
  Dorans, Neil J.: 8
  Liu, Ou Lydia: 7
  Schoen, Robert C.: 7
  Reckase, Mark D.: 6
  Bejar, Isaac I.: 5
  Embretson, Susan E.: 5
  Katz, Irvin R.: 5
  Mislevy, Robert J.: 5
  Sinharay, Sandip: 5
  Baghaei, Purya: 4
Location
  Turkey: 20
  Canada: 17
  Germany: 15
  United States: 12
  China: 10
  Australia: 9
  Taiwan: 9
  Florida: 7
  Netherlands: 7
  South Korea: 7
  Nigeria: 6
Laws, Policies, & Programs
  No Child Left Behind Act 2001: 8
  Individuals with Disabilities…: 2
  Elementary and Secondary…: 1
  Lau v Nichols: 1
  National Defense Education Act: 1
  Race to the Top: 1
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2023
Integrative data analyses have recently been shown to be an effective tool for researchers interested in synthesizing datasets from multiple studies in order to draw statistical or substantive conclusions. The actual process of integrating the different datasets depends on the availability of some common measures or items reflecting the same…
Descriptors: Data Analysis, Synthesis, Test Items, Simulation
Camilla M. McMahon; Maryellen Brunson McClain; Savannah Wells; Sophia Thompson; Jeffrey D. Shahidullah – Journal of Autism and Developmental Disorders, 2025
Purpose: The goal of the current study was to conduct a substantive validity review of four autism knowledge assessments with prior psychometric support (Gillespie-Lynch in J Autism and Dev Disord 45(8):2553-2566, 2015; Harrison in J Autism and Dev Disord 47(10):3281-3295, 2017; McClain in J Autism and Dev Disord 50(3):998-1006, 2020; McMahon…
Descriptors: Measures (Individuals), Psychometrics, Test Items, Accuracy
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
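The accuracy indicator this abstract describes — the absolute difference between estimated and actual item parameter values — can be sketched in a few lines. The snippet below is an invented illustration, not the authors' actual procedure: the choice of a 2PL model and all parameter values are assumptions for demonstration only.

```python
import math

def two_pl_probability(theta, a, b):
    """2PL IRT model: probability of a correct response at ability theta,
    given discrimination a and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def parameter_recovery_error(true_params, estimated_params):
    """Mean absolute difference between true and estimated item parameters,
    the kind of accuracy indicator the study describes."""
    diffs = [abs(t - e) for t, e in zip(true_params, estimated_params)]
    return sum(diffs) / len(diffs)

# Hypothetical difficulty (b) parameters for five multiple-choice items
true_b      = [-1.2, -0.5, 0.0, 0.6, 1.3]
estimated_b = [-1.1, -0.4, 0.1, 0.5, 1.5]
print(round(parameter_recovery_error(true_b, estimated_b), 3))  # → 0.12
```

In a real simulation study the estimated values would come from fitting an IRT model to generated response data; here they are hard-coded to keep the accuracy computation itself visible.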
Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025
This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…
Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics
Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
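The article's generalized objective function is not reproduced in this snippet; shown instead is the classic maximum-information rule that such CAT item-selection algorithms typically generalize. The item pool and ability value are invented for illustration.

```python
import math

def fisher_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * p * (1 - p)."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_next_item(theta, pool, administered):
    """Pick the not-yet-administered item with maximum information at theta."""
    candidates = [(i, fisher_information(theta, a, b))
                  for i, (a, b) in enumerate(pool) if i not in administered]
    return max(candidates, key=lambda t: t[1])[0]

# Hypothetical pool of (discrimination, difficulty) pairs
pool = [(1.0, -1.0), (1.5, 0.0), (0.8, 0.5), (2.0, 1.2)]
print(select_next_item(0.0, pool, administered={3}))  # → 1
```

The highly discriminating item whose difficulty sits closest to the current ability estimate wins, which is why pure maximum-information selection tends to overexpose a few items — one motivation for the broader objective functions the article studies.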
Fu, Yanyan; Choe, Edison M.; Lim, Hwanggyu; Choi, Jaehwa – Educational Measurement: Issues and Practice, 2022
This case study applied the "weak theory" of Automatic Item Generation (AIG) to generate isomorphic item instances (i.e., unique but psychometrically equivalent items) for a large-scale assessment. Three representative instances were selected from each item template (i.e., model) and pilot-tested. In addition, a new analytical framework,…
Descriptors: Test Items, Measurement, Psychometrics, Test Construction
Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023
To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…
Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level
Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023
There is no need or point in testing the knowledge, attributes, traits, behaviours, or abilities of an individual if the information obtained from the test is inaccurate. However, by and large, the estimation of the psychometric properties of test items in classrooms appears to have been largely ignored, or to be slowly dying out, in most testing environments. In…
Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis
Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025
Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…
Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment
Lin Ma – ProQuest LLC, 2024
This dissertation presents an innovative approach to examining the keying method, wording method, and construct validity on psychometric instruments. By employing a mixed methods explanatory sequential design, the effects of keying and wording in two psychometric assessments were examined and validated. Those two self-report psychometric…
Descriptors: Evaluation, Psychometrics, Measures (Individuals), Instrumentation
Hauke Hermann; Annemieke Witte; Gloria Kempelmann; Brian F. Barrett; Sandra Zaal; Jolanda Vonk; Filip Morisse; Anna Pöhlmann; Paula S. Sterkenburg; Tanja Sappok – Journal of Applied Research in Intellectual Disabilities, 2024
Background: Valid and reliable instruments for measuring emotional development are critical for a proper diagnostic assignment in individuals with intellectual disabilities. This exploratory study examined the psychometric properties of the items on the Scale of Emotional Development--Short (SED-S). Method: The sample included 612 adults with…
Descriptors: Measures (Individuals), Emotional Development, Intellectual Disability, Psychometrics
Mingjia Ma – ProQuest LLC, 2023
Response time is an important research topic in the field of psychometrics. This dissertation tries to explore some response time properties across several item characteristics and examinee characteristics, as well as the interactions between response time and response outcomes, using data from a statewide mathematics assessment in two grades.…
Descriptors: Reaction Time, Mathematics Tests, Standardized Tests, State Standards
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
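Cronbach's alpha, the first of the classical reliability measures this study compares, can be computed directly from an examinee-by-item score matrix. The sketch below uses an invented 5-person, 3-item matrix; it illustrates the formula alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total scores), not the study's data.

```python
def variance(xs):
    """Sample variance (n - 1 denominator)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

def cronbach_alpha(scores):
    """Cronbach's alpha for a matrix of rows (persons) x columns (items)."""
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical scores: 5 examinees x 3 items
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
    [5, 4, 5],
]
print(round(cronbach_alpha(scores), 3))  # → 0.956
```

Test-retest and parallel-forms reliability, by contrast, need two administrations rather than one matrix, which is part of why single-administration internal-consistency indices like alpha are so widely reported.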