ERIC - Search Results

Descriptor: Test Length

Showing 16 to 30 of 624 results

A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams

Peer reviewed

Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023

This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…

Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's α: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

An Empirical Evaluation of Lexical Diversity Indices in L2 Korean Writing Assessment

Peer reviewed

Hakyung Sung; Sooyeon Cho; Kristopher Kyle – Language Assessment Quarterly, 2024

Lexical diversity (LD) is an important indicator of second language lexical development. Much research has investigated LD indices, with a focus on learners of English. However, further research is needed in languages that are typologically distinct from English, such as Korean. In this study, we evaluated the reliability and validity of LD…

Descriptors: Second Language Learning, Korean, Persuasive Discourse, Language Tests

Moving from High-Stakes Final Exams towards Multiple Small Tests during the Semester

Peer reviewed

Niclas Larson – Journal of the International Society for Teacher Education, 2024

This paper reports on a revision of the assessment model from the first mathematics course for pre-service teachers (PSTs) aiming for grades 5-10, at a Norwegian university. The weight of the final written exam was reduced and a new, mastery-based testing model, with weekly small tests, was introduced. Results from this study show that the PSTs…

Descriptors: High Stakes Tests, Test Length, Mathematics Tests, Preservice Teachers

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Construction and Factorial Validation of a Short Version of the Academic Motivation Scale

Peer reviewed

Kotera, Yasuhiro; Conway, Elaine; Green, Pauline – British Journal of Guidance & Counselling, 2023

Academic motivation is important to students' mental health and performance. One established measure is the Academic Motivation Scale (AMS), comprising 28 items. AMS assesses intrinsic motivation, extrinsic motivation, and amotivation, which are further categorised into seven subscales. One weakness of AMS is its length. In this study, we…

Descriptors: Test Construction, Test Validity, Factor Analysis, Learning Motivation

Type I Error and Power Rates: A Comparative Analysis of Techniques in Differential Item Functioning

Peer reviewed

Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023

The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's χ², and Raju's Areas…

Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

A Note on Improving Variational Estimation for Multidimensional Item Response Theory

Peer reviewed

Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…

Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Evaluation of Factors Affecting the Performance of the "S - X²" Item-Fit Index

Peer reviewed

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X²" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X²" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

Ensuring the validity of a test requires checking that all items yield similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

Validating the MUSIC Model of Academic Motivation Inventory: Evidence for the Short Forms of the College Student Version

Peer reviewed

Jones, Brett D.; Wilkins, Jesse L. M. – Journal of Psychoeducational Assessment, 2023

The purpose of this study was to investigate the validity evidence for the use of the 19-item and 20-item short forms of the MUSIC Model of Academic Motivation Inventory (College Student version) with undergraduate students. These shorter forms of the MUSIC Inventory could be beneficial to teachers and researchers. Our analysis included inventory…

Descriptors: Test Validity, Learning Motivation, Test Length, Undergraduate Students

A Comparison of Common IRT Model-Selection Methods with Mixed-Format Tests

Peer reviewed

Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021

To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…

Descriptors: Item Response Theory, Test Format, Selection, Methods

