ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	25
Since 2006 (last 20 years)	48

Descriptor

Scores	86
Test Length	86
Test Items	27
Item Response Theory	26
Test Reliability	23
Computer Assisted Testing	17
Comparative Analysis	15
Error of Measurement	15
Test Construction	15
Test Format	15
Simulation	13
Statistical Analysis	13
Reliability	11
Test Validity	11
Language Tests	10
Psychometrics	10
Sample Size	10
Higher Education	9
Foreign Countries	8
Adaptive Testing	7
Bayesian Statistics	7
Computation	7
Estimation (Mathematics)	7
Goodness of Fit	7
Language Proficiency	7
More ▼

Publication Type

Reports - Research	58
Journal Articles	54
Reports - Evaluative	21
Speeches/Meeting Papers	13
Dissertations/Theses -…	3
Numerical/Quantitative Data	2
Reports - Descriptive	2
Information Analyses	1
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	6
Postsecondary Education	5
Elementary Education	4
Middle Schools	4
Secondary Education	4
Elementary Secondary Education	3
High Schools	3
Junior High Schools	2
Grade 11	1
Grade 12	1
Grade 6	1
Grade 7	1
Intermediate Grades	1
More ▼

Audience

Researchers

Location

Netherlands	2
Asia	1
Canada	1
China	1
Florida	1
Iran	1
Michigan	1
South Korea	1
Turkey	1

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 86 results Save | Export

What Are the Conditions Associated with Subscore Added Value Noninvariance? Implications for Improving Subscore Interpretation Fairness

Peer reviewed

Direct link

Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021

Subscore added value analyses assume invariance across test taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…

Descriptors: Scores, Test Length, Ability, Correlation

An Empirical Evaluation of Lexical Diversity Indices in L2 Korean Writing Assessment

Peer reviewed

Direct link

Hakyung Sung; Sooyeon Cho; Kristopher Kyle – Language Assessment Quarterly, 2024

Lexical diversity (LD) is an important indicator of second language lexical development. Much research has investigated LD indices, with a focus on learners of English. However, further research is needed in languages that are typologically distinct from English, such as Korean. In this study, we evaluated the reliability and validity of LD…

Descriptors: Second Language Learning, Korean, Persuasive Discourse, Language Tests

Evaluation of Factors Affecting the Performance of the "S - X[superscript 2]" Item-Fit Index

Peer reviewed

Direct link

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

Multidimensional Forced-Choice CAT with Dominance Items: An Empirical Comparison with Optimal Static Testing under Different Desirability Matching

Peer reviewed

Direct link

Lin, Yin; Brown, Anna; Williams, Paul – Educational and Psychological Measurement, 2023

Several forced-choice (FC) computerized adaptive tests (CATs) have emerged in the field of organizational psychology, all of them employing ideal-point items. However, despite most items developed historically follow dominance response models, research on FC CAT using dominance items is limited. Existing research is heavily dominated by…

Descriptors: Measurement Techniques, Computer Assisted Testing, Adaptive Testing, Industrial Psychology

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue Zhang; Chun Wang – Grantee Submission, 2022

Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…

Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Direct link

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Computerized and Traditional Administration of Questionnaires: Psychometric Quality and Completion Time for Measures of Self-Concept

Peer reviewed

Direct link

Vispoel, Walter Peter; Morris, Carrie Ann; Sun, Linan – Journal of Experimental Education, 2019

In two independent studies of questionnaire administration, respondents completed multidimensional self-concept inventories within four randomized research conditions that mirrored the most common administration formats used in practice: paper booklets with and without answer sheets and computer questionnaires with single versus multiple items per…

Descriptors: Self Concept Measures, Computer Assisted Testing, Questionnaires, Psychometrics

Test Review: Current Options in At-Home Language Proficiency Tests for Making High-Stakes Decisions

Peer reviewed

Direct link

Isbell, Daniel R.; Kremmel, Benjamin – Language Testing, 2020

Administration of high-stakes language proficiency tests has been disrupted in many parts of the world as a result of the 2019 novel coronavirus pandemic. Institutions that rely on test scores have been forced to adapt, and in many cases this means using scores from a different test, or a new online version of an existing test, that can be taken…

Descriptors: Language Tests, High Stakes Tests, Language Proficiency, Second Language Learning

Interaction of Proctoring and Student Major on Online Test Performance

Peer reviewed
PDF on ERIC

Download full text

Alessio, Helaine M.; Malay, Nancy; Maurer, Karsten; Bailer, A. John; Rubin, Beth – International Review of Research in Open and Distributed Learning, 2018

Traditional and online university courses share expectations for quality content and rigor. Student and faculty concerns about compromised academic integrity and actual instances of academic dishonesty in assessments, especially with online testing, are increasingly troublesome. Recent research suggests that in the absence of proctoring, the time…

Descriptors: Supervision, Majors (Students), Computer Assisted Testing, Scores

A Shorter Short Version of Barron's Ego Strength Scale

Peer reviewed

Direct link

Kelly, William E.; Daughtry, Don – College Student Journal, 2018

This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…

Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

Test Review: TestDaF

Peer reviewed

Direct link

Norris, John; Drackert, Anastasia – Language Testing, 2018

The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…

Descriptors: German, Second Language Learning, Language Tests, Language Proficiency

Designing CAT MOCCA: Guiding Principles and Simulation Research. MOCCA Technical Report MTR-2021-1

Peer reviewed
PDF on ERIC

Download full text

Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021

MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…

Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Applied Psychological…	6
Journal of Educational…	6
Educational and Psychological…	5
Applied Measurement in…	4
ETS Research Report Series	4
Language Testing	4
Grantee Submission	3
ProQuest LLC	3
Educational Measurement:…	2
International Journal of…	2
ACT, Inc.	1
Asia Pacific Education Review	1
Assessment & Evaluation in…	1
Center on Children and…	1
College Board	1
College Entrance Examination…	1
College Student Journal	1
ERS Spectrum	1
Education and Information…	1
European Journal of Science…	1
European Journal of Special…	1
International Review of…	1
Journal of College Reading…	1
Journal of Educational…	1
Journal of Experimental…	1
More ▼

Hambleton, Ronald K.	3
Livingston, Samuel A.	3
Kolen, Michael J.	2
Lee, Won-Chan	2
Lee, Yi-Hsuan	2
Lewis, Charles	2
McBride, James R.	2
Meijer, Rob R.	2
Sijtsma, Klaas	2
Zhang, Jinming	2
Alessio, Helaine M.	1
Allen, Nancy L.	1
Allspach, Jill R.	1
Anthony, Christopher James	1
Baba, Kyoko	1
Bailer, A. John	1
Bauer, Ernest A.	1
Bazaldua, Diego A. Luna	1
Bridgeman, Brent	1
Brown, Anna	1
Bruce, K.	1
Burton, Nancy	1
Burton, Richard F.	1
Campbell, Todd	1
More ▼

Test of English as a Foreign…	4
SAT (College Admission Test)	3
Graduate Record Examinations	2
Trends in International…	2
ACTFL Oral Proficiency…	1
Advanced Placement…	1
Armed Forces Qualification…	1
Bem Sex Role Inventory	1
California Critical Thinking…	1
California Psychological…	1
Florida Comprehensive…	1
International English…	1
Iowa Tests of Basic Skills	1
Matching Familiar Figures Test	1
Peabody Picture Vocabulary…	1
Preliminary Scholastic…	1
Program for International…	1
Self Description Questionnaire	1
Watson Glaser Critical…	1
More ▼