Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 29 |
Since 2006 (last 20 years) | 55 |
Descriptor
Computer Assisted Testing | 73 |
Item Response Theory | 73 |
Scores | 73 |
Test Items | 28 |
Adaptive Testing | 26 |
Comparative Analysis | 17 |
Foreign Countries | 15 |
Scoring | 14 |
Psychometrics | 13 |
Simulation | 12 |
Test Construction | 12 |
Author
Wise, Steven L. | 4 |
Andrich, David | 2 |
Capar, Nilufer K. | 2 |
Choi, Seung W. | 2 |
Davey, Tim | 2 |
Foorman, Barbara R. | 2 |
Keng, Leslie | 2 |
Kim, Dong-In | 2 |
Meijer, Rob R. | 2 |
Petscher, Yaacov | 2 |
Rizavi, Saba | 2 |
Audience
Practitioners | 1 |
Researchers | 1 |
Students | 1 |
Location
Germany | 2 |
Indonesia | 2 |
Taiwan | 2 |
Turkey | 2 |
United States | 2 |
Arkansas | 1 |
Australia | 1 |
Canada | 1 |
Colorado | 1 |
Denmark | 1 |
District of Columbia | 1 |
Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023
In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…
Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring
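To make the ensembling idea in this entry concrete, here is a minimal sketch assuming hypothetical stand-in scorers (none of these are the models from the paper): predictions from several AES models are combined by a weighted average.

```python
# Minimal ensemble-AES sketch. The three scorers are hypothetical
# stand-ins, not the models proposed in the paper.

def feature_scorer(essay: str) -> float:
    # Stand-in for a handcrafted-feature model: longer essays score
    # higher, capped at the top of a 0-6 rubric.
    return min(len(essay.split()) / 50.0, 6.0)

def dnn_scorer_a(essay: str) -> float:
    return 4.2  # stand-in for one DNN-based scorer

def dnn_scorer_b(essay: str) -> float:
    return 3.8  # stand-in for a second DNN-based scorer

def ensemble_score(essay: str, weights=(0.2, 0.4, 0.4)) -> float:
    """Weighted average of the individual model scores."""
    scores = (feature_scorer(essay), dnn_scorer_a(essay), dnn_scorer_b(essay))
    return sum(w * s for w, s in zip(weights, scores))

print(round(ensemble_score("This essay argues that testing matters. " * 40), 2))
```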
Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024
Thurstonian forced-choice modeling is considered a powerful new tool for estimating item and person parameters while simultaneously testing model fit. This assessment approach aims to reduce faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…
Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods
Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023
Using item response theory to model rater effects provides an alternative to standard performance metrics for rater monitoring and diagnosis. To fit such models, however, the ratings data must be sufficiently connected to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…
Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems
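The "sufficiently connected" requirement can be checked mechanically. A minimal sketch, assuming a hypothetical (rater, essay) log: treat raters and essays as nodes of a bipartite graph linked when a rater scored an essay, and count connected components with a union-find; more than one component means some rater effects cannot be placed on a common scale.

```python
# Minimal connectedness check for ratings data. The (rater, essay)
# log below is hypothetical.

ratings = [
    ("r1", "e1"), ("r1", "e2"),
    ("r2", "e2"), ("r2", "e3"),
    ("r3", "e4"),  # r3 shares no essay with r1/r2 -> disconnected design
]

parent = {}

def find(x):
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # path compression
        x = parent[x]
    return x

def union(a, b):
    parent[find(a)] = find(b)

for rater, essay in ratings:
    union(("rater", rater), ("essay", essay))

roots = {find(node) for node in parent}
print("connected" if len(roots) == 1 else f"{len(roots)} disjoint components")
```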
Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023
Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…
Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing
Lin, Yin; Brown, Anna; Williams, Paul – Educational and Psychological Measurement, 2023
Several forced-choice (FC) computerized adaptive tests (CATs) have emerged in the field of organizational psychology, all of them employing ideal-point items. However, although most items developed historically follow dominance response models, research on FC CAT using dominance items is limited. Existing research is heavily dominated by…
Descriptors: Measurement Techniques, Computer Assisted Testing, Adaptive Testing, Industrial Psychology
Uto, Masaki; Okano, Masashi – IEEE Transactions on Learning Technologies, 2021
In automated essay scoring (AES), scores are automatically assigned to essays as an alternative to grading by humans. Traditional AES typically relies on handcrafted features, whereas recent studies have proposed AES models based on deep neural networks to obviate the need for feature engineering. Those AES models generally require training on a…
Descriptors: Essays, Scoring, Writing Evaluation, Item Response Theory
Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022
A common practical challenge when using item response theory (IRT) models with maximum likelihood estimation (MLE) is how to assign ability estimates to all-incorrect and all-correct response patterns, since ability estimates for these response patterns equal -∞ or +∞. This article uses a simulation study and data from an operational K-12…
Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length
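A worked sketch of the -∞/+∞ problem, assuming a Rasch model with hypothetical item difficulties: the log-likelihood of an all-correct pattern increases monotonically in theta, so MLE never reaches a finite maximum.

```python
# Sketch: under a Rasch model, the log-likelihood of an all-correct
# pattern rises monotonically with theta, so its MLE is +infinity
# (all-incorrect patterns mirror this at -infinity).

import math

difficulties = [-1.0, 0.0, 1.0]  # hypothetical item difficulties b_i

def loglik_all_correct(theta: float) -> float:
    # log P(all correct | theta) = sum_i log sigmoid(theta - b_i)
    return sum(-math.log(1.0 + math.exp(b - theta)) for b in difficulties)

for theta in (0.0, 2.0, 4.0, 8.0, 16.0):
    print(f"theta={theta:>4}  logL={loglik_all_correct(theta):.6f}")
# The values approach 0 from below without attaining a maximum, which is
# why operational scoring caps, truncates, or otherwise adjusts these cases.
```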
Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023
We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing, applying two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…
Descriptors: Computer Assisted Testing, Essays, Scoring, Scores
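The paper's exact transformation is not reproduced in this abstract; the sketch below shows a generic logit transformation of raw AES scores onto an interval-like scale, with a hypothetical maximum score and a small adjustment so extreme scores stay finite.

```python
# Sketch of a logistic (logit) transformation of AES raw scores, a
# common move before comparing machine scores with Rasch-calibrated
# human grades. The score range here is hypothetical.

import math

MAX_RAW = 30.0  # hypothetical maximum raw score from the AES engine

def logit_transform(raw: float, eps: float = 0.5) -> float:
    # Shrink the proportion away from 0 and 1 so the logit stays finite.
    p = (raw + eps) / (MAX_RAW + 2 * eps)
    return math.log(p / (1.0 - p))

for raw in (0, 10, 15, 20, 30):
    print(f"raw={raw:>2}  logit={logit_transform(raw):+.3f}")
```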
Gorney, Kylie; Wollack, James A. – Practical Assessment, Research & Evaluation, 2022
Unlike the traditional multiple-choice (MC) format, the discrete-option multiple-choice (DOMC) format does not necessarily reveal all answer options to an examinee. The purpose of this study was to determine whether the reduced exposure of item content affects test security. We conducted an experiment in which participants were allowed to view…
Descriptors: Test Items, Test Format, Multiple Choice Tests, Item Analysis
Thompson, James J. – Measurement: Interdisciplinary Research and Perspectives, 2022
With the use of computerized testing, ordinary assessments can capture both answer accuracy and answer response time. For the Canadian Programme for the International Assessment of Adult Competencies (PIAAC) numeracy and literacy subtests, person ability, person speed, question difficulty, question time intensity, fluency (rate), person fluency…
Descriptors: Foreign Countries, Adults, Computer Assisted Testing, Network Analysis
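As a minimal illustration of combining accuracy and response time, assuming a hypothetical response log: fluency can be computed as a simple rate of correct answers per minute (the network models in this entry go well beyond this).

```python
# Sketch: computerized testing logs both answer accuracy and response
# time, so a crude person "fluency" (rate) falls out directly.
# The response log is hypothetical.

responses = [  # (correct, seconds_taken)
    (True, 45), (True, 60), (False, 90), (True, 30), (False, 75),
]

n_correct = sum(1 for ok, _ in responses if ok)
total_minutes = sum(t for _, t in responses) / 60.0

accuracy = n_correct / len(responses)
fluency = n_correct / total_minutes  # correct answers per minute

print(f"accuracy={accuracy:.2f}  fluency={fluency:.2f} correct/min")
```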
Clements, Douglas H.; Banse, Holland; Sarama, Julie; Tatsuoka, Curtis; Joswick, Candace; Hudyma, Aaron; Van Dine, Douglas W.; Tatsuoka, Kikumi K. – Mathematical Thinking and Learning: An International Journal, 2022
Researchers often develop instruments using correctness scores (and a variety of theories and techniques, such as Item Response Theory) for validation and scoring. Less frequently, observations of children's strategies are incorporated into the design, development, and application of assessments. We conducted individual interviews of 833…
Descriptors: Item Response Theory, Computer Assisted Testing, Test Items, Mathematics Tests
Wolkowitz, Amanda A.; Foley, Brett P.; Zurn, Jared – Journal of Applied Testing Technology, 2021
As assessments move from traditional paper-pencil administration to computer-based administration, many testing programs are incorporating alternative item types (AITs) into assessments with the goals of measuring higher-order thinking, offering insight into problem-solving, and representing authentic real-world tasks. This paper explores multiple…
Descriptors: Psychometrics, Alternative Assessment, Computer Assisted Testing, Test Items
Huang, Heng-Tsung Danny; Hung, Shao-Ting Alan; Chao, Hsiu-Yi; Chen, Jyun-Hong; Lin, Tsui-Peng; Shih, Ching-Lin – Language Assessment Quarterly, 2022
Prompted by Taiwanese university students' increasing demand for English proficiency assessment, the absence of a test designed specifically for this demographic subgroup, and the lack of a localized and freely-accessible proficiency measure, this project set out to develop and validate a computerized adaptive English proficiency testing (E-CAT)…
Descriptors: Computer Assisted Testing, English (Second Language), Second Language Learning, Second Language Instruction
Sahin, Murat Dogan; Gelbal, Selahattin – International Journal of Assessment Tools in Education, 2020
The purpose of this study was to conduct a real-time multidimensional computerized adaptive test (MCAT) using data from a previous paper-pencil test (PPT) covering the grammar and vocabulary dimensions of an end-of-term proficiency exam administered to students in a preparatory class at a university. An item pool was established through four…
Descriptors: Adaptive Testing, Computer Assisted Testing, Language Tests, Language Proficiency
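For readers unfamiliar with how an adaptive test such as this MCAT picks questions, here is a minimal sketch of the standard maximum-information rule under a unidimensional 2PL model (the item pool is hypothetical; the study's multidimensional selection is more involved).

```python
# Sketch of the core CAT loop: pick the pool item with maximum Fisher
# information at the current ability estimate (2PL model).
# The item pool is hypothetical.

import math

pool = [  # (discrimination a, difficulty b)
    (1.2, -0.5), (0.8, 0.0), (1.5, 0.4), (1.0, 1.2),
]

def p_correct(theta: float, a: float, b: float) -> float:
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def information(theta: float, a: float, b: float) -> float:
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)  # 2PL Fisher information

theta_hat = 0.3  # current provisional ability estimate
best = max(pool, key=lambda item: information(theta_hat, *item))
print(f"next item: a={best[0]}, b={best[1]}")
```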
Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018
Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…
Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring