Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 40 |
Descriptor
Comparative Analysis | 51 |
Computer Assisted Testing | 51 |
Models | 51 |
Test Items | 16 |
Adaptive Testing | 15 |
Foreign Countries | 13 |
Item Response Theory | 13 |
Scoring | 12 |
Correlation | 11 |
Simulation | 10 |
Computer Software | 9 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 17 |
Postsecondary Education | 14 |
Elementary Secondary Education | 7 |
Secondary Education | 7 |
High Schools | 5 |
Elementary Education | 4 |
Junior High Schools | 4 |
Middle Schools | 4 |
Grade 10 | 2 |
Grade 7 | 2 |
Grade 9 | 2 |
More ▼ |
Audience
Practitioners | 1 |
Researchers | 1 |
Students | 1 |
Location
Australia | 4 |
Connecticut | 3 |
Netherlands | 3 |
United Kingdom (England) | 3 |
France | 2 |
Germany | 2 |
Israel | 2 |
New Hampshire | 2 |
New York | 2 |
Pennsylvania | 2 |
Rhode Island | 2 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024
Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…
Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022
In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…
Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis
Yi Gui – ProQuest LLC, 2024
This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…
Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring
Carioti, Desiré; Stucchi, Natale Adolfo; Toneatto, Carlo; Masia, Marta Franca; Del Monte, Milena; Stefanelli, Silvia; Travellini, Simona; Marcelli, Antonella; Tettamanti, Marco; Vernice, Mirta; Guasti, Maria Teresa; Berlingeri, Manuela – Annals of Dyslexia, 2023
In this study, we validated the "ReadFree tool", a computerised battery of 12 visual and auditory tasks developed to identify poor readers also in minority-language children (MLC). We tested the task-specific discriminant power on 142 Italian-monolingual participants (8-13 years old) divided into monolingual poor readers (N = 37) and…
Descriptors: Language Minorities, Task Analysis, Italian, Monolingualism
Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021
Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…
Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Ningsih, Tutuk; Yuwono, Dwi Margo; Sholehuddin, M. Sugeng; Suharto, Abdul Wachid Bambang – Journal of Social Studies Education Research, 2021
Learning at home not only provides written assignments that are changed in electronic form but must also reflect student learning outcomes at home. Likewise, researchers use literary reading to avoid students getting bored with learning Indonesian language literacy and character education. However, improving literacy skills is not just reading…
Descriptors: Indonesian, Computer Assisted Testing, Fiction, Literacy
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Mekni Toujani, Marwa – Discourse Processes: A Multidisciplinary Journal, 2020
One of the major aims of discourse-processing literature is to understand whether and when readers form discourse-level representations online. To test this, two word-by-word, self-paced reading experiments investigated the time course of integrating incoming information about the protagonist into the unfolding discourse-level representation in…
Descriptors: Semitic Languages, Native Language, Discourse Analysis, Reading Processes
Ortin, Ramses; Fernandez-Florez, Carmen – International Journal of Multilingualism, 2019
Research on linguistic variation suggests that usage patterns are deeply embedded in native and non-native speakers' knowledge of grammar. This study explores the transfer of these variable sociolinguistic patterns at the initial stages of third language acquisition. We elicited narratives in Portuguese from two mirror-image groups of sequential…
Descriptors: Grammar, Transfer of Training, Multilingualism, Second Language Learning
Marianti, Sukaesi; Fox, Jean-Paul; Avetisyan, Marianna; Veldkamp, Bernard P.; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2014
Many standardized tests are now administered via computer rather than paper-and-pencil format. In a computer-based testing environment, it is possible to record not only the test taker's response to each question (item) but also the amount of time spent by the test taker in considering and answering each item. Response times (RTs) provide…
Descriptors: Reaction Time, Response Style (Tests), Computer Assisted Testing, Bayesian Statistics
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Han, Kyung T.; Rudner, Lawrence M. – Graduate Management Admission Council, 2014
This study uses mixed integer quadratic programming (MIQP) to construct multiple highly equivalent item pools simultaneously, and compares the results from mixed integer programming (MIP). Three different MIP/MIQP models were implemented and evaluated using real CAT item pool data with 23 different content areas and a goal of equal information…
Descriptors: Item Banks, Programming, Computer Assisted Testing, Adaptive Testing
Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2015
Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…
Descriptors: Computer Assisted Testing, Adaptive Testing, Accuracy, Fidelity