ERIC - Search Results

Showing 1 to 15 of 69 results

Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues

Peer reviewed

Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024

Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…

Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Individual Fairness Evaluation for Automated Essay Scoring System

Peer reviewed

Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022

In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…

Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis

Developing a Generic Scorer for Practice Writing Tests of Statewide Assessment Essays with Natural Language Processing Transfer Learning Techniques

Yi Gui – ProQuest LLC, 2024

This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…

Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring

The ReadFree Tool for the Identification of Poor Readers: A Validation Study Based on a Machine Learning Approach in Monolingual and Minority-Language Children

Peer reviewed

Carioti, Desiré; Stucchi, Natale Adolfo; Toneatto, Carlo; Masia, Marta Franca; Del Monte, Milena; Stefanelli, Silvia; Travellini, Simona; Marcelli, Antonella; Tettamanti, Marco; Vernice, Mirta; Guasti, Maria Teresa; Berlingeri, Manuela – Annals of Dyslexia, 2023

In this study, we validated the "ReadFree tool", a computerised battery of 12 visual and auditory tasks developed to identify poor readers also in minority-language children (MLC). We tested the task-specific discriminant power on 142 Italian-monolingual participants (8-13 years old) divided into monolingual poor readers (N = 37) and…

Descriptors: Language Minorities, Task Analysis, Italian, Monolingualism

Binding Costs in Processing Efficiency as Determinants of Cognitive Ability

Peer reviewed

Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021

Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…

Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time

The Significant of E-Assessment for Indonesian Literacy with Character Education in Pandemic Era

Peer reviewed

Ningsih, Tutuk; Yuwono, Dwi Margo; Sholehuddin, M. Sugeng; Suharto, Abdul Wachid Bambang – Journal of Social Studies Education Research, 2021

Learning at home not only provides written assignments that are changed in electronic form but must also reflect student learning outcomes at home. Likewise, researchers use literary reading to avoid students getting bored with learning Indonesian language literacy and character education. However, improving literacy skills is not just reading…

Descriptors: Indonesian, Computer Assisted Testing, Fiction, Literacy

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Using a Randomized Experiment to Compare the Performance of Two Adaptive Assessment Engines

Peer reviewed

Matayoshi, Jeffrey; Uzun, Hasan; Cosyn, Eric – International Educational Data Mining Society, 2022

Knowledge space theory (KST) is a mathematical framework for modeling and assessing student knowledge. While KST has successfully served as the foundation of several learning systems, recent advancements in machine learning provide an opportunity to improve on purely KST-based approaches to assessing student knowledge. As such, in this work we…

Descriptors: Knowledge Level, Mathematical Models, Learning Experience, Comparative Analysis

Same Test, Better Scores: Boosting the Reliability of Short Online Intelligence Recruitment Tests with Nested Logit Item Response Theory Models

Peer reviewed

Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019

Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…

Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability

The Time-Course of Generating Discourse-Level Representations in Tunisian Arabic: Effects of Task Demands on Detecting Character-Attribute Anomalies

Peer reviewed

Mekni Toujani, Marwa – Discourse Processes: A Multidisciplinary Journal, 2020

One of the major aims of discourse-processing literature is to understand whether and when readers form discourse-level representations online. To test this, two word-by-word, self-paced reading experiments investigated the time course of integrating incoming information about the protagonist into the unfolding discourse-level representation in…

Descriptors: Semitic Languages, Native Language, Discourse Analysis, Reading Processes

Progress Monitoring with Computer Adaptive Assessments: The Impact of Data Collection Schedule on Growth Estimates

Peer reviewed

Nelson, Peter M.; Van Norman, Ethan R.; Klingbeil, Dave A.; Parker, David C. – Psychology in the Schools, 2017

Although extensive research exists on the use of curriculum-based measures for progress monitoring, little is known about using computer adaptive tests (CATs) for progress-monitoring purposes. The purpose of this study was to evaluate the impact of the frequency of data collection on individual and group growth estimates using a CAT. Data were…

Descriptors: Progress Monitoring, Computer Assisted Testing, Data Collection, Scheduling

Transfer of Variable Grammars in Third Language Acquisition

Peer reviewed

Ortin, Ramses; Fernandez-Florez, Carmen – International Journal of Multilingualism, 2019

Research on linguistic variation suggests that usage patterns are deeply embedded in native and non-native speakers' knowledge of grammar. This study explores the transfer of these variable sociolinguistic patterns at the initial stages of third language acquisition. We elicited narratives in Portuguese from two mirror-image groups of sequential…

Descriptors: Grammar, Transfer of Training, Multilingualism, Second Language Learning

Testing for Aberrant Behavior in Response Time Modeling

Peer reviewed

Marianti, Sukaesi; Fox, Jean-Paul; Avetisyan, Marianna; Veldkamp, Bernard P.; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2014

Many standardized tests are now administered via computer rather than paper-and-pencil format. In a computer-based testing environment, it is possible to record not only the test taker's response to each question (item) but also the amount of time spent by the test taker in considering and answering each item. Response times (RTs) provide…

Descriptors: Reaction Time, Response Style (Tests), Computer Assisted Testing, Bayesian Statistics

Probing the Relative Importance of Different Attributes in L2 Reading and Listening Comprehension Items: An Application of Cognitive Diagnostic Models

Peer reviewed

Yi, Yeon-Sook – Language Testing, 2017

The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…

Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
