Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 8 |
| Since 2017 (last 10 years) | 43 |
| Since 2007 (last 20 years) | 139 |
Descriptor
| Comparative Analysis | 204 |
| Models | 204 |
| Hypothesis Testing | 84 |
| Computer Assisted Testing | 51 |
| Foreign Countries | 43 |
| Statistical Analysis | 35 |
| Testing | 32 |
| Correlation | 30 |
| Item Response Theory | 28 |
| Test Items | 28 |
| Evaluation Methods | 27 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 49 |
| Postsecondary Education | 33 |
| Elementary Secondary Education | 18 |
| Secondary Education | 16 |
| Elementary Education | 14 |
| High Schools | 10 |
| Middle Schools | 8 |
| Grade 8 | 7 |
| Junior High Schools | 7 |
| Grade 4 | 6 |
| Grade 7 | 3 |
| More ▼ | |
Audience
| Researchers | 4 |
| Practitioners | 2 |
| Teachers | 2 |
| Students | 1 |
Location
| Australia | 8 |
| Netherlands | 8 |
| United States | 6 |
| Germany | 5 |
| Canada | 4 |
| Florida | 4 |
| Indonesia | 4 |
| Connecticut | 3 |
| Greece | 3 |
| Israel | 3 |
| North Carolina | 3 |
| More ▼ | |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 2 |
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024
Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…
Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods
W. Jake Thompson – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…
Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit
Yixi Wang – ProQuest LLC, 2020
Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…
Descriptors: Item Response Theory, Educational Testing, Data, Models
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
Mao, Jih-Yu; Xiao, Jincen; Liu, Xin; Qing, Tao; Xu, Hongling – Creativity Research Journal, 2023
In this research, we explore how coworker ideation levels or, more specifically, the average ideation levels of coworkers within a workgroup affect a focal employee's ideation. We examine an underlying mechanism and a boundary condition of this influence process. Drawing on social cognitive theory, we argue that high coworker ideation levels are…
Descriptors: Work Environment, Employee Attitudes, Creativity, Self Efficacy
Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022
In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…
Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis
Yi Gui – ProQuest LLC, 2024
This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…
Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring
Carioti, Desiré; Stucchi, Natale Adolfo; Toneatto, Carlo; Masia, Marta Franca; Del Monte, Milena; Stefanelli, Silvia; Travellini, Simona; Marcelli, Antonella; Tettamanti, Marco; Vernice, Mirta; Guasti, Maria Teresa; Berlingeri, Manuela – Annals of Dyslexia, 2023
In this study, we validated the "ReadFree tool", a computerised battery of 12 visual and auditory tasks developed to identify poor readers also in minority-language children (MLC). We tested the task-specific discriminant power on 142 Italian-monolingual participants (8-13 years old) divided into monolingual poor readers (N = 37) and…
Descriptors: Language Minorities, Task Analysis, Italian, Monolingualism
Prevodnik, Katja; Vehovar, Vasja – Sociological Methods & Research, 2023
When comparing social science phenomena through a time perspective, absolute and relative difference (RD) are the two typical presentation formats used to communicate interpretations to the audience, while time distance (TD) is the least frequently used of such formats. This article argues that the chosen presentation format is extremely important…
Descriptors: Comparative Analysis, Social Science Research, Public Agencies, College Faculty
Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021
Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…
Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time
Douven, Igor; Mirabile, Patricia – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
There is a wealth of evidence that people's reasoning is influenced by explanatory considerations. Little is known, however, about the exact form this influence takes, for instance about whether the influence is unsystematic or because of people's following some rule. Three experiments investigate the descriptive adequacy of a precise proposal to…
Descriptors: Probability, Bayesian Statistics, Hypothesis Testing, Thinking Skills
Ningsih, Tutuk; Yuwono, Dwi Margo; Sholehuddin, M. Sugeng; Suharto, Abdul Wachid Bambang – Journal of Social Studies Education Research, 2021
Learning at home not only provides written assignments that are changed in electronic form but must also reflect student learning outcomes at home. Likewise, researchers use literary reading to avoid students getting bored with learning Indonesian language literacy and character education. However, improving literacy skills is not just reading…
Descriptors: Indonesian, Computer Assisted Testing, Fiction, Literacy
Bosch, Nigel – Journal of Educational Data Mining, 2021
Automatic machine learning (AutoML) methods automate the time-consuming, feature-engineering process so that researchers produce accurate student models more quickly and easily. In this paper, we compare two AutoML feature engineering methods in the context of the National Assessment of Educational Progress (NAEP) data mining competition. The…
Descriptors: Accuracy, Learning Analytics, Models, National Competency Tests
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Raykov, Tenko; Marcoulides, George A.; Akaeze, Hope O. – Educational and Psychological Measurement, 2017
This note is concerned with examining the relationship between within-group and between-group variances in two-level nested designs. A latent variable modeling approach is outlined that permits point and interval estimation of their ratio and allows their comparison in a multilevel study. The procedure can also be used to test various hypotheses…
Descriptors: Comparative Analysis, Models, Statistical Analysis, Hierarchical Linear Modeling

Peer reviewed
Direct link
