Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 10 |
| Since 2007 (last 20 years) | 20 |
Descriptor
| Bayesian Statistics | 28 |
| Scores | 28 |
| Adaptive Testing | 9 |
| Computer Assisted Testing | 9 |
| Item Response Theory | 8 |
| Correlation | 7 |
| Hypothesis Testing | 6 |
| Statistical Analysis | 6 |
| Test Items | 6 |
| Probability | 5 |
| Simulation | 5 |
| More ▼ | |
Source
Author
| McBride, James R. | 3 |
| Chiang, Hanley S. | 2 |
| Schochet, Peter Z. | 2 |
| Weiss, David J. | 2 |
| Abad, Francisco J. | 1 |
| Boyd, Donald | 1 |
| Bradlow, Eric T. | 1 |
| Carvajal, Jorge | 1 |
| Castillo, Gladys | 1 |
| Chan, Greta | 1 |
| Chen, Yunxiao | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 22 |
| Journal Articles | 19 |
| Reports - Evaluative | 5 |
| Speeches/Meeting Papers | 2 |
| Opinion Papers | 1 |
| Reports - Descriptive | 1 |
Education Level
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Elementary Education | 3 |
| Elementary Secondary Education | 2 |
| Grade 5 | 2 |
| Grade 6 | 2 |
| Intermediate Grades | 2 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 7 | 1 |
| Grade 8 | 1 |
| More ▼ | |
Audience
Location
| Indiana | 1 |
| Netherlands | 1 |
| New York | 1 |
| Portugal | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Early Childhood Longitudinal… | 1 |
| Indiana Statewide Testing for… | 1 |
| Wechsler Adult Intelligence… | 1 |
| Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023
Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…
Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing
Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022
A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…
Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length
Dalia Khairy; Nouf Alharbi; Mohamed A. Amasha; Marwa F. Areed; Salem Alkhalaf; Rania A. Abougalala – Education and Information Technologies, 2024
Student outcomes are of great importance in higher education institutions. Accreditation bodies focus on them as an indicator to measure the performance and effectiveness of the institution. Forecasting students' academic performance is crucial for every educational establishment seeking to enhance performance and perseverance of its students and…
Descriptors: Prediction, Tests, Scores, Information Retrieval
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019
According to Wollack and Schoenig (2018), score differencing is one of six types of statistical methods used to detect test fraud. In this paper, we suggested the use of Bayes factors (e.g., Kass & Raftery, 1995) for score differencing. A simulation study shows that the suggested approach performs slightly better than an existing frequentist…
Descriptors: Cheating, Deception, Statistical Analysis, Bayesian Statistics
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Magliano, Joseph P.; Lampi, Jodi P.; Ray, Melissa; Chan, Greta – Grantee Submission, 2020
Coherent mental models for successful comprehension require inferences that establish semantic "bridges" between discourse constituents and "elaborations" that incorporate relevant background knowledge. While it is established that individual differences in the extent to which postsecondary students engage in these processes…
Descriptors: Reading Comprehension, Reading Strategies, Inferences, Reading Tests
Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017
In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…
Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods
Rozell, Timothy G.; Johnson, Jessica; Sexten, Andrea; Rhodes, Ashley E. – Journal of College Science Teaching, 2017
Students in a junior- and senior-level Anatomy and Physiology course have the opportunity to correct missed exam questions ("regrade") and earn up to half of the original points missed. The three objectives of this study were to determine if: (a) performance on the regrade assignment was correlated with scores on subsequent exams, (b)…
Descriptors: Physiology, Scores, Grades (Scholastic), Exit Examinations
Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's "Clarifying the Consensus Definition of Validity" addresses the single most important, yet stubbornly protean, value in educational and psychological assessment. "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in…
Descriptors: Evidence, Validity, Educational Testing, Psychological Evaluation
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
Millan, Eva; Descalco, Luis; Castillo, Gladys; Oliveira, Paula; Diogo, Sandra – Computers & Education, 2013
In this paper, we describe the integration and evaluation of an existing generic Bayesian student model (GBSM) into an existing computerized testing system within the Mathematics Education Project (PmatE--Projecto Matematica Ensino) of the University of Aveiro. This generic Bayesian student model had been previously evaluated with simulated…
Descriptors: Computer Assisted Testing, Equations (Mathematics), Mathematics Education, Expertise
Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014
Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…
Descriptors: Simulation, Evaluation Methods, Games, Data Collection
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Hodges, Jaret; McIntosh, Jason; Gentry, Marcia – Journal of Advanced Academics, 2017
High-potential students from low-income families are at an academic disadvantage compared with their more affluent peers. To address this issue, researchers have suggested novel approaches to mitigate gaps in student performance, including out-of-school enrichment programs. Longitudinal mixed effects modeling was used to analyze the growth of…
Descriptors: After School Programs, Enrichment Activities, Academic Achievement, High Achievement
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
