Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, test lengths, and numbers and locations of polytomous items. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023
The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam was used in this study and the method was applied to 24…
Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format
Miller, Faith G.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Schardt, Alyssa A. – Assessment for Effective Intervention, 2017
The purpose of this study was to investigate the impact of two different Direct Behavior Rating--Single Item Scale (DBR-SIS) formats on rating accuracy. A total of 119 undergraduate students participated in one of two study conditions, each utilizing a different DBR-SIS scale format: one that included percentage of time anchors on the DBR-SIS…
Descriptors: Behavior Rating Scales, Test Format, Accuracy, Undergraduate Students
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016
A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…
Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification
Arslan, Recep Sahin; Üçok-Atasoy, Meral – International Online Journal of Education and Teaching, 2020
Because assessment mirrors teaching and learning practices, its value cannot be ignored in teaching English as a Foreign Language (EFL) programmes, as all those involved in foreign language teaching in non-native settings need constant feedback about the effectiveness of their efforts. Assessment of young learners of…
Descriptors: English (Second Language), English Teachers, Second Language Learning, Student Evaluation
Luo, Xin; Reckase, Mark D.; He, Wei – AERA Online Paper Repository, 2016
While dichotomous items dominate the application of computerized adaptive testing (CAT), polytomous items and set-based items hold promise for being incorporated in CAT. However, assembling a CAT containing mixed item formats is challenging. This study investigated: (1) how the mixed CAT works compared with the dichotomous-item-based CAT; (2)…
Descriptors: Test Items, Test Format, Computer Assisted Testing, Adaptive Testing
Granena, Gisela – Studies in Second Language Acquisition, 2019
This study investigated the underlying structure of a set of eight cognitive tests from the two most recent language aptitude test batteries: the LLAMA (Meara, 2005) and the Hi-LAB (Linck et al., 2013) to see whether they had any underlying constructs in common. The study also examined whether any of the observed constructs could predict L2…
Descriptors: Second Language Learning, Intelligence Tests, Memory, Language Aptitude
Castle, Courtney – ProQuest LLC, 2018
The Next Generation Science Standards propose a multidimensional model of science learning, comprised of Core Disciplinary Ideas, Science and Engineering Practices, and Crosscutting Concepts (NGSS Lead States, 2013). Accordingly, there is a need for student assessment aligned with the new standards. Creating assessments that validly and reliably…
Descriptors: Science Education, Student Evaluation, Science Tests, Test Construction