ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	8

Descriptor

Accuracy	8
Test Construction	8
Test Format	8
Test Items	4
Adaptive Testing	2
Classification	2
Computer Assisted Testing	2
Correlation	2
Cutting Scores	2
Item Response Theory	2
Language Tests	2
Prediction	2
Psychometrics	2
Scores	2
Second Language Learning	2
Student Evaluation	2
Alignment (Education)	1
Alternative Assessment	1
Architecture	1
Behavior Rating Scales	1
Case Studies	1
Cognitive Processes	1
College Students	1
Communicative Competence…	1
Comparative Analysis	1
More ▼

Source

ProQuest LLC	2
AERA Online Paper Repository	1
Assessment for Effective…	1
International Online Journal…	1
Journal of Educational…	1
Practical Assessment,…	1
Studies in Second Language…	1

Author

Arslan, Recep Sahin	1
Babcock, Ben	1
Castle, Courtney	1
Chafouleas, Sandra M.	1
Foley, Brett	1
Granena, Gisela	1
He, Wei	1
Jing Ma	1
Luo, Xin	1
Miller, Faith G.	1
Reckase, Mark D.	1
Riley-Tillman, T. Chris	1
Schardt, Alyssa A.	1
Wolkowitz, Amanda A.	1
Wyse, Adam E.	1
Zurn, Jared	1
Üçok-Atasoy, Meral	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	5
Dissertations/Theses -…	2
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

A Method for Converting 4-Option Multiple-Choice Items to 3-Option Multiple-Choice Items without Re-Pretesting

Peer reviewed
PDF on ERIC

Download full text

Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023

The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam was used in this study and the method was applied to 24…

Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format

Direct Behavior Rating Instrumentation: Evaluating the Impact of Scale Formats

Peer reviewed
PDF on ERIC

Download full text

Direct link

Miller, Faith G.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Schardt, Alyssa A. – Assessment for Effective Intervention, 2017

The purpose of this study was to investigate the impact of two different Direct Behavior Rating--Single Item Scale (DBR-SIS) formats on rating accuracy. A total of 119 undergraduate students participated in one of two study conditions, each utilizing a different DBR-SIS scale format: one that included percentage of time anchors on the DBR-SIS…

Descriptors: Behavior Rating Scales, Test Format, Accuracy, Undergraduate Students

Does Maximizing Information at the Cut Score Always Maximize Classification Accuracy and Consistency?

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016

A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…

Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification

An Investigation into EFL Teachers' Assessment of Young Learners of English: Does Practice Match the Policy?

Peer reviewed
PDF on ERIC

Download full text

Arslan, Recep Sahin; Üçok-Atasoy, Meral – International Online Journal of Education and Teaching, 2020

On the grounds that assessment stands for a mirror of teaching and learning practices, its value cannot be ignored in teaching English as a Foreign Language (EFL) programmes as all those involved in foreign language teaching in non-native settings need constant feedback about the effectiveness of their ventures. Assessment of young learners of…

Descriptors: English (Second Language), English Teachers, Second Language Learning, Student Evaluation

Incorporating Mixed Item Formats in Computerized Adaptive Testing: A Comparison of Shadow Test and Bin-Structured Approach

Peer reviewed

Direct link

Luo, Xin; Reckase, Mark D.; He, Wei – AERA Online Paper Repository, 2016

While dichotomous item dominates the application of computerized adaptive testing (CAT), polytomous item and set-based item hold promises for being incorporated in CAT. However, how to assemble a CAT containing mixed item formats is challenging. This study investigated: (1) how the mixed CAT works compared with the dichotomous-item-based CAT; (2)…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Adaptive Testing

Cognitive Aptitudes and L2 Speaking Proficiency: Links between LLAMA and HI-LAB

Peer reviewed

Direct link

Granena, Gisela – Studies in Second Language Acquisition, 2019

This study investigated the underlying structure of a set of eight cognitive tests from the two most recent language aptitude test batteries: the LLAMA (Meara, 2005) and the Hi-LAB (Linck et al., 2013) to see whether they had any underlying constructs in common. The study also examined whether any of the observed constructs could predict L2…

Descriptors: Second Language Learning, Intelligence Tests, Memory, Language Aptitude

Measuring Multidimensional Science Learning: Item Design, Scoring, and Psychometric Considerations

Direct link

Castle, Courtney – ProQuest LLC, 2018

The Next Generation Science Standards propose a multidimensional model of science learning, comprised of Core Disciplinary Ideas, Science and Engineering Practices, and Crosscutting Concepts (NGSS Lead States, 2013). Accordingly, there is a need for student assessment aligned with the new standards. Creating assessments that validly and reliably…

Descriptors: Science Education, Student Evaluation, Science Tests, Test Construction