ERIC - Search Results

Publication Date

In 2025	0
Since 2024	7

Source

Measurement and Evaluation in…	3
Asia Pacific Education Review	1
International Journal of…	1
Language Assessment Quarterly	1
ProQuest LLC	1

Author

Abdulkadir Haktanir	1
Bradley T. Erford	1
Byeolbee Um	1
Daniel Richard Isbell	1
David Lardier	1
Elif Sari	1
Jeongwoon Jeong	1
Jieun Kim	1
M. Furkan Kurnaz	1
Mingying Zheng	1
Monique Rodriguez	1
Quentin Hunter	1
Richard S. Balkin	1
Sojeong Nam	1
Wendy Chan	1
Zeynep Simsir Gökalp	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	3
Dissertations/Theses -…	1
Information Analyses	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Elementary Education	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Reliable for Whom? Inferring and Reporting Reliability across Diverse Populations

Peer reviewed

Direct link

Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024

We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…

Descriptors: Best Practices, Reliability, Counseling, Research

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Direct link

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

Score Reliability Generalization of the Columbia-Suicide Severity Rating Scale (C-SSRS): A Meta-Analysis

Peer reviewed

Direct link

Sojeong Nam; Byeolbee Um; Jeongwoon Jeong; Monique Rodriguez; David Lardier – Measurement and Evaluation in Counseling and Development, 2024

This study aimed to provide meta-analytic reliability information of the Columbia-Suicide Severity Rating Scale (C-SSRS). We implemented systematic search procedures to 35 eligible studies (N = 23,247; Mage = 26.74 years) that reported reliability estimates. The synthesized average values of Cronbach's alpha were 0.88 (95% CI [0.85, 0.92]) for the…

Descriptors: Scores, Test Reliability, Rating Scales, Suicide

Reliability Generalization Meta-Analysis of the Brief Self-Control Scale (BSCS): Reliability Evidence across Age Groups and Languages

Peer reviewed

Direct link

Abdulkadir Haktanir; M. Furkan Kurnaz; Zeynep Simsir Gökalp – Measurement and Evaluation in Counseling and Development, 2024

Objective: Brief Self-Control Scale (BSCS) is the most widely used instrument to assess self-control. The purpose of this reliability generalization meta-analysis was to examine the degree to which consistency reliability coefficients for scores on the BSCS generalize across age groups and languages. Method: We included studies using the BSCS and…

Descriptors: Self Control, Measures (Individuals), Meta Analysis, Test Reliability

Investigating the Quality of a High-Stakes EFL Writing Assessment Procedure in the Turkish Higher Education Context

Peer reviewed
PDF on ERIC

Download full text

Elif Sari – International Journal of Assessment Tools in Education, 2024

Employing G-theory and rater interviews, the study investigated how a high-stakes writing assessment procedure (i.e., a single-task, single-rater, and holistic scoring procedure) impacted the variability and reliability of its scores within the Turkish higher education context. Thirty-two essays written on two different writing tasks (i.e.,…

Descriptors: Foreign Countries, High Stakes Tests, Writing Evaluation, Scores

Test Review: ACTFL Assessment of Performance toward Proficiency in Languages (AAPPL)

Peer reviewed

Direct link

Jieun Kim; Daniel Richard Isbell – Language Assessment Quarterly, 2024

The ACTFL Assessment of Performance Toward Proficiency in Languages (AAPPL, https://www.actfl.n.d.org/assessments/k-12-assessments/aappl) assesses proficiency in 11 languages for students in grades 3 to 12 and is often used to award the Seal of Biliteracy. While arguments for the valid interpretation and uses of the AAPPL have previously been…

Descriptors: Language Tests, Second Language Learning, Second Language Instruction, Language Proficiency

Using Data Preprocessing Techniques and Machine Learning Algorithms to Explore Predictors of Word Difficulty in English Language Assessment

Direct link

Mingying Zheng – ProQuest LLC, 2024

The digital transformation in educational assessment has led to the proliferation of large-scale data, offering unprecedented opportunities to enhance language learning, and testing through machine learning (ML) techniques. Drawing on the extensive data generated by online English language assessments, this dissertation investigates the efficacy…

Descriptors: Artificial Intelligence, Computational Linguistics, Language Tests, English (Second Language)

Generalization	7
Scores	7
Meta Analysis	3
Second Language Instruction	3
Accuracy	2
Counseling	2
English (Second Language)	2
Foreign Countries	2
Language Tests	2
Reliability	2
Second Language Learning	2
Test Reliability	2
Age Groups	1
Algorithms	1
Artificial Intelligence	1
Best Practices	1
Bilingualism	1
Causal Models	1
College Students	1
Computation	1
Computational Linguistics	1
Computer Assisted Testing	1
Computer Software	1
Costs	1
Counseling Services	1
More ▼