Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024
We describe best practices in reporting reliability estimates in counseling research, with attention to precision, generalization, and diverse populations. We provide historical context for reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…
Descriptors: Best Practices, Reliability, Counseling, Research
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California Critical Thinking Disposition Inventory (CCTDI) total score and subscale scores and to investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscale…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Sojeong Nam; Byeolbee Um; Jeongwoon Jeong; Monique Rodriguez; David Lardier – Measurement and Evaluation in Counseling and Development, 2024
This study aimed to provide meta-analytic reliability information for the Columbia-Suicide Severity Rating Scale (C-SSRS). Our systematic search procedures yielded 35 eligible studies (N = 23,247; mean age = 26.74 years) that reported reliability estimates. The synthesized average values of Cronbach's alpha were 0.88 (95% CI [0.85, 0.92]) for the…
Descriptors: Scores, Test Reliability, Rating Scales, Suicide
Singh, Leah J.; Floyd, Randy G. – Contemporary School Psychology, 2023
Behavior rating scales measuring executive function have grown popular among school psychologists and other professionals. To examine the generalizability of executive function rating scales, 42 parent-adolescent dyads, recruited from a school-based sample, completed the Behavior Rating Inventory of Executive Function, Second Edition and the…
Descriptors: Behavior Rating Scales, Executive Function, Adolescents, Parents
Abdulkadir Haktanir; M. Furkan Kurnaz; Zeynep Simsir Gökalp – Measurement and Evaluation in Counseling and Development, 2024
Objective: The Brief Self-Control Scale (BSCS) is the most widely used instrument to assess self-control. The purpose of this reliability generalization meta-analysis was to examine the degree to which consistency reliability coefficients for scores on the BSCS generalize across age groups and languages. Method: We included studies using the BSCS and…
Descriptors: Self Control, Measures (Individuals), Meta Analysis, Test Reliability
Chan, Wendy – American Journal of Evaluation, 2022
Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…
Descriptors: Probability, Scores, Scoring, Generalization
Lenz, A. Stephen; Ho, Chia-Min; Rocha, Lauren; Aras, Yahyahan – Measurement and Evaluation in Counseling and Development, 2021
This study examined the degree to which reliability coefficients for scores on the PTGI generalize across participant and study characteristics. Meta-analytic procedures yielded observed and predicted mean alpha coefficients ranging from acceptable to excellent, which appeared to be largely unrelated to the participant characteristics included in our…
Descriptors: Generalization, Test Reliability, Scores, Measures (Individuals)
Elif Sari – International Journal of Assessment Tools in Education, 2024
Employing G-theory and rater interviews, the study investigated how a high-stakes writing assessment procedure (i.e., a single-task, single-rater, and holistic scoring procedure) impacted the variability and reliability of its scores within the Turkish higher education context. Thirty-two essays written on two different writing tasks (i.e.,…
Descriptors: Foreign Countries, High Stakes Tests, Writing Evaluation, Scores
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by the Runco Ideational Behavior Scale (RIBS) and to explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
MD, Soumya; Krishnamoorthy, Shivsubramani – Education and Information Technologies, 2022
In recent times, Educational Data Mining and Learning Analytics have been widely used to model decision-making to improve teaching/learning ecosystems. However, adapting student models to different domains/courses requires a balance between generalization and context specificity to reduce the redundancy in creating domain-specific…
Descriptors: Predictor Variables, Academic Achievement, Higher Education, Learning Analytics
García-Grau, Pau; McWilliam, R. A.; Bull, Kerry; Foster, John – Infants and Young Children, 2022
Functional plans in early childhood intervention need to include contextualized, meaningful, and measurable goals and include timelines and criteria for generalization. In addition, they must address children's and families' needs and priorities. The Routines-Based Interview has had a positive impact on the functionality of goals identified in the…
Descriptors: Foreign Countries, Preschool Children, Goal Orientation, Parent Attitudes
Jieun Kim; Daniel Richard Isbell – Language Assessment Quarterly, 2024
The ACTFL Assessment of Performance Toward Proficiency in Languages (AAPPL, https://www.actfl.n.d.org/assessments/k-12-assessments/aappl) assesses proficiency in 11 languages for students in grades 3 to 12 and is often used to award the Seal of Biliteracy. While arguments for the valid interpretation and uses of the AAPPL have previously been…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, Language Proficiency
Yogi, Jonathan Kimei – ProQuest LLC, 2023
Jung and Won's (2018) review of elementary school educational robotics (ER) found a lack of understanding of instructional practices for ER with young children. Other researchers have called for further studies into what effective classroom orchestration and interaction look like within ER classrooms (Ioannou & Makridou, 2018; Xia & Zhong, 2019). This study was…
Descriptors: Computer Science Education, Robotics, Group Dynamics, Gender Differences
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity