Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 17 |
Since 2006 (last 20 years) | 30 |
Descriptor
Source
Author
Bastick, Tony | 4 |
Sireci, Stephen G. | 3 |
Thompson, Bruce | 3 |
Baker, Eva L. | 2 |
Bard, E. M. | 2 |
Benor, Dan E. | 2 |
Capie, William | 2 |
Cook, Colleen | 2 |
Dereshiwsky, Mary I. | 2 |
Ellett, Chad D. | 2 |
Evans, Lynn | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 11 |
Elementary Education | 8 |
Postsecondary Education | 7 |
Elementary Secondary Education | 5 |
Secondary Education | 5 |
Grade 4 | 3 |
Grade 5 | 3 |
Grade 6 | 3 |
Early Childhood Education | 2 |
Grade 10 | 2 |
High Schools | 2 |
More ▼ |
Location
Australia | 5 |
California | 5 |
United Kingdom | 5 |
Netherlands | 3 |
Canada | 2 |
Florida | 2 |
Illinois | 2 |
Israel | 2 |
Massachusetts | 2 |
Ohio | 2 |
Saudi Arabia | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 4 |
Education Consolidation… | 2 |
Comprehensive Education… | 1 |
Elementary and Secondary… | 1 |
First Amendment | 1 |
Vocational Education… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024
The purpose of this systematic review was to survey available measures of creativity for pk12 students for assessments characteristics and reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…
Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Karen Leary Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
Assessment is a topic of concern to all stakeholders in our educational system. Pattern Based Questions are an assessment tool which is an alternative to the standardized assessment tool, and they are based on generative learning pedagogy, which shows promise in engaging all learners and usefulness in teaching and learning but validity has not yet…
Descriptors: Undergraduate Students, College Mathematics, Mathematics Skills, Thinking Skills
Kerrigan, Sarah; Norton, Anderson; Ulrich, Catherine – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020
We report on and validate a system for ranking the cognitive demand of mathematical tasks. In our framework, task rankings are determined by the sequences of units and unit transformations students might use to solve each task. Using this framework, we ranked a set of 10 fractions tasks. We then interviewed 12 pre-service teachers to assess the…
Descriptors: Cognitive Processes, Difficulty Level, Fractions, Evaluation Methods
Ill-Defined but Well-Measured? Validating Measures of Noncognitive Skills in Large-Scale Assessments
Borgonovi, Francesca; Ferrara, Alessandro; Piacentini, Mario – AERA Online Paper Repository, 2020
Non-cognitive skills are routinely measured using self-reports in the context of large-scale international assessments. However questions remain on the adequacy of self-reports to conduct comparisons. Measures that exploit test-taker's behaviour during the completion of questionnaires or of the cognitive tests have been proposed in the literature…
Descriptors: Evaluation Methods, Measures (Individuals), Student Evaluation, Validity
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Gates, Emily; Benitez Alvarez, Kayla M. – AERA Online Paper Repository, 2022
Evaluators have opportunities to advance equity within evaluations, yet little research has examined whether and how evaluators center equity in evaluation practice. This paper explores whether and how evaluators in New England address inequities and advance equity throughout evaluation phases. The study uses a complementarity, sequential mixed…
Descriptors: Evaluators, Professional Development, Context Effect, Social Justice
Horvat, Saša A.; Rodic, Dušica D.; Roncevic, Tamara N.; Babic-Kekez, Snežana; Horvat, Bojana Trifunovic – International Baltic Symposium on Science and Technology Education, 2021
Mathematical calculations are an important part of chemistry. Those problems are difficult for students, especially if the task is set with a limiting reactant. The aim of this study was development of a Procedure for evaluation of cognitive complexity of the Stoichiometric Tasks with a Limiting Reactant. The procedure created included an…
Descriptors: Likert Scales, Chemistry, Science Instruction, Task Analysis
Clairmont, Albert Anthony; Katz, Daniel; Wilton, Mike – AERA Online Paper Repository, 2021
This study demonstrates the importance of Rasch Measurement Theory (RMT) in program evaluation when outcome measures need to be constructed from scratch. The paper introduces typical measure validation methods presented in program evaluation texts and discusses room for improvement. The study then illustrates how the seamless transitions from…
Descriptors: Program Evaluation, Measurement Techniques, Validity, Ethnography
Lester, Leanne; Cefai, Carmel; Cavioni, Valeria; Barnes, Amy; Professor, Donna Cross – Australian Journal of Teacher Education, 2020
A caring school community can enhance whole-school wellbeing including the wellbeing of school staff, which directly impacts on student academic, social and emotional wellbeing. This study firstly examines the validity and reliability of a proposed wholeschool staff wellbeing evaluation tool which uses a set of whole-school wellbeing indicators to…
Descriptors: Well Being, School Personnel, Test Construction, Test Validity
Sandvik, Lise Vikan; Fjoertoft, Henning – AERA Online Paper Repository, 2016
This paper reports findings from a national wide research project in Norway called "Research on individual assessment in schools" (FIVIS), with the main purpose to gain knowledge of how assessment stimulates learning and what characterizes school practices and classroom practices when assessment is used as a tool for learning.The project…
Descriptors: Foreign Countries, Validity, Fundamental Concepts, Evaluation Methods
Aydin, Selami; Harputlu, Leyla; Çelik, Seyda Savran; Ustuk, Özgehan; Güzel, Serhat; Genç, Deniz – Online Submission, 2016
Measurement of children's behaviors in an educational and research context is a problematic and complex area. It is also evident that adapting scales to measure children's behaviors in an educational and research context is a complex process due to several reasons. First, cultural elements constitute a considerable problem. Second, it is difficult…
Descriptors: Child Behavior, Models, Test Construction, Test Validity
Shin, Youngjoon; Seo, Hae-Ae; Hong, Jun-Euy – International Baltic Symposium on Science and Technology Education, 2019
This research aimed to develop an assessment tool for students' Positive Experiences about Science (PES). A preliminary version of PSE was developed through literature review, consisting of academic emotion, self-concept, learning motivation, career aspiration, and attitude in science. A pilot test was conducted with 198 students and a main test…
Descriptors: Positive Attitudes, Student Experience, Science Education, Evaluation Methods
Falakmasir, Mohammad; Yudelson, Michael; Ritter, Steve; Koedinger, Ken – International Educational Data Mining Society, 2015
Bayesian Knowledge Tracing (BKT) has been in wide use for modeling student skill acquisition in Intelligent Tutoring Systems (ITS). BKT tracks and updates student's latent mastery of a skill as a probability distribution of a binary variable. BKT does so by accounting for observed student successes in applying the skill correctly, where success is…
Descriptors: Bayesian Statistics, Models, Skill Development, Intelligent Tutoring Systems