ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	30

Descriptor

Evaluation Methods	385
Test Validity	194
Validity	144
Higher Education	92
Test Construction	84
Elementary Secondary Education	82
Test Reliability	80
Student Evaluation	76
Foreign Countries	56
Measurement Techniques	52
Educational Assessment	50
Reliability	47
Program Evaluation	40
Evaluation Criteria	39
Models	39
Teacher Evaluation	34
Research Methodology	31
Construct Validity	29
Test Use	29
Teacher Effectiveness	26
Comparative Analysis	25
Performance Based Assessment	25
Predictive Validity	25
Questionnaires	24
College Students	23

Publication Type

Speeches/Meeting Papers	385
Reports - Research	206
Reports - Evaluative	77
Opinion Papers	36
Tests/Questionnaires	36
Reports - Descriptive	33
Information Analyses	22
Journal Articles	15
Guides - Non-Classroom	4
Reports - General	4
Collected Works - General	3
Reference Materials -…	3
Book/Product Reviews	2
Numerical/Quantitative Data	2
Guides - General	1
Historical Materials	1

Education Level

Higher Education	11
Elementary Education	8
Postsecondary Education	7
Elementary Secondary Education	5
Secondary Education	5
Grade 4	3
Grade 5	3
Grade 6	3
Early Childhood Education	2
Grade 10	2
High Schools	2
Junior High Schools	2
Middle Schools	2
Grade 12	1
Grade 3	1
Grade 8	1
Intermediate Grades	1
Kindergarten	1
Preschool Education	1
Primary Education	1

Audience

Researchers	44
Practitioners	13
Teachers	4
Administrators	3
Policymakers	2
Media Staff	1

Location

Australia	5
California	5
United Kingdom	5
Netherlands	3
Canada	2
Florida	2
Illinois	2
Israel	2
Massachusetts	2
Ohio	2
Saudi Arabia	2
South Korea	2
Taiwan	2
United Arab Emirates	2
Arizona	1
Belgium	1
Brazil	1
Canada (Ottawa)	1
Colorado	1
Connecticut	1
Cyprus	1
Denmark	1
Finland	1
Germany	1
Hong Kong	1

Laws, Policies, & Programs

Elementary and Secondary…	4
Education Consolidation…	2
Comprehensive Education…	1
Elementary and Secondary…	1
First Amendment	1
Vocational Education…	1

Showing 1 to 15 of 385 results

Peering through the Kaleidoscope: A Systematic Review of Creativity Assessments within PK-12 Settings

Peer reviewed

Direct link

Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024

The purpose of this systematic review was to survey available measures of creativity for PK-12 students for assessment characteristics and reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…

Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity

Examining Inter-Rater Reliability of Evaluators Judging Teacher Performance: Proposing an Alternative to Cohen's Kappa. CEME Technical Report. CEMETR-2021-06

Download full text

Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021

The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…

Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods

Two-Stage Polytomous Attribute Estimation Methods: Overcoming Computational Challenges in Large-Scale Assessments with Polytomous Attributes

Peer reviewed

Direct link

Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024

To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…

Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data

Examining the Validity of a Generative Education Pattern Based Question

Peer reviewed
PDF on ERIC

Download full text

Karen Leary Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

Assessment is a topic of concern to all stakeholders in our educational system. Pattern Based Questions are an assessment tool that offers an alternative to standardized assessment. They are based on generative learning pedagogy, which shows promise in engaging all learners and usefulness in teaching and learning, but validity has not yet…

Descriptors: Undergraduate Students, College Mathematics, Mathematics Skills, Thinking Skills

Ranking the Cognitive Demand of Fractions Tasks

Peer reviewed
PDF on ERIC

Download full text

Kerrigan, Sarah; Norton, Anderson; Ulrich, Catherine – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020

We report on and validate a system for ranking the cognitive demand of mathematical tasks. In our framework, task rankings are determined by the sequences of units and unit transformations students might use to solve each task. Using this framework, we ranked a set of 10 fractions tasks. We then interviewed 12 pre-service teachers to assess the…

Descriptors: Cognitive Processes, Difficulty Level, Fractions, Evaluation Methods

Ill-Defined but Well-Measured? Validating Measures of Noncognitive Skills in Large-Scale Assessments

Peer reviewed

Direct link

Borgonovi, Francesca; Ferrara, Alessandro; Piacentini, Mario – AERA Online Paper Repository, 2020

Non-cognitive skills are routinely measured using self-reports in the context of large-scale international assessments. However, questions remain on the adequacy of self-reports to conduct comparisons. Measures that exploit test-takers' behaviour during the completion of questionnaires or of the cognitive tests have been proposed in the literature…

Descriptors: Evaluation Methods, Measures (Individuals), Student Evaluation, Validity

Development of a Situational Judgment Task for Assessing Teacher Leadership in Mathematics

Peer reviewed

Direct link

Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017

Increasingly, content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…

Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education

Equity "On the Sideline" of Evaluations: A Mixed-Methods Study of New England Evaluators' Practices in 2020

Peer reviewed

Direct link

Gates, Emily; Benitez Alvarez, Kayla M. – AERA Online Paper Repository, 2022

Evaluators have opportunities to advance equity within evaluations, yet little research has examined whether and how evaluators center equity in evaluation practice. This paper explores whether and how evaluators in New England address inequities and advance equity throughout evaluation phases. The study uses a complementarity, sequential mixed…

Descriptors: Evaluators, Professional Development, Context Effect, Social Justice

Procedure for Assessment of the Cognitive Complexity of the Problems with a Limiting Reactant

Peer reviewed
PDF on ERIC

Download full text

Horvat, Saša A.; Rodic, Dušica D.; Roncevic, Tamara N.; Babic-Kekez, Snežana; Horvat, Bojana Trifunovic – International Baltic Symposium on Science and Technology Education, 2021

Mathematical calculations are an important part of chemistry. These problems are difficult for students, especially if the task is set with a limiting reactant. The aim of this study was to develop a procedure for evaluating the cognitive complexity of stoichiometric tasks with a limiting reactant. The procedure created included an…

Descriptors: Likert Scales, Chemistry, Science Instruction, Task Analysis

Using Rasch Measurement Theory for Responsive Program Evaluation

Peer reviewed

Direct link

Clairmont, Albert Anthony; Katz, Daniel; Wilton, Mike – AERA Online Paper Repository, 2021

This study demonstrates the importance of Rasch Measurement Theory (RMT) in program evaluation when outcome measures need to be constructed from scratch. The paper introduces typical measure validation methods presented in program evaluation texts and discusses room for improvement. The study then illustrates how the seamless transitions from…

Descriptors: Program Evaluation, Measurement Techniques, Validity, Ethnography

A Whole-School Approach to Promoting Staff Wellbeing

Peer reviewed
PDF on ERIC

Download full text

Lester, Leanne; Cefai, Carmel; Cavioni, Valeria; Barnes, Amy; Cross, Donna – Australian Journal of Teacher Education, 2020

A caring school community can enhance whole-school wellbeing, including the wellbeing of school staff, which directly impacts on student academic, social and emotional wellbeing. This study first examines the validity and reliability of a proposed whole-school staff wellbeing evaluation tool which uses a set of whole-school wellbeing indicators to…

Descriptors: Well Being, School Personnel, Test Construction, Test Validity

Validity as Threshold Concept to Develop Assessment Cultures in Schools

Peer reviewed

Direct link

Sandvik, Lise Vikan; Fjoertoft, Henning – AERA Online Paper Repository, 2016

This paper reports findings from a nationwide research project in Norway called "Research on individual assessment in schools" (FIVIS), with the main purpose to gain knowledge of how assessment stimulates learning and what characterizes school practices and classroom practices when assessment is used as a tool for learning. The project…

Descriptors: Foreign Countries, Validity, Fundamental Concepts, Evaluation Methods

Adapting Scale for Children: A Practical Model for Researchers

Download full text

Aydin, Selami; Harputlu, Leyla; Çelik, Seyda Savran; Ustuk, Özgehan; Güzel, Serhat; Genç, Deniz – Online Submission, 2016

Measurement of children's behaviors in an educational and research context is a problematic and complex area. Adapting scales to measure children's behaviors in such a context is also a complex process, for several reasons. First, cultural elements constitute a considerable problem. Second, it is difficult…

Descriptors: Child Behavior, Models, Test Construction, Test Validity

Development of an Assessment Tool for Positive Experiences about Science (PES)

Peer reviewed
PDF on ERIC

Download full text

Shin, Youngjoon; Seo, Hae-Ae; Hong, Jun-Euy – International Baltic Symposium on Science and Technology Education, 2019

This research aimed to develop an assessment tool for students' Positive Experiences about Science (PES). A preliminary version of PES was developed through literature review, consisting of academic emotion, self-concept, learning motivation, career aspiration, and attitude in science. A pilot test was conducted with 198 students and a main test…

Descriptors: Positive Attitudes, Student Experience, Science Education, Evaluation Methods

Spectral Bayesian Knowledge Tracing

Download full text

Falakmasir, Mohammad; Yudelson, Michael; Ritter, Steve; Koedinger, Ken – International Educational Data Mining Society, 2015

Bayesian Knowledge Tracing (BKT) has been in wide use for modeling student skill acquisition in Intelligent Tutoring Systems (ITS). BKT tracks and updates a student's latent mastery of a skill as a probability distribution of a binary variable. BKT does so by accounting for observed student successes in applying the skill correctly, where success is…

Descriptors: Bayesian Statistics, Models, Skill Development, Intelligent Tutoring Systems


Source

Online Submission	12
AERA Online Paper Repository	9
North American Chapter of the…	4
Psychological Assessment	3
Academic Medicine	2
Educational Measurement:…	2
International Baltic…	2
International Educational…	2
International Group for the…	2
American Journal of Evaluation	1
Appalachia Educational…	1
Applied Psychological…	1
Association for Educational…	1
Association for Institutional…	1
Australian Journal of Teacher…	1
Bulgarian Comparative…	1
Center for Educational…	1
Educational Assessment	1
Educational Researcher	1
Evaluation Practice	1
Intelligence	1
Pearson	1
Public Libraries	1
Research-publishing.net	1

Author

Bastick, Tony	4
Sireci, Stephen G.	3
Thompson, Bruce	3
Baker, Eva L.	2
Bard, E. M.	2
Benor, Dan E.	2
Capie, William	2
Cook, Colleen	2
Dereshiwsky, Mary I.	2
Ellett, Chad D.	2
Evans, Lynn	2
Hickey, Daniel T.	2
Impara, James C.	2
Jaeger, Richard M.	2
Kindfield, Ann C. H.	2
Linn, Robert L.	2
Mott, Michael S.	2
Perry, Joseph D.	2
Philippou, George	2
Plake, Barbara S.	2
Pollard, John D. E.	2
Tucker, Null A.	2
Webster, William J.	2
Wolfe, Edward W.	2

Assessments and Surveys

ACT Assessment	2
Behavioral and Emotional…	2
Iowa Tests of Basic Skills	2
National Assessment of…	2
National Teacher Examinations	2
Strong Campbell Interest…	2
Trends in International…	2
Behavior Assessment System…	1
Bem Sex Role Inventory	1
Child Behavior Checklist	1
Comprehensive Tests of Basic…	1
Continuous Performance Test	1
Group Assessment of Logical…	1
Group Embedded Figures Test	1
Learning Style Inventory	1
Levels of Use of the…	1
Locke Wallace Marital…	1
Motivated Strategies for…	1
Myers Briggs Type Indicator	1
National Survey of Student…	1
Praxis Series	1
Productivity Environmental…	1
Program for International…	1
Rokeach Value Survey	1
Self Directed Learning…	1