Showing 1 to 15 of 193 results
Peer reviewed
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Peer reviewed
Elayne P. Colón; Lori M. Dassa; Thomas M. Dana; Nathan P. Hanson – Action in Teacher Education, 2024
To meet accreditation expectations, teacher preparation programs must demonstrate their candidates are evaluated using summative assessment tools that yield sound, reliable, and valid data. These tools are primarily used by the clinical experience team -- university supervisors and mentor teachers. Institutional beliefs regarding best practices…
Descriptors: Student Teachers, Teacher Interns, Evaluation Methods, Interrater Reliability
Peer reviewed; PDF on ERIC
Holcomb, T. Scott; Lambert, Richard; Bottoms, Bryndle L. – Journal of Educational Supervision, 2022
In this study, various statistical indexes of agreement were calculated using empirical data from a group of evaluators (n = 45) of early childhood teachers. The group of evaluators rated ten fictitious teacher profiles using the North Carolina Teacher Evaluation Process (NCTEP) rubric. The exact and adjacent agreement percentages were calculated…
Descriptors: Interrater Reliability, Teacher Evaluation, Statistical Analysis, Early Childhood Teachers
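The exact and adjacent agreement percentages mentioned in the Holcomb, Lambert, and Bottoms abstract are straightforward to compute. A minimal sketch, using made-up ratings rather than the NCTEP study data:

```python
# Hypothetical illustration (not the NCTEP data): exact and adjacent
# agreement between two raters scoring the same profiles on a 1-5 rubric.

def agreement_percentages(ratings_a, ratings_b):
    """Return (exact, adjacent) agreement as percentages.

    Exact: both raters assign the same score.
    Adjacent: the scores differ by at most one scale point.
    """
    pairs = list(zip(ratings_a, ratings_b))
    exact = sum(a == b for a, b in pairs) / len(pairs) * 100
    adjacent = sum(abs(a - b) <= 1 for a, b in pairs) / len(pairs) * 100
    return exact, adjacent

rater_1 = [3, 4, 2, 5, 3, 4, 1, 3]
rater_2 = [3, 3, 2, 4, 5, 4, 1, 2]

exact, adjacent = agreement_percentages(rater_1, rater_2)
print(f"Exact: {exact:.1f}%  Adjacent: {adjacent:.1f}%")  # Exact: 50.0%  Adjacent: 87.5%
```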
Peer reviewed
Braun, Robert; Ravn, Tine; Frankus, Elisabeth – Research Ethics, 2020
In this paper we reflect on the looming question of what constitutes expertise in ethics. Based on an empirical program that involved qualitative and quantitative as well as participatory research elements we show that expertise in research ethics and integrity is based on experience in the assessment processes. We then connect traditional…
Descriptors: Integrity, Ethics, Participatory Research, Qualitative Research
Peer reviewed
Lambie, Glenn W.; Mullen, Patrick R.; Swank, Jacqueline M.; Blount, Ashley – Measurement and Evaluation in Counseling and Development, 2018
Supervisors evaluated counselors-in-training at multiple points during their practicum experience using the Counseling Competencies Scale (CCS; N = 1,070). The CCS evaluations were randomly split to conduct exploratory factor analysis and confirmatory factor analysis, resulting in a 2-factor model (61.5% of the variance explained).
Descriptors: Counselor Training, Counseling, Measures (Individuals), Competence
Peer reviewed; PDF on ERIC
Garcia-Garzon, Eduardo; Abad, Francisco J.; Garrido, Luis E. – Journal of Intelligence, 2019
There has been increased interest in assessing the quality and usefulness of short versions of the Raven's Progressive Matrices. A recent proposal, composed of the last twelve matrices of the Standard Progressive Matrices (SPM-LS), has been depicted as a valid measure of "g." Nonetheless, the results provided in the initial validation…
Descriptors: Intelligence Tests, Test Validity, Evaluation Methods, Undergraduate Students
Peer reviewed
Looney, Marilyn A. – Measurement in Physical Education and Exercise Science, 2018
The purpose of this article was two-fold: (1) to provide an overview of the commonly reported and under-reported absolute agreement indices for continuous data in the kinesiology literature; and (2) to present examples of these indices for hypothetical data, along with recommendations for future use. It is recommended that three types of information be…
Descriptors: Interrater Reliability, Evaluation Methods, Kinetics, Indexes
Peer reviewed
Margulieux, Lauren; Ketenci, Tuba Ayer; Decker, Adrienne – Computer Science Education, 2019
Background and context: The variables that researchers measure and how they measure them are central in any area of research, including computing education. Which research questions can be asked and how they are answered depends on measurement. Objective: To summarize the commonly used variables and measurements in computing education and to…
Descriptors: Measurement Techniques, Standards, Evaluation Methods, Computer Science Education
Peer reviewed
Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018
The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…
Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
Peer reviewed
Radley, Keith C.; Dart, Evan H.; Wright, Sarah J. – School Psychology Quarterly, 2018
Research based on single-case designs (SCD) is frequently utilized in educational settings to evaluate the effect of an intervention on student behavior. Visual analysis is the primary method of evaluation of SCD, despite research noting concerns regarding the reliability of the procedure. Recent research suggests that characteristics of the graphic…
Descriptors: Graphs, Evaluation Methods, Data, Intervention
Peer reviewed
Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017
Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…
Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy
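For readers unfamiliar with chance-corrected agreement indices of the kind Cousineau and Laurencelle discuss, a standard two-rater example is Cohen's kappa. This is illustrative only; it is not the global multi-rater index the article proposes:

```python
# Illustrative sketch: Cohen's kappa, a chance-corrected agreement index
# for two raters over nominal categories. The article above concerns
# *global* agreement among many raters; this is only the familiar
# two-rater baseline, shown with made-up data.
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    n = len(ratings_a)
    # Observed proportion of exact agreements
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    # Expected agreement if both raters chose categories at random,
    # in proportion to their own marginal frequencies
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / n ** 2
    return (observed - expected) / (1 - expected)

print(cohens_kappa([1, 2, 3, 1, 2], [1, 2, 3, 2, 2]))  # kappa = 0.6875
```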
Peer reviewed
Stefanic, Nicholas; Randles, Clint – Music Education Research, 2015
The purpose of this study was to explore the reliability of measures of both individual and group creative work using the consensual assessment technique (CAT). CAT was used to measure individual and group creativity among a population of pre-service music teachers enrolled in a secondary general music class (n = 23) and was evaluated from…
Descriptors: Music Education, Creativity, Preservice Teachers, Music Teachers
Peer reviewed; PDF on ERIC
Regional Educational Laboratory West, 2020
These are the appendixes for the report, "The Association between Teachers' Use of Formative Assessment Practices and Students' Use of Self-Regulated Learning Strategies." Two appendixes are included in this document. Appendix A presents the methods of the study, including the reliability of the teacher and student surveys and the…
Descriptors: Formative Evaluation, Learning Strategies, Elementary School Students, Elementary School Teachers
Peer reviewed
Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Christ, Theodore J.; Desjardins, Christopher David – Journal of Psychoeducational Assessment, 2018
Curriculum-Based Measurement of Oral Reading (CBM-R) is often used to monitor student progress and guide educational decisions. Ordinary least squares regression (OLSR) is the most widely used method to estimate the slope, or rate of improvement (ROI), even though published research demonstrates OLSR's lack of validity and reliability, and…
Descriptors: Bayesian Statistics, Curriculum Based Assessment, Oral Reading, Least Squares Statistics
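The ordinary least squares slope (rate of improvement, ROI) that Christ and Desjardins critique is easy to reproduce. A minimal sketch with hypothetical weekly CBM-R scores; the article itself argues for Bayesian alternatives, which are not shown here:

```python
# Hypothetical illustration of the conventional OLSR slope (rate of
# improvement) used in CBM-R progress monitoring: words read correctly
# per minute (wcpm) regressed on week number. Data are invented.

def ols_slope(weeks, scores):
    """Least-squares slope: estimated gain in words correct per week."""
    n = len(weeks)
    mean_x = sum(weeks) / n
    mean_y = sum(scores) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(weeks, scores))
    sxx = sum((x - mean_x) ** 2 for x in weeks)
    return sxy / sxx

weeks = [1, 2, 3, 4, 5, 6]
wcpm = [52, 55, 54, 60, 63, 65]   # hypothetical weekly scores

print(f"ROI: {ols_slope(weeks, wcpm):.2f} words/week")
```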