ERIC - Search Results

Showing 1 to 15 of 131 results

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Application of Rasch Model in Two-Tier Test for Assessing Critical Thinking in Physics Education

Peer reviewed

Sujiyani Kassiavera; A. Suparmi; C. Cari; Sukarmin Sukarmin – Journal of Baltic Science Education, 2024

The challenge of accurately assessing critical thinking in physics education, particularly on topics like work and energy, remains a key issue for educators. The current study aims to address this challenge by exploring students' critical thinking abilities using two-tier test data analyzed through the Rasch model. Data were collected from…

Descriptors: Critical Thinking, Physics, Science Instruction, Foreign Countries

Modeling Mediation in the Dynamic Assessment of Listening Ability from the Cognitive Diagnostic Perspective

Peer reviewed

Meng, Yaru; Fu, Hua – Modern Language Journal, 2023

The distinguishing feature of dynamic assessment (DA) is the dialectical integration of assessment and instruction. However, how to design the targeted instruction or mediation has been relatively underexplored. To address this gap, this study proposes the attribute-based mediation model (AMM), an English-as-a-foreign-language listening mediation…

Descriptors: Evaluation Methods, Teaching Methods, Models, English (Second Language)

The Operations Triad Model and Youth Mental Health Assessments: Catalyzing a Paradigm Shift in Measurement Validation

Peer reviewed

Andres De Los Reyes; Mo Wang; Matthew D. Lerner; Bridget A. Makol; Olivia M. Fitzpatrick; John R. Weisz – Grantee Submission, 2022

Researchers strategically assess youth mental health by soliciting reports from multiple informants. Typically, these informants (e.g., parents, teachers, youth themselves) vary in the social contexts where they observe youth. Decades of research reveal that the most common data conditions produced with this approach consist of discrepancies…

Descriptors: Mental Health, Measurement Techniques, Evaluation Methods, Research

Evidence Regarding the Internal Structure: Confirmatory Factor Analysis

Peer reviewed

Lewis, Todd F. – Measurement and Evaluation in Counseling and Development, 2017

American Educational Research Association (AERA) standards stipulate that researchers show evidence of the internal structure of instruments. Confirmatory factor analysis (CFA) is one structural equation modeling procedure designed to assess construct validity of assessments that has broad applicability for counselors interested in instrument…

Descriptors: Educational Research, Factor Analysis, Structural Equation Models, Construct Validity

Evaluation Tool for Traditional Chinese Medicine Students in China: A Competency Perspective

Peer reviewed

Zheng, Boyang; Sun, Guiping; Wang, Hourong – SAGE Open, 2019

Traditional Chinese medicine (TCM) is an important component of China's medical system. How to educate TCM practitioners in China, therefore, has become a crucial issue. To contribute to this issue, the current research identified the competency model of TCM practitioners in China and developed an evaluation for TCM students. We combined Bloom's…

Descriptors: Medical Students, Correlation, Foreign Countries, Test Reliability

Modeling Statistics ITAs' Speaking Performances in a Certification Test

Ziwei Zhou – ProQuest LLC, 2020

In light of the ever-increasing capability of computer technology and advancement in speech and natural language processing techniques, automated speech scoring of constructed responses is gaining popularity in many high-stakes assessment and low-stakes educational settings. Automated scoring is a highly interdisciplinary and complex subject, and…

Descriptors: Certification, Speech Skills, Automation, Scoring

Approaches for Combining Multiple Measures of Teacher Performance: Reliability, Validity, and Implications for Evaluation Policy

Peer reviewed

Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016

A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…

Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy

All Sizzle and No Steak: Value-Added Model Doesn't Add Value in Houston

Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017

Houston's experience with the Educational Value-Added Assessment System® (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…

Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

The State Kindergarten Entry Assessment Digital Technology Landscape. PERC Report and ETS Research Report Series No. RR-20-26

Peer reviewed

Ackerman, Debra J. – ETS Research Report Series, 2020

Over the past 8 years, U.S. kindergarten classrooms have been impacted by policies mandating or recommending the administration of a specific kindergarten entry assessment (KEA) in the initial months of school as well as the increasing reliance on digital technology in the form of mobile apps, touchscreen devices, and online data platforms. Using…

Descriptors: Kindergarten, School Readiness, Computer Assisted Testing, Preschool Teachers

Adapting Scale for Children: A Practical Model for Researchers

Aydin, Selami; Harputlu, Leyla; Çelik, Seyda Savran; Ustuk, Özgehan; Güzel, Serhat; Genç, Deniz – Online Submission, 2016

Measurement of children's behaviors in an educational and research context is a problematic and complex area. It is also evident that adapting scales to measure children's behaviors in an educational and research context is a complex process due to several reasons. First, cultural elements constitute a considerable problem. Second, it is difficult…

Descriptors: Child Behavior, Models, Test Construction, Test Validity

Teamwork Skill Assessment: Development of a Measure for Academia

Peer reviewed

Varela, Otmar; Mead, Esther – Journal of Education for Business, 2018

Popular teamwork assessments have been strongly criticized on the grounds of poor psychometric properties and their disconnect with conceptual models of teamwork. These issues raise concerns with respect to our ability to evaluate efforts devoted to advancing teamwork in academia. We report the development of a teamwork assessment that builds on…

Descriptors: Teamwork, Evaluation Methods, Test Validity, Psychometrics

Applications of Diagnostic Classification Models: A Literature Review and Critical Commentary

Peer reviewed

Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…

Descriptors: Literature Reviews, Classification, Models, Criticism
