Publication Date
In 2025 | 2 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 5 |
Descriptor
Comparative Testing | 17 |
Evaluation Methods | 17 |
Test Reliability | 17 |
Test Validity | 10 |
Foreign Countries | 5 |
Computer Assisted Testing | 3 |
Diagnostic Tests | 3 |
Elementary Secondary Education | 3 |
Factor Analysis | 3 |
Factor Structure | 3 |
Higher Education | 3 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 13 |
Journal Articles | 9 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 3 |
Collected Works - Proceedings | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 2 | 1 |
Primary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Beck Depression Inventory | 1 |
Hamilton Rating Scale for… | 1 |
Maslach Burnout Inventory | 1 |
Minnesota Multiphasic… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Morrison, Keith – Educational Research and Evaluation, 2013
This paper reviews the literature on comparing online and paper course evaluations in higher education and provides a case study of a very large randomised trial on the topic. It presents a mixed but generally optimistic picture of online course evaluations with respect to response rates, what they indicate, and how to increase them. The paper…
Descriptors: Literature Reviews, Course Evaluation, Case Studies, Higher Education
Templer, Donald I.; Hartlage, Lawrence C. – J Clin Psychol, 1969
Descriptors: Clinical Diagnosis, Comparative Testing, Evaluation Methods, Intelligence Quotient

Hesselbrock, Michie N.; And Others – Journal of Consulting and Clinical Psychology, 1983
Compared three instruments assessing depression in alcoholics: Diagnostic and Statistical Manual of Mental Disorders (DSM-II), the Minnesota Multiphasic Personality Inventory Depression scale (MMPI D), and the Beck Depression Inventory (BDI). The number of subjects who were diagnosed as "depressed" varied considerably according to the…
Descriptors: Alcoholism, Comparative Testing, Depression (Psychology), Diagnostic Tests

O'Hara, Michael W.; Rehm, Lynn P. – Journal of Consulting and Clinical Psychology, 1983
Used the intraclass correlation coefficient to estimate the interrater reliability of judgments of clinician and novice raters of depressed females (N=20) who took the Hamilton Rating Scale for Depression (HRSD). Expert and student raters both made reliable ratings on the HRSD. Criterion validity for student raters was also satisfactory.…
Descriptors: College Students, Comparative Testing, Cost Effectiveness, Counselor Role

Anglin, M. Douglas; And Others – Evaluation Review, 1993
Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)
Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods
Bezruczko, Nikolaus; Schroeder, David H. – 1989
An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…
Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation
Costantino, Giuseppe; And Others – 1989
Attention deficits and attention deficit-hyperactivity disorder (AD-HD) are regarded as relatively common disorders among school-age children, but the literature reveals several confounding factors with the standard assessment techniques for the disorder. Using a structured thematic apperception technique (the TEMAS Apperception Test of G.…
Descriptors: Adolescents, Attention Deficit Disorders, Children, Comparative Testing
Babcock, Judith L.; And Others – 1992
This study used multiple methods to assess basic community needs and attributes of community atmosphere (cohesion, religious involvement, and recreational activities) in two psychometric studies. Part 1 revised self-report community assessment measures, developed multi-item scales for each construct, and tested reliabilities and factor structures…
Descriptors: Community Needs, Community Organizations, Community Programs, Comparative Testing
Awomolo, Ademola – 1992
The evolution of the West African Examinations Council (WAEC) Senior School Certificate Examination (SSCE) and certification process is traced. The challenges posed by combining, for certification purposes, the scores from internal and external assessments of school candidates are discussed in the face of the low reliability of teachers'…
Descriptors: Academic Standards, Comparative Testing, Educational Assessment, Educational Certificates
Weiss, David J., Ed. – 1980
This report is the Proceedings of the third conference of its type. Included are 23 of the 25 papers presented at the conference, discussion of these papers by invited discussants, and symposium papers by a group of leaders in adaptive testing and latent trait test theory research and applications. The papers are organized into the following…
Descriptors: Academic Ability, Academic Achievement, Comparative Testing, Computer Assisted Testing

Byrne, Barbara M. – Multivariate Behavioral Research, 1991
The factorial validity of the Maslach Burnout Inventory (MBI) and the equivalence of factorial measurements and structure across groups were studied for 163 intermediate-grade teachers, 162 secondary school teachers, and 218 university teachers in Canada. Reasons why the MBI may not be appropriate for university educators are discussed. (SLD)
Descriptors: Comparative Testing, Elementary School Teachers, Elementary Secondary Education, Emotional Adjustment
Previous Page | Next Page ยป
Pages: 1 | 2