ERIC - Search Results

Publication Date

In 2025	2
Since 2024	3
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Comparative Testing	17
Evaluation Methods	17
Test Reliability	17
Test Validity	10
Foreign Countries	5
Computer Assisted Testing	3
Diagnostic Tests	3
Elementary Secondary Education	3
Factor Analysis	3
Factor Structure	3
Higher Education	3
Interrater Reliability	3
Test Construction	3
Academic Standards	2
College Students	2
Depression (Psychology)	2
Error of Measurement	2
Interviews	2
Learning Disabilities	2
Longitudinal Studies	2
Psychological Evaluation	2
Questionnaires	2
Scores	2
Testing Problems	2
Academic Ability	1
More ▼

Source

Journal of Consulting and…	2
Advances in Physiology…	1
Educational Research and…	1
Evaluation Review	1
Grantee Submission	1
International Review of…	1
J Clin Psychol	1
Journal of Educational…	1
Multivariate Behavioral…	1

Publication Type

Reports - Research	13
Journal Articles	9
Reports - Evaluative	3
Speeches/Meeting Papers	3
Collected Works - Proceedings	1

Education Level

Higher Education	3
Postsecondary Education	3
Early Childhood Education	1
Elementary Education	1
Grade 2	1
Primary Education	1

Audience

Location

Canada	1
China	1
Germany	1
Illinois (Chicago)	1
Kenya	1
Nigeria	1

Laws, Policies, & Programs

Assessments and Surveys

Beck Depression Inventory	1
Hamilton Rating Scale for…	1
Maslach Burnout Inventory	1
Minnesota Multiphasic…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Direct link

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Assessing Reading Fluency in Kenya: Oral or Silent Assessment?

Peer reviewed

Direct link

Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015

In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…

Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading

Online and Paper Evaluations of Courses: A Literature Review and Case Study

Peer reviewed

Direct link

Morrison, Keith – Educational Research and Evaluation, 2013

This paper reviews the literature on comparing online and paper course evaluations in higher education and provides a case study of a very large randomised trial on the topic. It presents a mixed but generally optimistic picture of online course evaluations with respect to response rates, what they indicate, and how to increase them. The paper…

Descriptors: Literature Reviews, Course Evaluation, Case Studies, Higher Education

Physicians' I.Q. Estimates and Kent I.Q. Compared with WAIS I.Q

Templer, Donald I.; Hartlage, Lawrence C. – J Clin Psychol, 1969

Descriptors: Clinical Diagnosis, Comparative Testing, Evaluation Methods, Intelligence Quotient

Methodological Considerations in the Assessment of Depression in Alcoholics.

Peer reviewed

Hesselbrock, Michie N.; And Others – Journal of Consulting and Clinical Psychology, 1983

Compared three instruments assessing depression in alcoholics: Diagnostic and Statistical Manual of Mental Disorders (DSM-II), the Minnesota Multiphasic Personality Inventory Depression scale (MMPI D), and the Beck Depression Inventory (BDI). The number of subjects who were diagnosed as "depressed" varied considerably according to the…

Descriptors: Alcoholism, Comparative Testing, Depression (Psychology), Diagnostic Tests

Hamilton Rating Scale for Depression: Reliability and Validity of Judgments of Novice Raters.

Peer reviewed

O'Hara, Michael W.; Rehm, Lynn P. – Journal of Consulting and Clinical Psychology, 1983

Used the intraclass correlation coefficient to estimate the interrater reliability of judgments of clinician and novice raters of depressed females (N=20) who took the Hamilton Rating Scale for Depression (HRSD). Expert and student raters both made reliable ratings on the HRSD. Criterion validity for student raters was also satisfactory.…

Descriptors: College Students, Comparative Testing, Cost Effectiveness, Counselor Role

Reliability and Validity of Retrospective Behavioral Self-Report by Narcotics Addicts.

Peer reviewed

Anglin, M. Douglas; And Others – Evaluation Review, 1993

Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)

Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods

Artistic Judgment Project I: Internal-Structure Analyses. Technical Report 1989-2.

Bezruczko, Nikolaus; Schroeder, David H. – 1989

An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…

Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation

Assessment of Attention Deficit Disorder Using a Thematic Apperception Technique.

Download full text

Costantino, Giuseppe; And Others – 1989

Attention deficits and attention deficit-hyperactivity disorder (AD-HD) are regarded as relatively common disorders among school-age children, but the literature reveals several confounding factors with the standard assessment techniques for the disorder. Using a structured thematic apperception technique (the TEMAS Apperception Test of G.…

Descriptors: Adolescents, Attention Deficit Disorders, Children, Comparative Testing

Assessing Construct Validity in Community Evaluations: A Multitrait-Multimethod Approach.

Download full text

Babcock, Judith L.; And Others – 1992

This study used multiple methods to assess basic community needs and attributes of community atmosphere (cohesion, religious involvement, and recreational activities) in two psychometric studies. Part 1 revised self-report community assessment measures, developed multi-item scales for each construct, and tested reliabilities and factor structures…

Descriptors: Community Needs, Community Organizations, Community Programs, Comparative Testing

The Challenges of Combining Internal and External Assessment in Certificate Examinations: The West African Examinations Council Experience.

Download full text

Awomolo, Ademola – 1992

The evolution of the West African Examinations Council (WAEC) Senior School Certificate Examination (SSCE) and certification process is traced. The challenges posed by combining, for certification purposes, the scores from internal and external assessments of school candidates are discussed in the face of the low reliability of teachers'…

Descriptors: Academic Standards, Comparative Testing, Educational Assessment, Educational Certificates

Proceedings of the 1979 Computerized Adaptive Testing Conference (Wayzata, Minnesota, June 27-30, 1979).

Weiss, David J., Ed. – 1980

This report is the Proceedings of the third conference of its type. Included are 23 of the 25 papers presented at the conference, discussion of these papers by invited discussants, and symposium papers by a group of leaders in adaptive testing and latent trait test theory research and applications. The papers are organized into the following…

Descriptors: Academic Ability, Academic Achievement, Comparative Testing, Computer Assisted Testing

The Maslach Burnout Inventory: Validating Factorial Structure and Invariance across Intermediate, Secondary, and University Educators.

Peer reviewed

Byrne, Barbara M. – Multivariate Behavioral Research, 1991

The factorial validity of the Maslach Burnout Inventory (MBI) and the equivalence of factorial measurements and structure across groups were studied for 163 intermediate-grade teachers, 162 secondary school teachers, and 218 university teachers in Canada. Reasons why the MBI may not be appropriate for university educators are discussed. (SLD)

Descriptors: Comparative Testing, Elementary School Teachers, Elementary Secondary Education, Emotional Adjustment

Previous Page | Next Page »

Pages: 1 | 2

Allison, Howard K., II	1
Anglin, M. Douglas	1
Awomolo, Ademola	1
Babcock, Judith L.	1
Bezruczko, Nikolaus	1
Byrne, Barbara M.	1
Costantino, Giuseppe	1
Hamid Mohammadi	1
Hartlage, Lawrence C.	1
Hesselbrock, Michie N.	1
Ke-Hai Yuan	1
Lijuan Wang	1
Mark J. Gierl	1
Morrison, Keith	1
Naron, Nancy Klastorin	1
O'Hara, Michael W.	1
Ole J. Kemi	1
Piper, Benjamin	1
Rehm, Lynn P.	1
Schroeder, David H.	1
Tahereh Firoozi	1
Templer, Donald I.	1
Weiss, David J., Ed.	1
Zhiyong Zhang	1
More ▼