ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	6

Descriptor

Comparative Testing	22
Evaluation Methods	22
Test Validity	22
Test Reliability	10
Foreign Countries	6
Comparative Analysis	4
Scores	4
Error of Measurement	3
Higher Education	3
Longitudinal Studies	3
Questionnaires	3
Academic Ability	2
Academic Achievement	2
Academic Standards	2
Achievement Gains	2
Achievement Tests	2
Children	2
Computer Assisted Testing	2
Educational Assessment	2
Elementary Education	2
Elementary Secondary Education	2
Evaluation Criteria	2
Evaluation Research	2
Factor Analysis	2
Factor Structure	2
More ▼

Source

Advances in Physiology…	1
Evaluation Review	1
Field Methods	1
Grantee Submission	1
International Review of…	1
Journal of Consulting and…	1
Journal of Drug Education	1
Journal of Educational…	1
Journal of Multilingual and…	1
Journal of Vocational Behavior	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
School Science and Mathematics	1
More ▼

Publication Type

Reports - Research	17
Journal Articles	13
Reports - Evaluative	4
Speeches/Meeting Papers	3
Collected Works - Proceedings	1

Education Level

Higher Education	3
Postsecondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 2	1
Primary Education	1

Audience

Location

Germany	2
Canada	1
Ethiopia	1
Illinois (Chicago)	1
Kenya	1
Nigeria	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Holland Vocational Preference…	1
Maslach Burnout Inventory	1
National Assessment of…	1
Strong Vocational Interest…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Assessing Vocabulary Knowledge in Written and Signed Languages of Immigrant DHH Learners -- Examining Convergent Validity

Peer reviewed

Direct link

Nicole Marx; Wolfgang Mann – Journal of Multilingual and Multicultural Development, 2025

Language assessment is a central aspect not only of language education in the general population, but also amongst heterogeneous, low-incidence populations. One such population are immigrant deaf and hard-of-hearing learners (IDML) who are bimodal-multilingual and whose languages development often includes the spoken, written, and/or signed…

Descriptors: Foreign Countries, German, Sign Language, Immigrants

Poverty and Wealth without a Ladder? An Appraisal of the Stages of Progress Method among Agro-Pastoralists in Ethiopia's Lower Omo Valley

Peer reviewed

Direct link

Edward G. J. Stevenson; Jil Molenaar; David-Paul Pertaub; Dessalegn Tekle – Field Methods, 2025

Is it possible to measure wealth and poverty across settings while being faithful to local understandings? The stages of progress method (SoP) attempts to do this by building ladders of wealth in locally relevant terms and using these in comparisons across groups. This approach is potentially useful among pastoralist populations where monetary…

Descriptors: Foreign Countries, Poverty, Social Mobility, Evaluation Methods

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Direct link

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Assessing Reading Fluency in Kenya: Oral or Silent Assessment?

Peer reviewed

Direct link

Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015

In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…

Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading

Empirical Derivation of SVIB-Holland Scales and Conversion Tables

Peer reviewed

Holland, Thomas A.; And Others – Journal of Vocational Behavior, 1974

Significant relationships between the Holland Vocational Preference Inventory (VPI) and the Strong Vocational Interest Blank (SVIB) were again empirically demonstrated in this study, and conversion equations were developed to use standard scores of SVIB scales, rather than items, to produce estimates of VPI scores. (Author)

Descriptors: Comparative Analysis, Comparative Testing, Evaluation Methods, Occupational Aspiration

Validity of Two Scoring Systems for Measuring Cognitive Development with the Rorschach.

Peer reviewed

Ridley, Stanley E.; Bayton, James A. – Journal of Consulting and Clinical Psychology, 1983

Examined and compared the validity of Friedman's Developmental Level (DL) and Exner's Developmental Quality (DQ) as measures of cognitive development in children (N=134). Results supported the convergent and discriminant validity of both DL and DQ. The DL and DQ were most strongly related to different types of cognitive ability. (JAC)

Descriptors: Children, Cognitive Ability, Cognitive Development, Cognitive Measurement

Reliability and Validity of Retrospective Behavioral Self-Report by Narcotics Addicts.

Peer reviewed

Anglin, M. Douglas; And Others – Evaluation Review, 1993

Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)

Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods

Traditional versus Rasch Scaling of Aggregate Data in the Multitrait-Multimethod Matrix.

Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982

Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…

Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education

Assessing the Validity of Self-Reported Adolescent Cigarette Smoking.

Peer reviewed

Martin, Gary L.; Newman, Ian M. – Journal of Drug Education, 1988

Compared adolescent cigarette smoking rates determined by traditional questionnaire, random response questionnaire, and carbon monoxide test. Results from 1,160 ninth graders in 40 classrooms in 7 schools indicated that random response questionnaire elicited statistically larger proportion of smokers than did traditional questionnaire. Neither…

Descriptors: Adolescents, Comparative Testing, Evaluation Methods, Grade 9

Artistic Judgment Project I: Internal-Structure Analyses. Technical Report 1989-2.

Bezruczko, Nikolaus; Schroeder, David H. – 1989

An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…

Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation

An Analysis of the Evaluation Data When ESEA, Title 1 Evaluation Models A1 and A2 are Empirically Field Tested Simultaneously.

Fish, Owen W. – 1979

Two ESEA Title I evaluation models developed by the Resource Management Corporation (RMC), were field tested simultaneously with 560 Title I reading students, grades 2-8. Measuring instruments for models 1 and 2 were, respectively, the California Achievement Test (reading vocabulary section), a norm-referenced test; and the Tarmac Reading…

Descriptors: Achievement Gains, Comparative Testing, Compensatory Education, Criterion Referenced Tests

National Educational Assessment: Pro and Con.

Download full text

American Association of School Administrators, Washington, DC. – 1966

In this publication, designed to serve interested laymen as well as educators, various authors explore the viewpoints of the proponents and the opponents of the National Assessment Program. In their analysis of assessment and its related issues, these authors attempt to provide information that could serve as a basis for an objective consideration…

Descriptors: Achievement Tests, Comparative Analysis, Comparative Testing, Curriculum Evaluation

Multiple-Choice and Alternate-Choice Questions: Description and Analysis.

Download full text

Dowd, Steven B. – 1992

An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…

Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Anglin, M. Douglas	1
Awomolo, Ademola	1
Banta, Trudy W.	1
Bayton, James A.	1
Bezruczko, Nikolaus	1
Byrne, Barbara M.	1
Cantrell, Pamela	1
David-Paul Pertaub	1
Dessalegn Tekle	1
Dowd, Steven B.	1
Edward G. J. Stevenson	1
Fish, Owen W.	1
Hamid Mohammadi	1
Holland, Thomas A.	1
Jil Molenaar	1
Ke-Hai Yuan	1
Lijuan Wang	1
Mark J. Gierl	1
Martin, Gary L.	1
Naron, Nancy Klastorin	1
Newman, Ian M.	1
Nicole Marx	1
Ole J. Kemi	1
Pike, Gary	1
More ▼