NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Elementary and Secondary…1
What Works Clearinghouse Rating
Showing 1 to 15 of 22 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Peer reviewed Peer reviewed
Direct linkDirect link
Nicole Marx; Wolfgang Mann – Journal of Multilingual and Multicultural Development, 2025
Language assessment is a central aspect not only of language education in the general population, but also amongst heterogeneous, low-incidence populations. One such population are immigrant deaf and hard-of-hearing learners (IDML) who are bimodal-multilingual and whose languages development often includes the spoken, written, and/or signed…
Descriptors: Foreign Countries, German, Sign Language, Immigrants
Peer reviewed Peer reviewed
Direct linkDirect link
Edward G. J. Stevenson; Jil Molenaar; David-Paul Pertaub; Dessalegn Tekle – Field Methods, 2025
Is it possible to measure wealth and poverty across settings while being faithful to local understandings? The stages of progress method (SoP) attempts to do this by building ladders of wealth in locally relevant terms and using these in comparisons across groups. This approach is potentially useful among pastoralist populations where monetary…
Descriptors: Foreign Countries, Poverty, Social Mobility, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Peer reviewed Peer reviewed
Direct linkDirect link
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Peer reviewed Peer reviewed
Holland, Thomas A.; And Others – Journal of Vocational Behavior, 1974
Significant relationships between the Holland Vocational Preference Inventory (VPI) and the Strong Vocational Interest Blank (SVIB) were again empirically demonstrated in this study, and conversion equations were developed to use standard scores of SVIB scales, rather than items, to produce estimates of VPI scores. (Author)
Descriptors: Comparative Analysis, Comparative Testing, Evaluation Methods, Occupational Aspiration
Peer reviewed Peer reviewed
Ridley, Stanley E.; Bayton, James A. – Journal of Consulting and Clinical Psychology, 1983
Examined and compared the validity of Friedman's Developmental Level (DL) and Exner's Developmental Quality (DQ) as measures of cognitive development in children (N=134). Results supported the convergent and discriminant validity of both DL and DQ. The DL and DQ were most strongly related to different types of cognitive ability. (JAC)
Descriptors: Children, Cognitive Ability, Cognitive Development, Cognitive Measurement
Peer reviewed Peer reviewed
Anglin, M. Douglas; And Others – Evaluation Review, 1993
Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)
Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods
Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982
Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…
Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education
Peer reviewed Peer reviewed
Martin, Gary L.; Newman, Ian M. – Journal of Drug Education, 1988
Compared adolescent cigarette smoking rates determined by traditional questionnaire, random response questionnaire, and carbon monoxide test. Results from 1,160 ninth graders in 40 classrooms in 7 schools indicated that random response questionnaire elicited statistically larger proportion of smokers than did traditional questionnaire. Neither…
Descriptors: Adolescents, Comparative Testing, Evaluation Methods, Grade 9
Bezruczko, Nikolaus; Schroeder, David H. – 1989
An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…
Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation
Fish, Owen W. – 1979
Two ESEA Title I evaluation models developed by the Resource Management Corporation (RMC), were field tested simultaneously with 560 Title I reading students, grades 2-8. Measuring instruments for models 1 and 2 were, respectively, the California Achievement Test (reading vocabulary section), a norm-referenced test; and the Tarmac Reading…
Descriptors: Achievement Gains, Comparative Testing, Compensatory Education, Criterion Referenced Tests
American Association of School Administrators, Washington, DC. – 1966
In this publication, designed to serve interested laymen as well as educators, various authors explore the viewpoints of the proponents and the opponents of the National Assessment Program. In their analysis of assessment and its related issues, these authors attempt to provide information that could serve as a basis for an objective consideration…
Descriptors: Achievement Tests, Comparative Analysis, Comparative Testing, Curriculum Evaluation
Dowd, Steven B. – 1992
An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…
Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education
Previous Page | Next Page ยป
Pages: 1  |  2