Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 11 |
Descriptor
| Comparative Testing | 30 |
| Evaluation Methods | 30 |
| Test Validity | 22 |
| Test Reliability | 11 |
| Comparative Analysis | 7 |
| Foreign Countries | 7 |
| Scores | 5 |
| Academic Achievement | 4 |
| Achievement Gains | 4 |
| Construct Validity | 4 |
| Educational Assessment | 4 |
| More ▼ | |
Source
Author
Publication Type
| Reports - Research | 21 |
| Journal Articles | 18 |
| Reports - Evaluative | 8 |
| Speeches/Meeting Papers | 4 |
| Collected Works - Proceedings | 1 |
Education Level
| Higher Education | 4 |
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Postsecondary Education | 2 |
| Early Childhood Education | 1 |
| Grade 2 | 1 |
| Primary Education | 1 |
Audience
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
| Holland Vocational Preference… | 1 |
| Maslach Burnout Inventory | 1 |
| McCarthy Scales of Childrens… | 1 |
| National Assessment of… | 1 |
| Sixteen Personality Factor… | 1 |
| Strong Vocational Interest… | 1 |
What Works Clearinghouse Rating
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Nicole Marx; Wolfgang Mann – Journal of Multilingual and Multicultural Development, 2025
Language assessment is a central aspect not only of language education in the general population, but also amongst heterogeneous, low-incidence populations. One such population are immigrant deaf and hard-of-hearing learners (IDML) who are bimodal-multilingual and whose languages development often includes the spoken, written, and/or signed…
Descriptors: Foreign Countries, German, Sign Language, Immigrants
Edward G. J. Stevenson; Jil Molenaar; David-Paul Pertaub; Dessalegn Tekle – Field Methods, 2025
Is it possible to measure wealth and poverty across settings while being faithful to local understandings? The stages of progress method (SoP) attempts to do this by building ladders of wealth in locally relevant terms and using these in comparisons across groups. This approach is potentially useful among pastoralist populations where monetary…
Descriptors: Foreign Countries, Poverty, Social Mobility, Evaluation Methods
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Wida Wemmer-Rogh; Urs Grob; Charalambos Y. Charalambous; Anna-Katharina Praetorius – ZDM: Mathematics Education, 2024
Recent publications emphasize the need to take greater account of differences in teaching quality between subjects. The empirical analysis of this topic requires a comparison of teaching quality in different subjects to distinguish generic aspects of teaching quality from subject-specific ones. In this paper, we compare teaching quality in…
Descriptors: Foreign Countries, Elementary School Mathematics, Elementary School Students, Elementary School Teachers
St.Clair, Travis; Cook, Thomas D.; Hallberg, Kelly – American Journal of Evaluation, 2014
Although evaluators often use an interrupted time series (ITS) design to test hypotheses about program effects, there are few empirical tests of the design's validity. We take a randomized experiment on an educational topic and compare its effects to those from a comparative ITS (CITS) design that uses the same treatment group as the experiment…
Descriptors: Time, Evaluation Methods, Measurement Techniques, Research Design
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Williams, Rihana Shiri; Ari, Omer; Santamaria, Carmen Nicole – Journal of Research in Reading, 2011
Recent investigations challenge the construct validity of sustained silent reading tests. Performance of two groups of post-secondary students (e.g. struggling and non-struggling) on a sustained silent reading test and two types of cloze test (i.e. maze and open-ended) was compared in order to identify the test format that contributes greater…
Descriptors: Evidence, Cloze Procedure, Reading Comprehension, Investigations
Pike, Gary; Banta, Trudy W. – 1989
The purpose of this paper is (1) to discuss a set of standards that can be used to evaluate potential assessment instruments; and (2) to use these standards to evaluate the American College Testing Program's College Outcomes Measures Program (ACT-COMP) and the Educational Testing Service (ETS) Academic Profile. Using the work of S. Messick (1975,…
Descriptors: Academic Ability, Achievement Tests, College Seniors, Comparative Testing
Cantrell, Pamela – School Science and Mathematics, 2003
The difference in gain scores produced by traditional pretests and those produced by retrospective pretests when compared to posttest scores on the Science Teaching Efficacy Belief Instrument for preservice teachers was investigated in this study. Results indicated that gain scores using the traditional pretest produced significant improvement in…
Descriptors: Pretests Posttests, Validity, Scores, Preservice Teachers
Peer reviewedHolland, Thomas A.; And Others – Journal of Vocational Behavior, 1974
Significant relationships between the Holland Vocational Preference Inventory (VPI) and the Strong Vocational Interest Blank (SVIB) were again empirically demonstrated in this study, and conversion equations were developed to use standard scores of SVIB scales, rather than items, to produce estimates of VPI scores. (Author)
Descriptors: Comparative Analysis, Comparative Testing, Evaluation Methods, Occupational Aspiration
Miron, Gary; Applegate, Brooks – Education and the Public Interest Center, 2009
The Center for Research on Education Outcomes (CREDO) at Stanford University conducted a large-scale analysis of the impact of charter schools on student performance. The center's data covered 65-70% of the nation's charter schools. Although results varied by state, 17% of the charter school students have significantly higher math results than …
Descriptors: Evidence, Traditional Schools, Charter Schools, Program Effectiveness
Reardon, Sean F. – Education and the Public Interest Center, 2009
"How New York City's Charter Schools Affect Achievement" estimates the effects on student achievement of attending a New York City charter school rather than a traditional public school and investigates the characteristics of charter schools associated with the most positive effects on achievement. Because the report relies on an…
Descriptors: Charter Schools, Academic Achievement, Achievement Gains, Achievement Rating
Peer reviewedRidley, Stanley E.; Bayton, James A. – Journal of Consulting and Clinical Psychology, 1983
Examined and compared the validity of Friedman's Developmental Level (DL) and Exner's Developmental Quality (DQ) as measures of cognitive development in children (N=134). Results supported the convergent and discriminant validity of both DL and DQ. The DL and DQ were most strongly related to different types of cognitive ability. (JAC)
Descriptors: Children, Cognitive Ability, Cognitive Development, Cognitive Measurement
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
