Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 11 |
Descriptor
Comparative Testing | 31 |
Foreign Countries | 31 |
Test Reliability | 31 |
Test Validity | 16 |
Test Construction | 9 |
Cross Cultural Studies | 6 |
Elementary Secondary Education | 5 |
Evaluation Methods | 5 |
Student Evaluation | 5 |
Test Format | 5 |
Adults | 4 |
More ▼ |
Source
Author
Marsh, Herbert W. | 2 |
Alwis, W. A. M. | 1 |
Awomolo, Ademola | 1 |
Beck, Klaus | 1 |
Bijsterbosch, Erik | 1 |
Bontempo, Robert | 1 |
Bradbury, Alice | 1 |
Byrne, Barbara M. | 1 |
Costantino, Giuseppe | 1 |
Elosua, Paula | 1 |
Goldstein, Harvey | 1 |
More ▼ |
Publication Type
Reports - Research | 23 |
Journal Articles | 20 |
Reports - Evaluative | 6 |
Speeches/Meeting Papers | 5 |
Collected Works - Proceedings | 1 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 5 |
Early Childhood Education | 2 |
Elementary Education | 2 |
Postsecondary Education | 2 |
Elementary Secondary Education | 1 |
Grade 2 | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 2 |
Teachers | 2 |
Location
Australia | 4 |
Canada | 4 |
China | 4 |
United States | 4 |
Ireland | 2 |
Israel | 2 |
Singapore | 2 |
United Kingdom | 2 |
United Kingdom (England) | 2 |
Argentina | 1 |
Austria | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Coopersmith Self Esteem… | 1 |
Maslach Burnout Inventory | 1 |
Raven Progressive Matrices | 1 |
Test of Economic Literacy | 1 |
Vineland Adaptive Behavior… | 1 |
What Works Clearinghouse Rating
Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024
In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biassed results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…
Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014
Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…
Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Morrison, Keith – Educational Research and Evaluation, 2013
This paper reviews the literature on comparing online and paper course evaluations in higher education and provides a case study of a very large randomised trial on the topic. It presents a mixed but generally optimistic picture of online course evaluations with respect to response rates, what they indicate, and how to increase them. The paper…
Descriptors: Literature Reviews, Course Evaluation, Case Studies, Higher Education
Lew, Magdeleine D. N.; Alwis, W. A. M.; Schmidt, Henk G. – Assessment & Evaluation in Higher Education, 2010
The purpose of the two studies presented here was to evaluate the accuracy of students' self-assessment ability, to examine whether this ability improves over time and to investigate whether self-assessment is more accurate if students believe that it contributes to improving learning. To that end, the accuracy of the self-assessments of 3588…
Descriptors: Self Evaluation (Individuals), Beliefs, Learning Processes, Correlation
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Korat, Ofra – Early Child Development and Care, 2009
The relationship between mothers' and educators' evaluation of 75 children's emergent literacy levels and actual levels were investigated. Two groups of mothers participated: mothers with a low education and mothers with a high education. The children's emergent literacy was measured. The mothers evaluated their own children and 40 teachers…
Descriptors: Mothers, Emergent Literacy, Interrater Reliability, Mother Attitudes
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests

Neto, Felix – Journal of Youth and Adolescence, 1993
The applicability of the Satisfaction With Life Scale (SWLS), developed in the United States, to another culture was assessed by investigating reliability and validity of the SWLS with 99 boys and 118 girls from Portugal. The cross-national validity of the scale and its utility with different age groups are supported. (SLD)
Descriptors: Adolescents, Age Differences, Attitude Measures, Comparative Testing

Shek, Daniel T. L.; Mak, Wai Kwong – Chinese University Education Journal, 1989
Describes the chronic and acute subscales (SOMAC and SOMAA) of the Chinese Somatic Scale that was administered to 2150 Hong Kong secondary education students to measure their psychological well being. Results showed that both the SOMAC and SOMAA had acceptable reliability and validity status as compared with similar type tests. (GG)
Descriptors: Comparative Testing, Diagnostic Tests, Emotional Disturbances, Foreign Countries

Gustafsson, Jan-Eric; Undheim, Johan Olav – Journal of Educational Psychology, 1992
The stability of some dimensions of ability between the ages of 12 and 15 years was investigated for 225 boys and 242 girls in Sweden. Testing in grades 6, 8, and 9 indicated high stability for the general intelligence factor and for the residual of the General Visual factor. (SLD)
Descriptors: Ability, Adolescents, Age Differences, Comparative Testing
Holburn, P. T. – 1992
Research is reported on four tests commonly used in South Africa to select apprentices, the Intermediate Mental Alertness Test, the High Level Figure Classification Test, the Blox Test, and the Mechanical Comprehension Test. Samples were as follows: (1) 206 Asian, 208 Black, 102 Coloured, and 99 White mostly male applicants for sugar industry…
Descriptors: Adults, Apprenticeships, Blacks, Comparative Testing