Witmer, S. E.; Roschmann, S.; Timmermans, R.; Los, J. – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2022
For test scores to be used to appropriately inform decision-making, and for the associated academic practice opportunities afforded during testing to benefit students, it is critical that students put forth sufficient effort and engagement during testing sessions. Existing research highlights that a substantial proportion of students may disengage…
Descriptors: Learner Engagement, Measurement, Achievement Tests, Access to Education
Miller, Luke C.; Schueler, Beth E. – Grantee Submission, 2022
COVID-19 significantly impacted the educational and home environments for students throughout the Commonwealth of Virginia in ways that may have impacted student learning, as measured by standardized exams. We analyze statewide, student-level administrative and assessment data on the Standards of Learning (SOL) reading and math tests for the…
Descriptors: COVID-19, Pandemics, Public Schools, Measurement
Tannenbaum, Richard J.; Kane, Michael T. – ETS Research Report Series, 2019
Testing programs are often classified as high or low stakes to indicate how stringently they need to be evaluated. However, in practice, this classification falls short. A high-stakes label is taken to imply that all indicators of measurement quality must meet high standards; whereas a low-stakes label is taken to imply the opposite. This approach…
Descriptors: High Stakes Tests, Testing Programs, Measurement, Evaluation Criteria
Raudonyte, Ieva – UNESCO International Institute for Educational Planning, 2021
Although the number of countries conducting large-scale assessments has increased significantly over the past two decades, this has not necessarily led to the effective use of learning assessment data in policy-making and planning. To better understand the reasons for this, the UNESCO International Institute for Educational Planning (IIEP)…
Descriptors: Foreign Countries, Measurement, Data Use, Educational Planning
Raudonyte, Ieva – UNESCO International Institute for Educational Planning, 2021
Large-scale learning assessments can be used to generate performance and contextual data on student learning outcomes. They can be national, regional, or international; school based or household based. The UNESCO International Institute for Educational Planning (IIEP-UNESCO) has conducted a qualitative study to explore both how and why learning…
Descriptors: Foreign Countries, Measurement, Data Use, Educational Planning
Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Bray, Mark; Kobakhidze, Magda Nutsa – Comparative Education Review, 2014
Expanding numbers of researchers are focusing on the scale and impact of private supplementary tutoring. Such tutoring is widely called shadow education, since much of its curriculum mimics that of regular schooling. Although shadow education has expanded significantly worldwide and is now recognized to have far-reaching significance, research…
Descriptors: Tutoring, Private Education, Educational Research, Measurement
Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014
International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…
Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis
Measuring the Continuum of Literacy Skills among Adults: Educational Testing and the LAMP Experience
Guadalupe, Cesar; Cardoso, Manuel – International Review of Education, 2011
The field of educational testing has become increasingly important for providing different stakeholders and decision-makers with information. This paper discusses basic standards for methodological approaches used in measuring literacy skills among adults. The authors address the increasing interest in skills measurement, the discourses on how…
Descriptors: Adult Literacy, Educational Testing, Testing Programs, Standards
Wagner, Daniel A.; Lockheed, Marlaine; Mullis, Ina; Martin, Michael O.; Kanjee, Anil; Gove, Amber; Dowd, Amy Jo – Compare: A Journal of Comparative and International Education, 2012
Over the past decade, international and national education agencies have begun to emphasize the improvement of the quality (rather than quantity) of education in developing countries. This trend has been paralleled by a significant increase in the use of educational assessments as a way to measure gains and losses in quality of learning. As…
Descriptors: Developing Nations, Foreign Countries, Educational Assessment, Reading Tests
Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012
The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…
Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
Huang, Jinyan – TESOL Journal, 2011
Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…
Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs
Pellegrino, James W. – Journal of Research in Science Teaching, 2012
Beginning with a reference to living in a time of both uncertainty and opportunity, this article presents a discussion of key areas where shared understanding is needed if we are to successfully realize the design and use of high quality, valid assessments of science. The key areas discussed are: (1) assessment purpose and use, (2) the nature of…
Descriptors: Science Education, Science and Society, Academic Standards, State Standards