Miller, Luke C.; Schueler, Beth E. – Grantee Submission, 2022
COVID-19 significantly disrupted the educational and home environments of students throughout the Commonwealth of Virginia in ways that may have affected student learning, as measured by standardized exams. We analyze statewide, student-level administrative and assessment data on the Standards of Learning (SOL) reading and math tests for the…
Descriptors: COVID-19, Pandemics, Public Schools, Measurement
Raudonyte, Ieva – UNESCO International Institute for Educational Planning, 2021
Although the number of countries conducting large-scale assessments has increased significantly over the past two decades, this has not necessarily led to the effective use of learning assessment data in policy-making and planning. To better understand the reasons for this, the UNESCO International Institute for Educational Planning (IIEP)…
Descriptors: Foreign Countries, Measurement, Data Use, Educational Planning
Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014
International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…
Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis
Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012
The first large-scale, nationwide academic achievement testing program using the Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…
Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement
Huang, Jinyan – TESOL Journal, 2011
Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…
Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs
Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011
This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…
Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
Haertel, Edward H. – US Department of Education, 2004
Large-scale testing programs often require multiple forms to maintain test security over time or to enable the measurement of change without repeating the identical questions. The comparability of scores across forms is consequential: Students are admitted to colleges based on their test scores, and the meaning of a given scale score one year…
Descriptors: Measurement, Testing Programs, Equated Scores, Test Use
Minnema, Jane E.; Thurlow, Martha L.; Warren, Sandra Hopfengardner – National Center on Educational Outcomes, 2004
This report is an accounting of a second case study of large-scale assessment practices in a local educational agency where students with disabilities were administered the state's standards-based tests out of level. The first report (Minnema et al., 2004a) provided the results from the first case study conducted in another school district in…
Descriptors: Achievement Tests, Instructional Program Divisions, Student Attitudes, Measurement
Minnema, Jane E.; Thurlow, Martha L.; Van Getson, Gretchen R. – National Center on Educational Outcomes, 2004
Proponents of out-of-level testing generally contend that there are three benefits for students with disabilities: (1) undue test frustration is avoided; (2) test measurement accuracy is improved; and (3) test items are better matched to students' current educational goals and instructional level (Thurlow, Elliott, & Ysseldyke, 1999). It is…
Descriptors: Student Evaluation, Instructional Program Divisions, Teacher Attitudes, Measurement

