College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015
The main problem of the educational evaluation validity is that it just copies the conceptual framework system of validity from educational measurement to its own conceptual system. The validity conceptual system that fits the need of theory and practice of educational evaluation has not been established yet. According to the inherent attributive…
Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship
Stoynoff, Stephen – Language Teaching, 2012
In a recent state-of-the-art (SoA) article (Stoynoff 2009), I reviewed some of the trends in language assessment research and considered them in light of validation activities associated with four widely used international measures of L2 English ability. This Thinking Allowed article presents an opportunity to revisit the four broad areas of L2…
Descriptors: English (Second Language), Language Tests, Evaluation Research, Test Validity
Karami, Hossein – Educational Research and Evaluation, 2013
The search for fairness in language testing is distinct from other areas of educational measurement as the object of measurement, that is, language, is part of the identity of the test takers. So, a host of issues enter the scene when one starts to reflect on how to assess people's language abilities. As the quest for fairness in language testing…
Descriptors: Language Skills, Language Tests, Testing, Culture Fair Tests
Gersten, Russell M.; Clarke, Ben; Jordan, Nancy C.; Newman-Gonchar, Rebecca; Haymond, Kelly; Wilkins, Chuck – Grantee Submission, 2012
This article describes key findings from contemporary research on screening for early primary grade students in the area of mathematics. Existing studies were used to illustrate the constructs most worth measuring and the diverse strategies that researchers used to study potential measures. The authors discussed the strengths and weaknesses of…
Descriptors: Primary Education, Screening Tests, Predictive Validity, Correlation
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Nulty, Duncan D. – Assessment & Evaluation in Higher Education, 2011
This paper reviews the literature about peer and self-assessment in university courses from the point of view of their use, and the suitability of their use, in the first year of university study. The paper is divided into three parts. The first part argues that although first-year students are involved in many of the studies that report on the…
Descriptors: College Freshmen, Program Effectiveness, Literature Reviews, Self Evaluation (Individuals)
Tanaka, Koji – Educational Studies in Japan: International Yearbook, 2009
The recent "Nationwide academic achievement and study situation survey" was clearly influenced by the idea of "authentic assessment", an educational assessment perspective focused on "quality" and "engagement". However, when "performance assessment", the assessment method corresponding to this…
Descriptors: Educational Assessment, Performance Based Assessment, Academic Achievement, Educational Research
Young, John W. – Educational Assessment, 2009
In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…
Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education
Educational Testing Service, 2010
This document describes the breadth of the research that the ETS (Educational Testing Service) Research & Development division is conducting in 2010. This portfolio will be updated in early 2011 to reflect changes to existing projects and new projects that were added after this document was completed. The research described in this portfolio falls…
Descriptors: Portfolios (Background Materials), Testing Programs, Educational Testing, Private Agencies
Koretz, Daniel M.; Hamilton, Laura S. – 2003
Previous studies of the validity of gains on high-stakes tests have compared trends in scores on a high-stakes test to trends on a lower-stakes test, such as NAEP. However, generalizability of gains is likely to be incomplete even when gains are meaningful because of differences in the inferences the two tests are designed to support. Therefore,…
Descriptors: Educational Assessment, Evaluation Research, High Stakes Tests, Test Validity
Educational Testing Service, 2008
This document describes the breadth of the research being conducted in 2008 by the Research and Development Division at Educational Testing Service (ETS). The research described falls into three large categories: (1) Research supported by the ETS research allocation; (2) Research funded by testing programs at ETS; and (3) Research funded by…
Descriptors: Research and Development, Testing Programs, Educational Testing, Educational Research
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Lin, Jie – Alberta Journal of Educational Research, 2006
The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…
Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research