Publication Date
In 2025 | 3 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 37 |
Since 2006 (last 20 years) | 97 |
Descriptor
Source
Author
Herman, Joan L. | 4 |
Harlen, Wynne | 3 |
Brookhart, Susan M. | 2 |
Cooper, Melanie M. | 2 |
Darling-Hammond, Linda | 2 |
Dietel, Ronald | 2 |
Ediger, Marlow | 2 |
Goldschmidt, Pete | 2 |
Heritage, Margaret | 2 |
Kenyon, Dorry | 2 |
Knight, Peter T. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 13 |
Teachers | 11 |
Practitioners | 10 |
Administrators | 4 |
Policymakers | 3 |
Counselors | 1 |
Location
United Kingdom | 10 |
United Kingdom (England) | 6 |
Australia | 4 |
Vermont | 4 |
Florida | 3 |
New York | 3 |
California | 2 |
Canada | 2 |
Connecticut | 2 |
Massachusetts | 2 |
New Hampshire | 2 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 6 |
No Child Left Behind Act 2001 | 6 |
Elementary and Secondary… | 2 |
Education of the Handicapped… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Scott J. Peters; Matthew C. Makel; Lindsay Ellis Lee; Tamra Stambaugh; Matthew T. McBee; D. Betsy McCoach; Kiana R. Johnson – Gifted Child Today, 2024
Universal screening is one of the most-common topics and well-accepted best practices within the field of gifted and talented education. There appears to be little disagreement that universally screening all students as part of a gifted and talented identification process results in fewer missed students. But surprisingly, there is little guidance…
Descriptors: Academically Gifted, Talent Identification, Screening Tests, Test Validity
Emily L. Coderre – College Teaching, 2024
Psychometrics is the field of designing tests and assessments to measure certain psychological concepts. It is chiefly concerned with two fundamental properties: reliability and validity. These properties are often influenced by confounding variables: other things that can influence performance but are not what you are trying to measure. Here, I…
Descriptors: Teaching Methods, Psychometrics, Test Construction, Test Reliability
Anne Wicks; Robin Berkley – George W. Bush Institute, 2025
Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…
Descriptors: Student Evaluation, Testing, Tests, Standardized Tests
Williamson, Joanna; Child, Simon – Journal of Vocational Education and Training, 2022
School- and college-based vocational and technical qualifications (VTQs) in England are required to award successful candidates a grade rather than simple pass or fail. Ensuring the reliability and validity of these grades is considered vital, particularly in light of the high-stakes purposes for which school assessment results in England are…
Descriptors: Foreign Countries, Vocational Education, Qualifications, Student Evaluation
Devender R. Banda; Stephanie L. Hart – Education and Training in Autism and Developmental Disabilities, 2025
The BCBA Task List (5th and 6th Edition) requires behavior analysts are familiar with multiple assessment methods, including tools to assess the function of problem behaviors and assess skill strengths and deficits. Although several studies exist on the assessment of functions using indirect functional assessments, descriptive functional…
Descriptors: Applied Behavior Analysis, Behavior Problems, Capital (Sociology), Student Characteristics
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Dolin, Jens; Black, Paul; Harlen, Wynne; Tiberghien, Andrée – Contributions from Science Education Research, 2018
This chapter characterises the two key purposes of assessment, formative and summative, within a general model of assessment of student learning. It discusses reliability and validity issues in relation to the two purposes and considers formative and summative purposes as related and can be brought together in developing a dependable approach to…
Descriptors: Formative Evaluation, Summative Evaluation, Student Evaluation, Accountability
Flanagan, Agnes; Cormier, Damien C. – Communique, 2019
One of the areas subsumed under the data-based decision making and accountability practice identified in the National Association of School Psychologists' (NASP) "Model for Integrated School Psychological Services" is to collect information on psychological and educational variables to make decisions at a number of levels of service…
Descriptors: Test Bias, School Psychologists, Measurement, Data Collection
Stephenson, Norda S.; Duffy, Erin M.; Day, Elizabeth L.; Padilla, Kira; Herrington, Deborah G.; Cooper, Melanie M.; Carmel, Justin H. – Journal of Chemical Education, 2020
The development of proficiency in the practices used by scientists and engineers is considered an important student outcome of laboratory instruction. We developed tasks to assess students' use and development of selected scientific and engineering practices in the general chemistry laboratory using an adapted evidence-centered design approach. In…
Descriptors: Test Construction, Test Validity, Chemistry, Science Process Skills
Shaw, Stuart; Nisbet, Isabel – Research Matters, 2021
Was the approach proposed for calculating exam grades in summer 2020 fair? Were the grades eventually awarded (after policy changes) fair? What is a fair arrangement for 2021? These questions have been at the heart of debate in the UK in the light of COVID-19. After schools were closed in the spring of 2020 and the decision was made not to proceed…
Descriptors: COVID-19, Pandemics, Student Evaluation, Evaluation Methods
Center on Standards and Assessments Implementation, 2018
Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…
Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias
Grapin, Sally L.; Benson, Nicholas F. – Contemporary School Psychology, 2019
The Every Student Succeeds Act (ESSA) aims to ensure that all students are college- and career-ready by requiring all schools to implement high-quality accountability systems and services for students. The ESSA impacts assessment practices in schools by requiring staff to account for a broader range of variables related to student well-being,…
Descriptors: Educational Legislation, Federal Legislation, Elementary Secondary Education, Evaluation Methods
Fitzgerald, Jill; Shanahan, Timothy E. – International Literacy Association, 2020
Reading scores exist for a continuum of purposes, from informal assessment to formal standardized tests. This brief aims to answer the question: What matters most for elementary-grade teachers when thinking about reading scores, and what could policymakers do to help teachers? Three positions worth pursuing in this regard are shared: (1) every…
Descriptors: Reading Achievement, Scores, Elementary School Students, Elementary School Teachers
Regional Educational Laboratory Central, 2021
This Study Snapshot highlights key findings from a larger study examining the validity and reliability of the Kansas Clinical Assessment Tool (K-CAT), a newly developed tool for assessing the performance of teacher candidates. The study team used interviews with cooperating teachers, content experts' ratings of the alignment of the K-CAT to…
Descriptors: Test Validity, Test Reliability, Evaluation Methods, Preservice Teachers