Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Comparative Analysis | 10 |
| Test Validity | 10 |
| Inferences | 9 |
| Scores | 7 |
| Foreign Countries | 3 |
| Standardized Tests | 3 |
| Test Reliability | 3 |
| Academic Achievement | 2 |
| Achievement Tests | 2 |
| Control Groups | 2 |
| Cultural Differences | 2 |
Author
| Calvo, Rosa | 1 |
| Castro-Formieles, Josefina | 1 |
| Chambers, Nola | 1 |
| Cheong, Loh Sau | 1 |
| Emick, Jessica | 1 |
| Ercikan, Kadriye | 1 |
| Kaland, Nils | 1 |
| Kopriva, Rebecca J. | 1 |
| LaFlair, Geoffrey T. | 1 |
| Lera-Miguel, Sara | 1 |
| Lázaro, Luisa | 1 |
Publication Type
| Journal Articles | 8 |
| Reports - Research | 7 |
| Reports - Evaluative | 3 |
| Information Analyses | 1 |
Education Level
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Secondary Education | 2 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
Audience
| Teachers | 1 |
Assessments and Surveys
| Communication and Symbolic… | 1 |
| Michigan Test of English… | 1 |
| National Assessment of… | 1 |
| Program for International… | 1 |
| Progress in International… | 1 |
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Lera-Miguel, Sara; Rosa, Mireia; Puig, Olga; Kaland, Nils; Lázaro, Luisa; Castro-Formieles, Josefina; Calvo, Rosa – Journal of Autism and Developmental Disorders, 2016
Most individuals with autism spectrum disorders often fail in tasks of theory of mind (ToM). However, those with normal intellectual functioning known as high functioning ASD (HF-ASD) sometimes succeed in mentalizing inferences. Some tools have been developed to more accurately test their ToM abilities. The aims of this study were to examine the…
Descriptors: Theory of Mind, Autism, Pervasive Developmental Disorders, Severity (of Disability)
Chambers, Nola; Stronach, Sheri T.; Wetherby, Amy M. – International Journal of Language & Communication Disorders, 2016
Background: Substantial development in social communication skills occurs in the first two years of life. Growth should be evident in sharing emotion and eye gaze; rate of communication, communicating for a variety of functions; using gestures, sounds and words; understanding language, and using functional and pretend actions with objects in play.…
Descriptors: Foreign Countries, Interpersonal Communication, Interpersonal Competence, Communication Skills
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Rios, Joseph A.; Sireci, Stephen G. – International Journal of Testing, 2014
The International Test Commission's "Guidelines for Translating and Adapting Tests" (2010) provide important guidance on developing and evaluating tests for use across languages. These guidelines are widely applauded, but the degree to which they are followed in practice is unknown. The objective of this study was to perform a…
Descriptors: Guidelines, Translation, Adaptive Testing, Second Languages
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Wiliam, Dylan – Educational Psychologist, 2010
This article explores the use of standardized tests to hold schools accountable. The history of testing for accountability is reviewed, and it is shown that currently between-school differences account for less than 10% of the variance in student scores, in part because the progress of individuals is small compared to the spread of achievement…
Descriptors: Testing, Standardized Tests, Accountability, Inferences
Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011
This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measure achievement (state tests or an external assessment administered by the…
Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness
Subramaniam, Selva Ranee; Cheong, Loh Sau – Journal of Science and Mathematics Education in Southeast Asia, 2008
This study sought to explore the emotional intelligence of Form One mathematics and science teachers. The emotional intelligence of the teachers was determined using the Emotional Intelligence for Mathematics and Science Teachers (EIMST) survey instrument. It was adapted and adopted from related instruments and then pilot tested for validity and…
Descriptors: Emotional Intelligence, Teaching Methods, Science Teachers, Mathematics Teachers
Kopriva, Rebecca J.; Wiley, David E.; Emick, Jessica – Online Submission, 2007
The goal of the current study was to examine the influence of providing more optimal testing conditions and evaluate the effect this has on the validity of the score inferences across ELL students with different needs, strengths, and levels of language proficiency. It was expected that the validity of the score inferences would be similar for 3rd…
Descriptors: Grade 5, Test Format, Inferences, Test Validity

