Szarko, Julia E.; Brown, Alec J.; Watkins, Marley W. – Journal of Applied School Psychology, 2013
The authors examined the difference in standardized test performance when familiar versus unfamiliar examiners tested 26 preschool and elementary-aged children with autism. The children were matched by age, severity, and developmental level and then randomly placed into familiar and unfamiliar examiner groups. Familiarity with the examiner was…
Descriptors: Familiarity, Standardized Tests, Autism, Examiners
Doebler, Anna – Applied Psychological Measurement, 2012
It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Hahn, Jinsoo; Jang, Kyungho – Journal of Economic Education, 2012
International comparisons of economic understanding generally require a translation of a standardized test written in English into another language. Test results can differ based on how researchers translate the English written exam into one in their own language. To confirm this hypothesis, two differently translated versions of the "Basic…
Descriptors: Test Bias, Economics, Standardized Tests, Translation
Birnholz, Justin L.; Young, Michael A. – Assessment, 2012
This study assessed whether the Center for Epidemiological Studies Depression Scale (CES-D) functions equivalently in assessing depressive symptom severity in lesbian, bisexual, and heterosexual women. Using differential item functioning methods, the authors examined (a) whether there is a bias in CES-D total scores and in individual item scores…
Descriptors: Test Bias, Measures (Individuals), Depression (Psychology), Severity (of Disability)
Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016
Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…
Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items
Pedrosa, Ignacio; Suárez-Álvarez, Javier; Lozano, Luis M.; Muñiz, José; García-Cueto, Eduardo – Journal of Psychoeducational Assessment, 2014
Adolescence is a critical period of life during which significant psychosocial adjustment occurs and in which emotional intelligence plays an essential role. This article provides validity evidence for the Trait Meta-Mood Scale-24 (TMMS-24) scores based on an item response theory (IRT) approach. A sample of 2,693 Spanish adolescents (M = 16.52…
Descriptors: Foreign Countries, Adolescents, Secondary School Students, Emotional Intelligence
Zoanetti, Nathan; Les, Magdalena; Leigh-Lancaster, David – Mathematics Education Research Group of Australasia, 2014
From 2011-2013 the VCAA conducted a trial aligning the use of computers in curriculum, pedagogy and assessment culminating in a group of 62 volunteer students sitting their end of Year 12 technology-active Mathematical Methods (CAS) Examination 2 as a computer-based examination. This paper reports on statistical modelling undertaken to compare the…
Descriptors: Computer Assisted Testing, Comparative Analysis, Mathematical Concepts, Mathematics Tests
Hudley, Anne H. Charity; Mallinson, Christine – Cultural Studies of Science Education, 2017
Professional development on issues of language and culture is often separate from professional development on issues related to STEM education, resulting in linguistic and cultural gaps in K-12 STEM pedagogy and practice. To address this issue, we have designed a model of professional development in which we work with educators to build cultural…
Descriptors: Faculty Development, STEM Education, Workshops, Elementary School Teachers
Marcenaro-Gutierrez, Oscar; Vignoles, Anna – Educational Research, 2015
Background: Education systems rely on both teacher and test-based assessments. Where these assessments are used for summative purposes particularly, it is important to understand why, and for which groups of students, teachers' assessments may produce different results from test-based assessments. Purpose: This paper assesses whether the…
Descriptors: Foreign Countries, Elementary School Students, Secondary School Students, Student Evaluation
Bulut, Okan; Palma, Jose; Rodriguez, Michael C.; Stanke, Luke – SAGE Open, 2015
Noncognitive characteristics are gaining importance in addressing the persistent challenges facing youth in diverse settings. Measurement invariance of two youth developmental assets, Support and Positive Identity, is evaluated across grade levels and English language learner (ELL) subgroups of Latino students in 6th through 12th grade.…
Descriptors: Evaluation Methods, Grade 6, Grade 7, Grade 8
Taylor, Joseph; Kowalski, Susan; Wilson, Christopher; Getty, Stephen; Carlson, Janet – Journal of Research in Science Teaching, 2013
This paper focuses on the trade-offs that lie at the intersection of methodological requirements for causal effect studies and policies that affect how and to what extent schools engage in such studies. More specifically, current federal funding priorities encourage large-scale randomized studies of interventions in authentic settings. At the same…
Descriptors: Science Instruction, Research Methodology, Causal Models, Influences
Frank, Kenneth A.; Maroulis, Spiro J.; Duong, Minh Q.; Kelcey, Benjamin M. – Educational Evaluation and Policy Analysis, 2013
We contribute to debate about causal inferences in educational research in two ways. First, we quantify how much bias there must be in an estimate to invalidate an inference. Second, we utilize Rubin's causal model to interpret the bias necessary to invalidate an inference in terms of sample replacement. We apply our analysis to an inference…
Descriptors: Causal Models, Inferences, Research Methodology, Robustness (Statistics)
Yin, Liqun – ProQuest LLC, 2013
In recent years, many states have adopted Item Response Theory (IRT) based vertically scaled tests due to their compelling features in a growth-based accountability context. However, selection of a practical and effective calibration/scaling method and proper understanding of issues with possible multidimensionality in the test data is critical to…
Descriptors: Item Response Theory, Scaling, Robustness (Statistics), Monte Carlo Methods
Bashkov, Bozhidar M.; Finney, Sara J. – Measurement and Evaluation in Counseling and Development, 2013
Traditional methods of assessing construct stability are reviewed and longitudinal mean and covariance structures (LMACS) analysis, a modern approach, is didactically illustrated using psychological entitlement data. Measurement invariance and latent variable stability results are interpreted, emphasizing substantive implications for educators and…
Descriptors: Statistical Analysis, Longitudinal Studies, Reliability, Psychological Patterns

