NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 75 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Andres De Los Reyes; Mo Wang; Matthew D. Lerner; Bridget A. Makol; Olivia M. Fitzpatrick; John R. Weisz – Grantee Submission, 2022
Researchers strategically assess youth mental health by soliciting reports from multiple informants. Typically, these informants (e.g., parents, teachers, youth themselves) vary in the social contexts where they observe youth. Decades of research reveal that the most common data conditions produced with this approach consist of discrepancies…
Descriptors: Mental Health, Measurement Techniques, Evaluation Methods, Research
Peer reviewed Peer reviewed
Direct linkDirect link
Koretz, Daniel – Assessment in Education: Principles, Policy & Practice, 2016
Daniel Koretz is the Henry Lee Shattuck Professor of Education at the Harvard Graduate School of Education. His research focuses on educational assessment and policy, particularly the effects of high-stakes testing on educational practice and the validity of score gains. He is the author of "Measuring Up: What Educational Testing Really Tells…
Descriptors: Test Validity, Definitions, Evidence, Relevance (Education)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Parmin; Erna Noor Savitri; Yahya Nur Ifriza – South African Journal of Education, 2025
With the research reported here, we specifically aim to develop application products of scientific work independence instruments through science integrated learning (SIL) for various education levels (elementary schools, junior high schools, senior high schools, and universities). The SIL model was applied in learning to determine specific…
Descriptors: Elementary School Students, Secondary School Students, College Students, Student Development
Peer reviewed Peer reviewed
Direct linkDirect link
Cillessen, Antonius H. N.; Marks, Peter E. L. – New Directions for Child and Adolescent Development, 2017
Although peer nomination measures have been used by researchers for nearly a century, common methodological practices and rules of thumb (e.g., which variables to measure; use of limited vs. unlimited nomination methods) have continued to develop in recent decades. At the same time, other key aspects of the basic nomination procedure (e.g.,…
Descriptors: Peer Relationship, Research Methodology, Decision Making, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Huber, Stephan Gerhard; Helm, Christoph – Educational Assessment, Evaluation and Accountability, 2020
The crisis caused by the COVID-19 virus has far-reaching effects in the field of education, as schools were closed in March 2020 in many countries around the world. In this article, we present and discuss the School Barometer, a fast survey (in terms of reaction time, time to answer and dissemination time) that was conducted in Germany, Austria…
Descriptors: Disease Control, School Closing, Educational Policy, Surveys
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Hall, S. S.; Hammond, J. L.; Hirt, M.; Reiss, A. L. – Journal of Intellectual Disability Research, 2012
Background: Clinical trials of medications to alleviate the cognitive and behavioural symptoms of individuals with fragile X syndrome (FXS) are now underway. However, there are few reliable, valid and/or sensitive outcome measures available that can be directly administered to individuals with FXS. The majority of assessments employed in clinical…
Descriptors: Outcome Measures, Test Validity, Feedback (Response), Reinforcement
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Kittleson, Howard M.; Roscoe, John T. – 1972
This study compares the relative power and robustness of the chi-square and Kolmogorov statistics with both the linear score scale and equal areas models. It is limited to the situation in which the mean and standard deviation are fixed by the hypothesis (a necessary constraint with the Kolmogorov tests). Two tables are presented which report the…
Descriptors: Comparative Testing, Goodness of Fit, Hypothesis Testing, Measurement Techniques
Green, Donald Ross; Draper, John F. – 1972
This paper considers the question of bias in group administered academic achievement tests, bias which is inherent in the instruments themselves. A body of data on the test of performance of three disadvantaged minority groups--northern, urban black; southern, rural black; and, southwestern, Mexican-Americans--as tryout samples in contrast to…
Descriptors: Achievement Tests, Bias, Comparative Testing, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Mueller, Karsten; Liebig, Christian; Hattrup, Keith – Educational and Psychological Measurement, 2007
Two quasi-experimental field studies were conducted to evaluate the psychometric equivalence of computerized and paper-and-pencil job satisfaction measures. The present research extends previous work in the area by providing better control of common threats to validity in quasi-experimental research on test mode effects and by evaluating a more…
Descriptors: Psychometrics, Field Studies, Job Satisfaction, Computer Assisted Testing
Peer reviewed Peer reviewed
Nevid, Jeffrey S. – Journal of Consulting and Clinical Psychology, 1983
Responds to an article questioning the construct validity of the Beck Hopelessness Scale. Suggests that social desirability should not be invoked as a potential confound unless the obtained covariation is theoretically inconsistent or is so overlapping as to make the respective scales redundant with respect to factorial content. (Author/RC)
Descriptors: Opinions, Psychological Testing, Research Methodology, Social Influences
Peer reviewed Peer reviewed
Linehan, Marsha M.; Nielsen, Stevan L. – Journal of Consulting and Clinical Psychology, 1983
States that the correlation between social desirability (SD) and hopelessness is open to several interpretations, but, in the absence of further data, concern for false-negative rates dictates caution in interpreting hopelessness scores when assessing suicidal behavior. Presents data on the relationship of SD, hopelessness, and prediction of…
Descriptors: Opinions, Predictive Measurement, Psychological Testing, Research Methodology
Ma, Lingling; Cronin, John – Northwest Evaluation Association, 2009
Virtual Comparison Groups (VCG) were developed by the Northwest Evaluation Association as an alternative to conventional controlled experiments for social science researchers working in the field of education. The VCG is generally a group of up to 51 students who are matched, based on key characteristics of the student and school, to a single…
Descriptors: Social Science Research, Comparative Analysis, Sampling, Student Characteristics
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5