NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 27 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017
We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…
Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction
Peer reviewed Peer reviewed
Direct linkDirect link
Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016
As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…
Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yavuz, Aysun – International Education Studies, 2012
In this paper, the writer discusses the philosophical underpinnings of the two dominant research methods in social sciences; quantitative and qualitative paradigms. The natures of two paradigms are quite different so this leads many researchers to discuss these issues in a comparative way. This paper tackles the knowledge and understanding of…
Descriptors: Teacher Educators, Social Science Research, Research Methodology, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy
Kowal, Julie; Hassel, Emily Ayscue – Public Impact, 2010
For too long, performance measurement systems in education have failed to document and recognize real differences among educators. But a recent national push to use performance evaluations for critical personnel decisions has highlighted the shortcomings of the current systems and increased the urgency to dramatically improve them. As state and…
Descriptors: Teaching (Occupation), Teacher Evaluation, Performance Based Assessment, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Kucuk, Funda; Walters, JoDee – ELT Journal, 2009
This article reports on a study of the validity and reliability of tests administered in an EFL university setting. The study addresses the question of how well face validity reflects more objective measures of the quality of a test, such as predictive validity and reliability. According to some researchers, face validity, defined as the surface…
Descriptors: Language Tests, Test Validity, Achievement Tests, English (Second Language)
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Coscarelli, William; Shrock, Sharon – Performance Improvement Quarterly, 2002
Discusses problems in using traditional measures of reliability for criterion-referenced tests (CRTs) and describes two approaches to reliability for CRTs: estimates sensitive to all measures of error; and estimates of consistency in test outcome. Compares the two approaches and proposes recommendations for interpretation and use. (Author/LRW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Measurement Techniques, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Bauer, Christopher F. – Journal of Chemical Education, 2008
The development of a 20-item semantic differential assessment instrument for measuring student attitudes toward the subject of chemistry is described (Attitude toward the Subject of Chemistry Inventory-ASCI). Instrument subscales and survey items pertain to interest and utility, anxiety, intellectual accessibility, emotional satisfaction, and…
Descriptors: Majors (Students), Student Attitudes, Semantics, Chemistry
Michaelides, Michalis P.; Haertel, Edward H. – Center for Research on Evaluation Standards and Student Testing CRESST, 2004
There is variability in the estimation of an equating transformation because common-item parameters are obtained from responses of samples of examinees. The most commonly used standard error of equating quantifies this source of sampling error, which decreases as the sample size of examinees used to derive the transformation increases. In a…
Descriptors: Test Items, Testing, Error Patterns, Interrater Reliability
Peer reviewed Peer reviewed
Fleenor, John – Measurement and Evaluation in Counseling and Development, 1986
Reviews the Sixteen Personality Factor Questionnaire and the Personal Career Development Profile as tools for vocational exploration and career development. Reliability and validity problems are reported, followed by a recommendation to use the Strong-Campbell Interest Inventory instead. (ABB)
Descriptors: Career Counseling, Comparative Analysis, Interest Inventories, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Howell, Scott L. – New Directions for Teaching and Learning, 2004
Although instructional methods are moving in ever greater number to a multimedia base, testing is not. What principles should be considered in correcting this misalignment?
Descriptors: Multimedia Instruction, Teaching Methods, Test Validity, Test Reliability
Peer reviewed Peer reviewed
Arnold, Karl-Heinz – Zeitschrift fur Padagogik, 2001
Demonstrates that a high degree of fairness may be achieved in international comparative research on school achievement, using the Third International Mathematics and Science Study (TIMSS) as an example and employing the methods of advanced pedagogical-psychological diagnosis. Includes references. (CMK)
Descriptors: Comparative Analysis, Educational Quality, Educational Research, Elementary Secondary Education
Previous Page | Next Page ยป
Pages: 1  |  2