Publication Date
| In 2026 | 0 |
| Since 2025 | 200 |
| Since 2022 (last 5 years) | 1070 |
| Since 2017 (last 10 years) | 2580 |
| Since 2007 (last 20 years) | 4941 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Buick, J. M. – European Journal of Physics Education, 2011
Aspects of assessment in physics are considered with the aim of designing assessments that will encourage a deep approach to student learning and will ultimately lead to higher levels of achievement. A range of physics questions are considered and categorized by the level of knowledge and understanding which is require for a successful answer.…
Descriptors: Physics, Taxonomy, Science Achievement, Knowledge Level
van Hartingsveldt, Margo J.; de Groot, Imelda J. M.; Aarts, Pauline B. M.; Nijhuis-van der Sanden, Maria W. G. – Developmental Medicine & Child Neurology, 2011
Aim: To establish if there are psychometrically sound standardized tests or test items to assess handwriting readiness in 5- and 6-year-old children on the levels of occupations activities/tasks and performance. Method: Electronic databases were searched to identify measurement instruments. Tests were included in a systematic review if: (1)…
Descriptors: Writing Readiness, Test Items, Handwriting, Standardized Tests
Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011
Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…
Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)
Miller, Michael B.; Guerin, Scott A.; Wolford, George L. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2011
The false memory effect produced by the Deese/Roediger & McDermott (DRM) paradigm is reportedly impervious to warnings to avoid false alarming to the critical lures (D. A. Gallo, H. L. Roediger III, & K. B. McDermott, 2001). This finding has been used as strong evidence against models that attribute the false alarms to a decision…
Descriptors: Models, Memory, Recognition (Psychology), Test Items
Charlton, Shawn R.; Gossett, Bradley D.; Charlton, Veda A. – Psychological Record, 2011
Temporal discounting, the loss in perceived value associated with delayed outcomes, correlates with a number of personality measures, suggesting that an item-level analysis of trait measures might provide a more detailed understanding of discounting. The current report details two studies that investigate the utility of such an item-level…
Descriptors: Personality Measures, Test Items, Item Analysis, Delay of Gratification
Fukuhara, Hirotaka; Kamata, Akihito – Applied Psychological Measurement, 2011
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
Descriptors: Item Response Theory, Test Bias, Test Items, Bayesian Statistics
Oliveri, Maria E.; Ercikan, Kadriye – Applied Measurement in Education, 2011
In this study, we examine the degree of construct comparability and possible sources of incomparability of the English and French versions of the Programme for International Student Assessment (PISA) 2003 problem-solving measure administered in Canada. Several approaches were used to examine construct comparability at the test- (examination of…
Descriptors: Foreign Countries, English, French, Tests
Jones, Andrew T. – Applied Psychological Measurement, 2011
Practitioners often depend on item analysis to select items for exam forms and have a variety of options available to them. These include the point-biserial correlation, the agreement statistic, the B index, and the phi coefficient. Although research has demonstrated that these statistics can be useful for item selection, no research as of yet has…
Descriptors: Test Items, Item Analysis, Cutting Scores, Statistics
Liu, Jinghua; Sinharay, Sandip; Holland, Paul; Feigenbaum, Miriam; Curley, Edward – Educational and Psychological Measurement, 2011
Two different types of anchors are investigated in this study: a mini-version anchor and an anchor that has a less spread of difficulty than the tests to be equated. The latter is referred to as a midi anchor. The impact of these two different types of anchors on observed score equating are evaluated and compared with respect to systematic error…
Descriptors: Equated Scores, Test Items, Difficulty Level, Statistical Bias
Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014
Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…
Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)
Kortemeyer, Gerd – Physical Review Special Topics - Physics Education Research, 2014
Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for…
Descriptors: Item Response Theory, Online Courses, Electronic Learning, Homework
Cress, Cynthia J.; Lambert, Matthew C.; Epstein, Michael H. – Journal of Early Intervention, 2014
The Preschool Behavioral and Emotional Rating Scale (PreBERS) is an assessment of emotional and behavioral strengths in preschoolers with well-established reliability and validity for educational and clinical application in children with and without disabilities. The present study provides further evidence of psychometric rigor for items and…
Descriptors: Preschool Children, Rating Scales, Child Behavior, Behavior Problems
Thompson, James R.; Wehmeyer, Michael L.; Hughes, Carolyn; Shogren, Karrie A.; Palmer, Susan B.; See, Hyojeong – Grantee Submission, 2014
This article introduces the Supports Intensity Scale-Children's Version (SIS-C) designed and normed to be used with children across multiple contexts, including home, school, and community life. Steps taken to develop the scale are described, and findings from data collected on a field test version of the SIS-C are shared. Preliminary findings in…
Descriptors: Test Validity, Test Reliability, Children, Test Construction
Deplazes, Svetlana P. – ProQuest LLC, 2014
The purpose of this study was to examine the overall level of student achievement on the 2012 Kansas History-Government Assessment in Grades 6, 8, and high school, with major emphasis on the subject area of economics. It explored four specific research questions in order to: (1) determine the level of student knowledge of assessed economic…
Descriptors: Economics, Social Studies, Comparative Analysis, Academic Achievement
Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013
Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)

Peer reviewed
Direct link
