Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Lombardi, Allison; Seburn, Mary; Conley, David; Snow, Eric – Online Submission, 2010
In alignment studies, expert raters evaluate assessment items against standards and ratings are used to compute various alignment indices. Questions about rater reliability, however, are often ignored or inadequately addressed. This paper reports the results of a generalizability theory study of cognitive demand and rigor ratings of assessment…
Descriptors: Generalizability Theory, Test Items, College Entrance Examinations, Readiness
End, Christian M.; Worthman, Shaye; Mathews, Mary Bridget; Wetterau, Katharina – Teaching of Psychology, 2010
College students participated in a study on the "psychology of note taking" during which they took notes on video content and later completed a multiple-choice test on the material. Researchers assigned 71 participants to either the ringing condition (the video was disrupted by a ringing cell phone) or the control condition (no cell phone rings…
Descriptors: Control Groups, Test Items, Multiple Choice Tests, Telecommunications
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Krebs, Saskia S.; Roebers, Claudia M. – British Journal of Educational Psychology, 2010
Background: From the perspective of self-regulated learning, the interplay between learners' individual characteristics and the context of testing has been emphasized for assessing learning outcomes. Aims: The present study examined metacognitive processes in children's test-taking behaviour and explored their impacts on performance. Further, it…
Descriptors: Control Groups, Cloze Procedure, Individual Characteristics, Metacognition
Roszkowski, Michael J.; Soven, Margot – Assessment & Evaluation in Higher Education, 2010
A questionnaire used in student evaluations of interdisciplinary courses during six semesters contained two Likert items stated in a direct negative mode which were embedded in a questionnaire (14-18 items) in which the remaining items were phrased in a direct positive mode. In the seventh semester and thereafter, the two negative items were…
Descriptors: Questionnaires, Student Evaluation, Likert Scales, Test Construction
Kim, Sangwon; Kim, Seock-Ho; Kamphaus, Randy W. – School Psychology Quarterly, 2010
Gender differences in aggression have typically been based on studies utilizing a mean difference method. From a measurement perspective, this method is inherently problematic unless an aggression measure possesses comparable validity across gender. Stated differently, establishing measurement invariance on the measure of aggression is…
Descriptors: Test Items, Females, Factor Analysis, Inferences
Veldkamp, Bernard P.; van der Linden, Wim J. – International Journal of Testing, 2008
In most operational computerized adaptive testing (CAT) programs, the Sympson-Hetter (SH) method is used to control the exposure of the items. Several modifications and improvements of the original method have been proposed. The Stocking and Lewis (1998) version of the method uses a multinomial experiment to select items. For severely constrained…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Methods
Gierl, Mark J.; Zhou, Jiawen; Alves, Cecila – Journal of Technology, Learning, and Assessment, 2008
An item model serves as an explicit representation of the variables in an assessment task. An item model includes the "stem", "options", and "auxiliary information". The "stem" is the part of an item which formulates context, content, and/or the question the examinee is required to answer. The "options" contain the alternative answers with one…
Descriptors: Classification, Test Items, Models, Test Construction
Joordens, Steve; Ozubko, Jason D.; Niewiadomski, Marty W. – Journal of Memory and Language, 2008
In his analysis of the pseudoword effect, [Greene, R.L. (2004). Recognition memory for pseudowords. "Journal of Memory and Language," 50, 259-267.] suggests nonwords can feel more familiar than words in a recognition context if the orthographic features of the nonword match well with the features of the items presented at study. One possible…
Descriptors: Test Items, Familiarity, Recognition (Psychology), Experimental Psychology
Cawthon, Stephanie W.; Winton, Samantha M.; Garberoglio, Carrie Lou; Gobble, Mark E. – Journal of Deaf Studies and Deaf Education, 2011
Students who are deaf or hard of hearing (SDHH) often need accommodations to participate in large-scale standardized assessments. One way to bridge the gap between the language of the test (English) and a student's linguistic background (often including American Sign Language [ASL]) is to present test items in ASL. The specific aim of this project…
Descriptors: Test Items, Partial Hearing, Deafness, Standardized Tests
Sedki, S. Sam – Journal of International Education Research, 2011
Most professors use examinations as an important assessment tool to aid in determining the level of student subject matter comprehension. We also use the feedback from examinations as an indicator of the appropriateness and effectiveness of the teaching methodologies we are utilizing in the classroom. This paper is a follow-up to a 2006-2007 study…
Descriptors: Tests, Comparative Analysis, Teaching Methods, Comparative Education
Vannest, Kimberly J.; Parker, Richard; Dyer, Nicole – Journal of Special Education, 2011
This article presents procedures and results from a 2-year project developing science key vocabulary (KV) short tests suitable for progress monitoring Grade 5 science in Texas public schools using computer-generated, -administered, and -scored assessments. KV items included KV definitions and important usages in a multiple-choice cloze format. A…
Descriptors: Grade 5, Low Achievement, Vocabulary, Science Tests
Singh, Delar K. – Online Submission, 2009
This survey explores the post-graduation outcomes of university students with disabilities. It gathers data on their employment, independent living, community participation/social integration, and supports received by adult disability agencies. It also captures their perceptions about their quality of life. (Contains 1 figure.) [This survey tool…
Descriptors: Quality of Life, Disabilities, Graduate Surveys, Followup Studies
MacInnes, Jann Marie Wise – ProQuest LLC, 2009
Multilevel data often exist in educational studies. The focus of this study is to consider differential item functioning (DIF) for dichotomous items from a multilevel perspective. One of the most often used methods for detecting DIF in dichotomously scored items is the Mantel-Haenszel log odds-ratio. However, the Mantel-Haenszel reduces the…
Descriptors: Test Bias, Simulation, Item Response Theory, Test Items
Gurtman, Michael B.; Lee, Debbiesiu L. – Psychological Assessment, 2009
The structure and magnitude of sex differences in interpersonal problems across several data sets were examined, guided by the interpersonal circumplex model and the structural summary method. Data were self-reported interpersonal difficulties, assessed with the 64-item version of the Inventory of Interpersonal Problems (IIP; L. M. Horowitz, S. E.…
Descriptors: Effect Size, Gender Differences, Interpersonal Relationship, Individual Characteristics

