Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
van der Linden, Wim J.; Veldkamp, Bernard P.; Carlson, James E. – Applied Psychological Measurement, 2004
A popular design in large-scale educational assessments as well as any other type of survey is the balanced incomplete block design. The design is based on an item pool split into a set of blocks of items that are assigned to sets of "assessment booklets." This article shows how the problem of calculating an optimal balanced incomplete block…
Descriptors: Grade 8, National Competency Tests, Item Banks, Research Design
Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004
Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…
Descriptors: Test Length, Test Bias, Simulation, Item Response Theory
He, Q.; Tymms, P. – Journal of Computer Assisted Learning, 2005
Computer-assisted assessment (CAA) has become increasingly important in education in recent years. A variety of computer software systems have been developed to help assess the performance of students at various levels. However, such systems are primarily designed to provide objective assessment of students and analysis of test items, and focus…
Descriptors: Foreign Countries, Computer Assisted Testing, Test Construction, Test Results
McDaniel, Michael A.; Whetzel, Deborah L. – Intelligence, 2005
[Gottfredson, L. S. (2003). Dissecting practical intelligence theory: Its claims and evidence. Intelligence, 31, 343-397.] provided a detailed critique of Sternberg's [Sternberg, R. J., Fotsythe, G. B., Hedlund, J., Horvath, J. A., Wagner, R. K., Williams, W. M., Snook, S. A., Grigorenko, E. L. (2000). Practical intelligence in everyday life. New…
Descriptors: Individual Testing, Test Format, Test Items, Personnel Selection
Newgent, Rebecca A.; Lee, Sang Min; Higgins, Kristin K.; Mulvenon, Sean W.; Connors, Joanie V. – Journal of Educational Research & Policy Studies, 2004
The Revised NEO Personality Inventory (NEO PI-R) was developed to operationalize the Five-Factor Model of Personality. Using correlational analysis and confirmatory and exploratory factor analysis, the present study investigates the facet structure of the domain of Agreeableness of the NEO-PI-R at the facet and item level to assess which is a more…
Descriptors: Personality Traits, Personality, Factor Analysis, Evaluation Research
Loewen, Shawn – Studies in Second Language Acquisition, 2005
Incidental focus on form overtly draws learners' attention to linguistic items as they arise spontaneously--without prior planning--in meaning-focused interaction. This study examined the effectiveness of incidental focus on form in promoting second language (L2) learning. Seventeen hours of naturally occurring, meaning-focused L2 lessons were…
Descriptors: Second Language Learning, Foreign Countries, Test Items, Recall (Psychology)
Osborne, Jason W. – Electronic Journal of Research in Educational Psychology, 2006
Introduction: Claude Steele's stereotype threat hypothesis proposed that negative group stereotypes increase individual anxiety levels, hurting performance. However, the role of anxiety in stereotype threat has not been fully explored. This study examined the hypothesis that experimental manipulation of stereotype threat would influence real-time…
Descriptors: Test Items, Stereotypes, Females, Mathematics Tests
Wu, Margaret – Education Journal, 2003
This article presents key features of the mathematics assessment in OECD's Programme for International Student Assessment (PISA), from the point of view of the design of test items to fit in with the PISA mathematics framework. A brief description is first given to provide some background to the development of the PISA mathematics framework. The…
Descriptors: Mathematics Education, Test Items, Academic Achievement, Foreign Countries
Katayama, Andrew D.; Crooks, Steven M. – Journal of Experimental Education, 2003
The authors investigated in this study the effects of two electronic notes conditions (complete vs. partial) and two testing conditions (immediate vs. delayed) on three types of tests (fact, structure, and application). A 2 x 2 factorial multivariate analysis of variance (MANOVA) yielded no significant main effects for notes conditions on the fact…
Descriptors: Testing, Multivariate Analysis, Graduate Students, Notetaking
Rieck, William A. – Principal Leadership, 2006
Student assessment has long been a major component of the tasks that teachers perform. As such, it is important that school leaders consider teachers' assessment strategies as part of the normal supervisory process. In a political climate ruled by the No Child Left Behind Act, one important consideration is how well teachers' assessments prepare…
Descriptors: Federal Legislation, Test Items, Academic Achievement, Standardized Tests
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2006
A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the two-parameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity and discriminating power of each item. It is…
Descriptors: Test Items, Vocational Aptitude, Reaction Time, Markov Processes
Dudley, Albert – Language Testing, 2006
This study examined the multiple true-false (MTF) test format in second language testing by comparing multiple-choice (MCQ) and multiple true-false (MTF) test formats in two language areas of general English: vocabulary and reading. Two counter-balanced experimental designs--one for each language area--were examined in terms of the number of MCQ…
Descriptors: Second Language Learning, Test Format, Validity, Testing
Baxter, G. P.; Ahmed, S.; Sikali, E.; Waits, T.; Sloan, M.; Salvucci, S. – National Center for Education Statistics, 2007
In 2003, a trial National Assessment of Educational Progress (NAEP) mathematics assessment was administered in Spanish to public school students at grades 4 and 8 in Puerto Rico. Based on preliminary analyses of the 2003 data, changes were made in administration and translation procedures for the 2005 NAEP administration in Puerto Rico. This…
Descriptors: Foreign Countries, Grade 4, Grade 5, Grade 6
Cominole, Melissa; Wheeless, Sara; Dudley, Kristin; Franklin, Jeff; Wine, Jennifer – National Center for Education Statistics, 2007
The "2004/06 Beginning Postsecondary Students Longitudinal Study (BPS:04/06)" is sponsored by the U.S. Department of Education to respond to the need for a national, comprehensive database concerning issues students may face in enrollment, persistence, progress, and attainment in postsecondary education and in consequent early rates of…
Descriptors: Postsecondary Education, Stopouts, Research Methodology, Data Collection
Bennett, Randy Elliot; Rock, Donald A. – 1993
Formulating-Hypotheses (F-H) items present a situation and ask the examinee to generate as many explanations for it as possible. This study examined the generalizability, validity, and examinee perceptions of a computer-delivered version of the task. Eight F-H questions were administered to 192 graduate students. Half of the items restricted…
Descriptors: Computer Assisted Testing, Difficulty Level, Generalizability Theory, Graduate Students

Peer reviewed
Direct link
