Peer reviewed. Chalifour, Clark L.; Powers, Donald E. – Journal of Educational Measurement, 1989
Content characteristics of 1,400 Graduate Record Examination (GRE) analytical reasoning items were coded for item difficulty and discrimination. The results provide content characteristics for consideration in extending specifications for analytical reasoning items and a better understanding of the construct validity of these items. (TJH)
Descriptors: College Entrance Examinations, Construct Validity, Content Analysis, Difficulty Level
Peer reviewed. Ilai, Doron; Willerman, Lee – Intelligence, 1989
Items showing sex differences on the revised Wechsler Adult Intelligence Scale (WAIS-R) were studied. In a sample of 206 young adults (110 males and 96 females), 15 items demonstrated significant sex differences, but there was no relationship of item-specific gender content to sex differences in item performance. (SLD)
Descriptors: Comparative Testing, Females, Intelligence Tests, Item Analysis
Peer reviewed. Ramsay, J. O.; Winsberg, S. – Psychometrika, 1991
A method is presented for estimating the item characteristic curve (ICC) using polynomial regression splines. Estimation of spline ICCs is described by maximizing the marginal likelihood formed by integrating ability over a beta prior distribution. Simulation results compare this approach with the joint estimation of ability and item parameters.…
Descriptors: Ability, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed. Hu, Weiping; Adey, Philip – International Journal of Science Education, 2002
Describes the development of a test of scientific creativity for secondary school students, constructed on the basis of an analysis of the meaning and aspects of scientific creativity. Reports that the scientific creativity of secondary school students increases with age and that science ability is a necessary but not sufficient condition…
Descriptors: Ability, Age, Creativity, Evaluation Methods
Tornroos, Jukka – Studies in Educational Evaluation, 2005
Opportunity to learn is considered an important contributing factor in learning outcomes. In some of the latest international comparative studies of mathematics achievement, such as SIMS and TIMSS, painstaking efforts have been made to find out what the participating students' opportunities to learn mathematics had been. However, there have been…
Descriptors: Textbooks, Mathematics Achievement, Mathematics Instruction, Outcomes of Education
Rupp, Andre A. – International Journal of Testing, 2003
Item response theory (IRT) has become one of the most popular scoring frameworks for measurement data. IRT models are used frequently in computerized adaptive testing, cognitively diagnostic assessment, and test equating. This article reviews two of the most popular software packages for IRT model estimation, BILOG-MG (Zimowski, Muraki, Mislevy, &…
Descriptors: Test Items, Adaptive Testing, Item Response Theory, Computer Software
Ariel, Adelaide; Veldkamp, Bernard P.; van der Linden, Wim J. – Journal of Educational Measurement, 2004
Preventing items in adaptive testing from being over- or underexposed is one of the main problems in computerized adaptive testing. Though the problem of overexposed items can be solved using a probabilistic item-exposure control method, such methods are unable to deal with the problem of underexposed items. Using a system of rotating item pools,…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Banks, Test Construction
Beretvas, S. Natasha; Williams, Natasha J. – Journal of Educational Measurement, 2004
To assess item dimensionality, the following two approaches are described and compared: hierarchical generalized linear model (HGLM) and multidimensional item response theory (MIRT) model. Two generating models are used to simulate dichotomous responses to a 17-item test: the unidimensional and compensatory two-dimensional (C2D) models. For C2D…
Descriptors: Item Response Theory, Test Items, Mathematics Tests, Reading Ability
Hurley, Kolleen E.; Deal, William Paul – Mental Retardation: A Journal of Practices, Policy and Perspectives, 2006
"Malingering," the exaggeration or fabrication of physical and/or psychological symptoms, can threaten the psychological assessment process (American Psychiatric Association, 2000). To enhance the validity of psychological evaluations, researchers have studied trends in malingering and developed instruments for its detection (Rogers, Bagby, &…
Descriptors: Measures (Individuals), Personality Traits, Mental Retardation, Symptoms (Individual Disorders)
Seaton, Eleanor K.; Scottham, Krista Maywalt; Sellers, Robert M. – Child Development, 2006
Although the identity formation model is widely used to assess adolescent ethnic identity development, the model propositions have rarely been tested. The existence of the identity statuses (diffuse, foreclosed, moratorium, achieved), the proposed developmental trajectories, and whether youth in the achieved status report higher levels of…
Descriptors: Racial Identification, Well Being, Adolescents, African Americans
Eggen, Theo J. H. M.; Verschoor, Angela J. – Applied Psychological Measurement, 2006
Computerized adaptive tests (CATs) are individualized tests that, from a measurement point of view, are optimal for each individual, possibly under some practical conditions. In the present study, it is shown that maximum information item selection in CATs using an item bank that is calibrated with the one- or the two-parameter logistic model…
Descriptors: Adaptive Testing, Difficulty Level, Test Items, Item Response Theory
Waber, Dietmar – Education Canada, 2006
The article focuses on the Fraser Institute's (Vancouver, British Columbia) "Report Card on Elementary Schools in British Columbia." The rating of a school is based on reading, writing, and numeracy levels for both Grade 4 and Grade 7 students, gender differences in reading and numeracy in Grade 7, and the percentage of students not meeting…
Descriptors: Foreign Countries, Examiners, Report Cards, Reading
Meier, Scott T. – American Journal of Evaluation, 2004
Despite evidence that the choice of dependent measures can significantly influence design sensitivity, many evaluators default to traditional measures that may be insensitive to intervention effects. This paper describes an innovative set of test development guidelines designed to select items and create aggregate scales that are better able to…
Descriptors: Psychometrics, Item Analysis, Test Construction, Measures (Individuals)
Wang, Wen-Chung; Wilson, Mark – Educational and Psychological Measurement, 2005
This study presents a procedure for detecting differential item functioning (DIF) for dichotomous and polytomous items in testlet-based tests, whereby DIF is taken into account by adding DIF parameters into the Rasch testlet model. Simulations were conducted to assess recovery of the DIF and other parameters. Two independent variables, test type…
Descriptors: Test Format, Test Bias, Item Response Theory, Item Analysis
Martin, Nadine; Ayala, Jennifer – Brain and Language, 2004
In the first part of this study, we investigated effects of item and task type on span performance in a group of aphasic individuals with word processing and STM deficits. Group analyses revealed significant effects of item on span performance with span being greater for digits than for words. We also investigated associations between subjects'…
Descriptors: Phonology, Short Term Memory, Aphasia, Correlation

