Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedChen, Shu-Ying; Ankenmann, Robert D.; Chang, Hua-Hua – Applied Psychological Measurement, 2000
Compared five item selection rules with respect to the efficiency and precision of trait (theta) estimation at the early stages of computerized adaptive testing (CAT). The Fisher interval information, Fisher information with a posterior distribution, Kullback-Leibler information, and Kullback-Leibler information with a posterior distribution…
Descriptors: Adaptive Testing, Computer Assisted Testing, Estimation (Mathematics), Selection
Peer reviewedMorrison, Susan; Free, Kathleen Walsh – Journal of Nursing Education, 2001
Presents guidelines for developing multiple-choice tests to measure critical thinking in nursing. Explains the rationale for test items and describes item criteria, including measurement of cognition at the application level and above, multilogical thinking, and high level of discrimination. (Contains 38 references.) (SK)
Descriptors: Critical Thinking, Guidelines, Higher Education, Multiple Choice Tests
Peer reviewedSchott, G. R.; Bellin, W. – Evaluation & Research in Education, 2001
Developed an approach to account for the impact of item presentation on ensuing constructs in the development of two versions of a self-report measure, the Relational Concept Scale, that was tested with 978 adolescent students in the United Kingdom. Outlines benefits of developing two versions of the scale to protect against presentational bias.…
Descriptors: Adolescents, Foreign Countries, Statistical Bias, Test Construction
Peer reviewedJodoin, Michael G.; Gierl, Mark J. – Applied Measurement in Education, 2001
Developed a new classification method for the logistic regression (LR) procedure for differential item functioning (DIF) based on methods used in the Simultaneous Item Bias test and conducted a simulation study to determine if the effect size measure affects the Type I error and power rates for the LR DIF procedure. Results show that inclusion of…
Descriptors: Classification, Effect Size, Item Bias, Power (Statistics)
Peer reviewedEmbretson, Susan; Gorin, Joanna – Journal of Educational Measurement, 2001
Examines testing practices in: (1) the past, in which the traditional paradigm left little room for cognitive psychology principles; (2) the present, in which testing research is enhanced by principles of cognitive psychology; and (3) the future, in which the potential of cognitive psychology should be fully realized through item design.…
Descriptors: Cognitive Psychology, Construct Validity, Educational Research, Educational Testing
Peer reviewedMeijer, Rob R.; Nering, Michael L. – Applied Psychological Measurement, 1999
Provides an overview of computerized adaptive testing (CAT) and introduces contributions to this special issue. CAT elements discussed include item selection, estimation of the latent trait, item exposure, measurement precision, and item-bank development. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Selection
Peer reviewedBishop, N. Scott; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the effects of overlapping some test items across consecutive test levels by using overlapping and nonoverlapping items with 834 prematched and 782 matched elementary school students and focusing on whether there is an effect on achievement test scores due to item familiarization. No effects were detected. (SLD)
Descriptors: Achievement Tests, Elementary Education, Elementary School Students, Scores
Peer reviewedReise, Steven P. – Applied Psychological Measurement, 2001
The second edition of "Computerized Adaptive Testing" contains new materials related to: (1) chapter 2, system design; (2) chapter 4, item response theory, item calibration, and proficiency estimation; and (3) chapter 10, caveats, pitfalls, and unexpected consequences. The book raises critical computerized adaptive testing research and application…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Test Construction
Peer reviewedKirisci, Levent; Hsu, Tse-chi; Yu, Lifa – Applied Psychological Measurement, 2001
Studied the effects of test dimensionality, theta distribution shape, and estimation program (BILOG, MULTILOG, or XCALIBRE) on the accuracy of item and person parameter estimates through simulation. Derived guidelines for estimating parameters of multidimensional test items using unidimensional item response theory models. (SLD)
Descriptors: Ability, Computer Software, Estimation (Mathematics), Item Response Theory
Peer reviewedZwick, Rebecca; Senturk, Deniz; Wang, Joyce; Loomis, Susan Cooper – Educational Measurement: Issues and Practice, 2001
Compared four mapping item methods using data from the physical science test of the National Assessment of Educational Progress and studied the opinions of science content area experts about the difficulty of the items through a survey completed by 148 science teachers or scientists. Results of model-based mapping methods were more concordant with…
Descriptors: Comparative Analysis, Physical Sciences, Science Teachers, Science Tests
Johnson, Carol; Vanneman, Alan – Education Statistics Quarterly, 2001
Describes fourth graders' performance on 30 questions from the National Assessment of Educational Progress 1998 Civics Assessment, showing questions answered correctly by at least 75%, more than 50%, more than 25%, and fewer than 25%. Includes samples of students' written responses. (Author/SLD)
Descriptors: Citizenship Education, Civics, Elementary School Students, Grade 4
Johnson, Carol; Vanneman, Alan – Education Statistics Quarterly, 2001
Describes 12th graders' performance on 38 questions from the National Assessment of Educational Progress 1998 Civics Assessment, showing percentages of students who answered the questions correctly. Includes samples of students' written responses. (SLD)
Descriptors: Citizenship Education, Civics, High School Seniors, High Schools
Emenogu, Barnabas C.; Childs, Ruth A. – Canadian Journal of Education, 2005
A test item exhibits differential item functioning (DIF) if students with the same ability find it differentially difficult. When the item is administered in French and English, differences in language difficulty and meaning are the most likely explanations. However, curriculum differences may also contribute to DIF. The responses of Ontario…
Descriptors: Foreign Countries, Test Items, Exhibits, Translation
Lorenzo-Seva, Urbano; Rodriguez-Fornells, Antoni – Psychometrika, 2006
Personality tests often consist of a set of dichotomous or Likert items. These response formats are known to be susceptible to an agreeing-response bias called acquiescence. The common assumption in balanced scales is that the sum of appropriately reversed responses should be reasonably free of acquiescence. However, inter-item correlation (or…
Descriptors: Factor Analysis, Correlation, Factor Structure, Personality Measures
Andersson, Luanne – Communication Disorders Quarterly, 2005
The author discusses five key issues related to the adequacy of tests of children's language. Within each key issue, she asks test adequacy questions, accompanied by criteria for determining adequacy. The author also reviews the information found in the manuals for four norm-referenced, standardized tests of language development to illustrate…
Descriptors: Standardized Tests, Language Acquisition, Children, Test Validity

Direct link
