Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Lorsbach, Thomas C.; Reimer, Jason F. – Journal of Genetic Psychology, 2005
The authors measured memory for individual features (objects only or locations only) and the combination of those features (objects and locations) in 9-, 12-, and 21-year-old students with a "yes" or "no" recognition task. Analysis of recognition memory performance (d' scores) revealed that although age differences existed in memory for individual…
Descriptors: Preadolescents, Grade 3, Grade 6, Young Adults
Dillon, Frank; Worthington, Roger L. – Journal of Counseling Psychology, 2003
Five studies on the development of the Lesbian, Gay, and Bisexual Affirmative Counseling Self-Efficacy Inventory (LGB-CSI) were conducted. Exploratory and confirmatory factor analyses of an initial pool of 64 items yielded 5 factors that assess counselor self-efficacy to perform lesbian, gay, and bisexual (LGB) affirmative counseling behaviors…
Descriptors: Validity, Self Efficacy, Social Desirability, Homosexuality
Hula, William; Doyle, Patrick J.; McNeil, Malcolm R.; Mikolic, Joseph M. – Journal of Speech, Language, and Hearing Research, 2006
The purpose of this research was to examine the validity of the 55-item Revised Token Test (RTT) and to compare traditional and Rasch-based scores in their ability to detect group differences and change over time. The 55-item RTT was administered to 108 left- and right-hemisphere stroke survivors, and the data were submitted to Rasch analysis.…
Descriptors: Test Items, Brain Hemisphere Functions, Individual Differences, Difficulty Level
Lecavalier, Luc; Aman, Michael G.; Scahill, Lawrence; McDougle, Christopher J.; McCracken, James T.; Vitiello, Benedetto; Tierney, Elaine; Arnold, L. Eugene; Ghuman, Jaswinder K.; Loftin, Rachel L.; Cronin, Pegeen; Koenig, Kathleen; Posey, David J.; Martin, Andres; Hollway, Jill; Lee, Lisa S.; Kau, Alice S. M. – American Journal on Mental Retardation, 2006
The factor structure, internal consistency, and convergent validity of the Autism Diagnostic Interview-Revised (ADI-R) algorithm items were examined in a sample of 226 youngsters with pervasive developmental disabilities. Exploratory factor analyses indicated a three-factor solution closely resembling the original algorithm and explaining 38% of…
Descriptors: Test Validity, Measures (Individuals), Measurement Techniques, Autism
Crisp, Geoffrey T.; Palmer, Edward J. – Journal of University Teaching and Learning Practice, 2007
The appropriate analysis of students' responses to an assessment is an essential step in improving the quality of the assessment itself as well as staff teaching and student learning. Many academics are unfamiliar with the formal processes used to analyze assessment results; the standard statistical methods associated with analyzing the validity…
Descriptors: Multiple Choice Tests, Student Evaluation, Test Results, Test Construction
Bailey, Alison L.; Huang, Becky H.; Shin, Hye Won; Farnsworth, Tim; Butler, Frances A. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
Within an evidentiary framework for operationally defining academic English language proficiency (AELP), linguistic analyses of standards, classroom discourse, and textbooks have led to specifications for assessment of AELP. The test development process described here is novel due to the emphasis on using linguistic profiles to inform the …
Descriptors: Grade 5, Textbooks, Psychometrics, Profiles
Lievens, Filip; Sackett, Paul R. – Journal of Applied Psychology, 2007
This study used principles underlying item generation theory to posit competing perspectives about which features of situational judgment tests might enhance or impede consistent measurement across repeat test administrations. This led to 3 alternate-form development approaches (random assignment, incident isomorphism, and item isomorphism). The…
Descriptors: Validity, High Stakes Tests, Test Construction, Testing
Sharifi Ashtiani, Nahid; Babaii, Esmat – Studies in Educational Evaluation, 2007
For decades traditional methods of testing have been criticized for saying relatively little reliably about students' ability as well as causing anxiety, which can negatively affect students' recall of learned information. The reform movement with its innovative approaches focusing on learner-centered education perceives assessment as an…
Descriptors: Teaching Methods, Program Effectiveness, Grade 11, Test Construction
Al-A'ali, Mansoor – Educational Technology & Society, 2007
Computer adaptive testing is the study of scoring tests and questions based on assumptions concerning the mathematical relationship between examinees' ability and the examinees' responses. Adaptive student tests, which are based on item response theory (IRT), have many advantages over conventional tests. We use the least square method, a…
Descriptors: Educational Testing, Higher Education, Elementary Secondary Education, Student Evaluation
Schaeffer, Gary A.; And Others – 1993
This report contains results of a field test conducted to determine the relationship between a Graduate Records Examination (GRE) linear computer-based test (CBT) and a paper-and-pencil (P&P) test with the same items. Recent GRE examinees participated in the field test by taking either a CBT or the P&P test. Data from the field test…
Descriptors: Attitudes, College Graduates, Computer Assisted Testing, Equated Scores
Henning, Grant – 1991
Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…
Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment
Webb, Melvin W., II; Miller, Eva R. – 1995
As constructed-response items become an integral part of educational assessments, setting student performance standards on constructed-response items has become an important issue. Two standard-setting methods, one used for setting standards on the National Assessment of Educational Progress (NAEP) in reading in grade 8 and the other used to set…
Descriptors: Comparative Analysis, Constructed Response, Criteria, Educational Assessment
Allen, Nancy L.; Donoghue, John R. – 1995
This Monte Carlo study examined the effect of complex sampling of items on the measurement of differential item functioning (DIF) using the Mantel-Haenszel procedure. Data were generated using a three-parameter logistic item response theory model according to the balanced incomplete block (BIB) design used in the National Assessment of Educational…
Descriptors: Computer Assisted Testing, Difficulty Level, Elementary Secondary Education, Identification
Way, Walter D.; And Others – 1992
This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…
Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory
Frary, Robert B. – 1995
This digest presents a list of recommendations for writing multiple-choice test items, based on psychometrics and logical deduction. Questions should ask more than mere knowledge of facts and should not contain superfluous information as an introduction to the question. Each question should focus on some specific aspect of the course, and the item…
Descriptors: Culture Fair Tests, Distractors (Tests), Educational Assessment, Item Bias

Peer reviewed
Direct link
