Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Coe, Robert – Oxford Review of Education, 2008
The comparability of examinations in different subjects has been a controversial topic for many years and a number of criticisms have been made of statistical approaches to estimating the "difficulties" of achieving particular grades in different subjects. This paper argues that if comparability is understood in terms of a linking…
Descriptors: Test Items, Grades (Scholastic), Foreign Countries, Test Bias
Hall, John D.; Howerton, D. Lynn; Jones, Craig H. – Research in the Schools, 2008
The No Child Left Behind Act and the accountability movement in public education caused many states to develop criterion-referenced academic achievement tests. Scores from these tests are often used to make high stakes decisions. Even so, these tests typically do not receive independent psychometric scrutiny. We evaluated the 2005 Arkansas…
Descriptors: Criterion Referenced Tests, Achievement Tests, High Stakes Tests, Public Education
Chauvot, Jennifer B.; Benson, Sharon L. D. – Mathematics Teaching in the Middle School, 2008
This article shares card-sorting activities that involve state-mandated test items to use with prospective and practicing mathematics teachers to teach about accountability measures while exploring reform-minded mathematics instruction recommendations. (Contains 2 figures.)
Descriptors: Test Items, Mathematics Achievement, Mathematics Teachers, Accountability
Church, Wesley T., II; Wakeman, Emily E.; Miller, Sarah L.; Clements, Carl B.; Sun, Fei – Research on Social Work Practice, 2008
Objectives: The objective of this study was to examine the nature of individual attitudes toward sex offenders. Because the term "sex offender" tends to evoke strong emotions, and given that open-ended self reports tend to be highly subjective, particularly in the context of such pointed terminology, this study sought to develop an attitude…
Descriptors: Sexual Abuse, Community Attitudes, Measures (Individuals), Psychometrics
Shapiro, Amy – Journal of the Scholarship of Teaching and Learning, 2009
Student evaluations of a large General Psychology course indicate that students enjoy the class a great deal, yet attendance is low. An experiment was conducted to evaluate a personal response system as a solution. Attendance rose by 30% as compared to extra credit as an inducement, but was equivalent to offering pop quizzes. Performance on test…
Descriptors: Test Items, Instructional Effectiveness, Learning Strategies, Classroom Techniques
Overbeek, Geertjan; Ha, Thao; Scholte, Ron; de Kemp, Raymond; Engels, Rutger C. M. E. – Journal of Adolescence, 2007
This study examined the psychometric properties of an adolescent version of the "triangular love scale" (TLS), which assesses three components of romantic relationships: intimacy, passion, and commitment. Using data from 435 Dutch adolescents aged 12-18 years, we found evidence for convergent validity, showing that dimensions of…
Descriptors: Measures (Individuals), Psychometrics, Test Validity, Intimacy
Chiat, Shula; Roy, Penny – Journal of Speech, Language, and Hearing Research, 2007
Purpose: To determine the psychometric properties of the Preschool Repetition (PSRep) Test (Roy & Chiat, 2004), to establish the range of performance in typically developing children and variables affecting this performance, and to compare the performance of clinically referred children. Method: The PSRep Test comprises 18 words and 18…
Descriptors: Phonology, Psychometrics, Interrater Reliability, Followup Studies
Jeong, Yoonkyung; Levine, Susan C.; Huttenlocher, Janellen – Journal of Cognition and Development, 2007
This study examines the development of children's ability to reason about proportions that involve either discrete entities or continuous amounts. Six-, 8- and 10-year olds were presented with a proportional reasoning task in the context of a game involving probability. Although all age groups failed when proportions involved discrete quantities,…
Descriptors: Age, Children, Probability, Cognitive Development
Talbot, Robert M.; Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2007
At the core of the argument-based approach to test validation as it has been presented by Kane (1992, 2004, 2006) is a relatively simple premise: test validity is demonstrated by linking the score that is observed from a test instrument to the use of that score for some subsequent inference. Details, however, are not so simple: How does one craft…
Descriptors: Test Validity, Inferences, Knowledge Base for Teaching, Mathematics Education
Pommerich, Mary – Journal of Technology, Learning, and Assessment, 2007
Computer administered tests are becoming increasingly prevalent as computer technology becomes more readily available on a large scale. For testing programs that utilize both computer and paper administrations, mode effects are problematic in that they can result in examinee scores that are artificially inflated or deflated. As such, researchers…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Scores
Georgiadou, Elissavet; Triantafillou, Evangelos; Economides, Anastasios A. – Journal of Technology, Learning, and Assessment, 2007
Since researchers acknowledged the several advantages of computerized adaptive testing (CAT) over traditional linear test administration, the issue of item exposure control has received increased attention. Due to CAT's underlying philosophy, particular items in the item pool may be presented too often and become overexposed, while other items are…
Descriptors: Adaptive Testing, Computer Assisted Testing, Scoring, Test Items
Penfield, Randall D. – Educational and Psychological Measurement, 2007
The standard error of the maximum likelihood ability estimator is commonly estimated by evaluating the test information function at an examinee's current maximum likelihood estimate (a point estimate) of ability. Because the test information function evaluated at the point estimate may differ from the test information function evaluated at an…
Descriptors: Simulation, Adaptive Testing, Computation, Maximum Likelihood Statistics
Giangreco, Michael F.; Broer, Stephen M. – Focus on Autism and Other Developmental Disabilities, 2007
This article describes the development of and directions for using a 16-item screening tool designed to assist cross-stakeholder school teams in determining the extent to which they may be over reliant on special education paraprofessionals or using them inappropriately. The content of the tool is based on contemporary, descriptive research…
Descriptors: Elementary Secondary Education, Screening Tests, Special Education, Paraprofessional Personnel
Childs, Ruth A.; Jaciw, Andrew P.; Saunders, Kelsey – International Journal of Testing, 2007
Many approaches to standard-setting use item calibration and student score estimation results to structure panelists' tasks. However, this requires collecting standard-setting judgments after the item analysis results are available. The Scoring Guide Alignment approach collects standard-setting judgments during the scoring sessions from teachers…
Descriptors: Testing Programs, Scoring, Item Analysis, Test Items
Lee, Sang Min; Puig, Ana; Pasquarella-Daley, Lauren; Denny, George; Rai, Ann Allen; Dallape, Aprille; Parker, Woodrow Max – Measurement and Evaluation in Counseling and Development, 2007
This article describes the revision of the White Racial Consciousness Development Scale (D. Claney & W. M. Parker, 1989). A multistage approach including item generation, item refinement and selection, and evaluation of score validity and reliability was used to test construction and validation. Implications for theory, practice, and future…
Descriptors: Measures (Individuals), Test Construction, Test Items, Scores

Peer reviewed
Direct link
