Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008
As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…
Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores
Hodges, Kimberly – ProQuest LLC, 2010
The purpose of this dissertation study was to determine if a relationship existed between problem-based learning (PBL) content acquisition and academic achievement on teacher-made tests in Career and Technical Education (CTE) courses at the middle school level. The study sample consisted of 20 seventh-grade students enrolled in a CTE keyboarding…
Descriptors: Standardized Tests, Problem Based Learning, Pretests Posttests, Test Theory
Magno, Carlo – Online Submission, 2009
The present report demonstrates the difference between the classical test theory (CTT) and item response theory (IRT) approaches using actual test data from junior high school chemistry students. CTT and IRT were compared across two samples and two test forms on item difficulty, internal consistency, and measurement error. The specific…
Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries
Moffett, David W.; Zhou, Yunfang – Online Submission, 2009
The investigators hypothesized that cooperating teachers' evaluations of candidates in clinical practice and field experiences would yield higher scores than those provided by clinical and education division faculty. However, the reasons for the higher scores proved to be much more complex than originally thought. While it was assumed that teachers…
Descriptors: Field Experience Programs, Cooperating Teachers, Student Teacher Supervisors, Clinical Supervision (of Teachers)
Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009
The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…
Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development
Ponterotto, Joseph G.; Park-Taylor, Jennie – Journal of Counseling Psychology, 2007
The present article integrates and expands on the special section contributions of K. O. Cokley (2007); J. E. Helms (2007); J. E. Trimble (2007); S. M. Quintana (2007); and J. S. Phinney and A. D. Ong (2007). The authors of the present article begin with a note on politics and ideology in writings on racial identity development and review general…
Descriptors: Ethnicity, Counseling Psychology, Racial Identification, Test Theory
Vassar, Matt; Hale, William – Social Indicators Research, 2007
Due to the emergence of positive psychology in recent years, a growing line of research has focused on aspects of psychological wellness rather than psychopathology. Within the context of positive psychology, life satisfaction has emerged as a key variable of study in relation to adult and youth populations. Accurate measurement of life…
Descriptors: Life Satisfaction, Test Reliability, Psychopathology, Psychometrics
Wang, Tzu-Hua; Wang, Kuo-Hua; Huang, Shih-Chieh – Computers & Education, 2008
Teacher assessment literacy is a key factor in the success of teaching, but some studies concluded that teachers lack it. The aim of this research is to propose the "Practicing, Reflecting and Revising with WATA system (P2R-WATA) Assessment Literacy Development Model" for improving pre-service teacher assessment literacy. WATA system offers…
Descriptors: Preservice Teacher Education, Test Items, Item Analysis, Literacy
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Peer reviewed: Silverstein, A. B. – Journal of Clinical Psychology, 1985
Analyzes the intercorrelations among the 11 subtests of the Wechsler Adult Intelligence Scale (Revised) (WAIS-R) for nine age groups in the standardization sample. Results were more consistent with a three-factor solution than a two-factor solution. A single-factor solution may be at least as adequate as multifactor solutions. (Author/BH)
Descriptors: Cluster Analysis, Test Theory
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
von Davier, Alina A.; Fournier-Zajac, Stephanie; Holland, Paul W. – ETS Research Report Series, 2007
In the nonequivalent groups with anchor test (NEAT) design, there are several ways to use the information provided by the anchor in the equating process. One of the NEAT-design equating methods is the linear observed-score Levine method (Kolen & Brennan, 2004). It is based on a classical test theory model of the true scores on the test forms…
Descriptors: Equated Scores, Statistical Analysis, Test Items, Test Theory
Childs, Ruth A.; Dunn, Jennifer L.; van Barneveld, Christina; Jaciw, Andrew P. – International Journal of Testing, 2007
This study compares five scoring approaches for a test of clinical reasoning skills. All of the approaches incorporate information about the correct item responses selected and the errors, such as selecting too many responses or selecting a response that is inappropriate and/or harmful to the patient. The approaches are combinations of theoretical…
Descriptors: Scoring, Clinical Diagnosis, Thinking Skills, Reliability
Buchmann, Claudia; Condron, Dennis J.; Roscigno, Vincent J. – Social Forces, 2010
The authors welcome and appreciate the comments of Eric Grodsky and Sigal Alon on their article "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment." In their comments, Grodsky takes issue with several important theoretical and methodological aspects of their article and Alon highlights key processes…
Descriptors: Race, Educational Mobility, Test Preparation, College Entrance Examinations
Chapman, Jason E.; Sheidow, Ashli J.; Henggeler, Scott W.; Halliday-Boykins, Colleen A.; Cunningham, Phillippe B. – Journal of Child & Adolescent Substance Abuse, 2008
A unique application of the Many-Facet Rasch Model (MFRM) is introduced as the preferred method for evaluating the psychometric properties of a measure of therapist adherence to Contingency Management (CM) treatment of adolescent substance use. The utility of psychometric methods based in Classical Test Theory was limited by complexities of the…
Descriptors: Caregivers, Contingency Management, Rating Scales, Psychometrics

