Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Nathanson, Lori – ProQuest LLC, 2009
This study examines the psychometric properties, reliability, and validity of a measure designed to assess fidelity of implementation of the "Responsive Classroom"[R] ("RC") approach. The Classroom Practices Teacher Survey (CPTS) assesses teachers' use of the "RC" approach, a social and emotional learning (SEL) intervention currently under…
Descriptors: Psychometrics, Reliability, Validity, Teacher Surveys
Wise, Steven L.; Ma, Lingling; Kingsbury, G. Gage; Hauser, Carl – Northwest Evaluation Association, 2010
This study investigated the relationships between when a test is administered and the amount of test-taking effort exhibited by examinees. Three time-related variables were investigated: the time of year the test was administered, the day of the week the test event occurred, and the time of day that the test event occurred. Mean effort did not…
Descriptors: Academic Achievement, Test Wiseness, Investigations, Schematic Studies
Saladin, Shawn P.; Reid, Christine; Shiels, John – Rehabilitation Research, Policy, and Education, 2011
The Commission on Rehabilitation Counselor Certification (CRCC) has taken a proactive stance on perceived test inequities of the Certified Rehabilitation Counselor (CRC) exam as it relates to people who are prelingually deaf and hard of hearing. This article describes the process developed and implemented by the CRCC to help maximize test equity…
Descriptors: Test Items, Rehabilitation Counseling, Counselor Certification, Deafness
Wiberg, Marie; Sundstrom, Anna – Practical Assessment, Research & Evaluation, 2009
A common problem in predictive validity studies in the educational and psychological fields, e.g. in educational and employment selection, is restriction in range of the predictor variables. There are several methods for correcting correlations for restriction of range. The aim of this paper was to examine the usefulness of two approaches to…
Descriptors: Predictive Validity, Predictor Variables, Correlation, Mathematics
Poliandri, Donatella; Cardone, Michele; Muzzioli, Paola; Romiti, Sara – Online Submission, 2011
The purpose of this study is to validate a test anxiety scale for Italian students. The scale is part of a questionnaire administered after the students' annual competence test by the National Institute for the Educational Evaluation of Instruction and Training (INVALSI). The aim of the scale is to explore the anxiety levels of Italian students…
Descriptors: Reading Comprehension, Standardized Tests, Rating Scales, Questionnaires
Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011
Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…
Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests
Squires, Jane K.; Waddell, Misti L.; Clifford, Jantina R.; Funk, Kristin; Hoselton, Robert M.; Chen, Ching-I – Topics in Early Childhood Special Education, 2013
Psychometric and utility studies on Social Emotional Assessment Measure (SEAM), an innovative tool for assessing and monitoring social-emotional and behavioral development in infants and toddlers with disabilities, were conducted. The Infant and Toddler SEAM intervals were the study focus, using mixed methods, including item response theory…
Descriptors: Psychometrics, Evaluation Methods, Social Development, Emotional Development
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Ziegler, Albert; Ziegler, Albert – High Ability Studies, 2009
The aim of this paper is to demonstrate the dramatic consequences the application of cut-off points can have in the practice of identifying gifted individuals. The paradoxical attenuation effect describes the frequent situation in which measurements of the gifts and talents individuals possess are lower than their true values. However, in…
Descriptors: Gifted, Academic Achievement, Test Theory, Measurement
Yoon, So Yoon – ProQuest LLC, 2011
Working under classical test theory (CTT) and item response theory (IRT) frameworks, this study investigated psychometric properties of the Revised Purdue Spatial Visualization Tests: Visualization of Rotations (Revised PSVT:R). The original version, the PSVT:R was designed by Guay (1976) to measure spatial visualization ability in…
Descriptors: Undergraduate Students, Test Bias, Guessing (Tests), Construct Validity
Coniam, David – New Horizons in Education, 2011
Background: This article reports a study into the double marking of Liberal Studies in Hong Kong. This is now a compulsory subject in Hong Kong's Years 10-12 curriculum which, when first examined in the new Hong Kong Diploma of Secondary Education in 2012, will increase its candidature from its current 3,300 to 80,000. Aims: To examine the…
Descriptors: Tests, Foreign Countries, English (Second Language), Second Language Learning
Wang, Xin – New Horizons in Education, 2011
Background: Service-learning as a pedagogy and curricular consideration to revitalize undergraduate education has been flourishing in the Asia-Pacific Region for years. The W. T. Chan Fellowship Program is designed as an intercultural service-learning program, with the fellows coming from China and Hong Kong, to experience service-learning in the…
Descriptors: Undergraduate Study, International Programs, Service Learning, Personal Autonomy
Benefiel, Diane – ProQuest LLC, 2011
The purpose of this study was to: 1) analyze the relationship of preprogram and nursing program variables on National Council Licensure Examination for Registered Nurses (NCLEX-RN) success and failure, and 2) develop a model to predict success and failure on the NCLEX-RN. The convenience sample was comprised of 245 spring, summer, and fall midterm…
Descriptors: Grade Point Average, American Indians, Whites, African Americans
Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure, and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Direct link
Peer reviewed
