Akarsu, Bayram – European Journal of Physics Education, 2012
Physics educators around the world often need reliable diagnostic materials to measure students' understanding of physics concepts in high school. The purpose of this study is to evaluate a new diagnostic tool for high school optics concepts. The Test of Conceptual Understanding on High School Optics (TOCUSO) consists of 25 conceptual items that measure…
Descriptors: High Schools, Secondary School Science, Optics, Concept Teaching
Bell, Tracey R. – ProQuest LLC, 2012
The purpose of this study was to (a) evaluate and discuss strengths and weaknesses in the current system of identifying gifted students and (b) investigate the effectiveness of using the Naglieri Nonverbal Abilities Test to identify underrepresented and gifted minority students in grades one through five in Central Mississippi. The study consisted…
Descriptors: Selection Criteria, Gifted, Talent Identification, Ability Identification
Tsagari, Dina, Ed.; Csepes, Ildiko, Ed. – Peter Lang Frankfurt, 2012
The Guidelines for Good Practice of the European Association for Language Testing and Assessment (EALTA) stress the importance of collaboration between all parties involved in the process of developing instruments, activities and programmes for testing and assessment. Collaboration is considered to be as important as validity and reliability,…
Descriptors: Sign Language, Testing, Language Tests, Test Validity
Amrein-Beardsley, Audrey; Collins, Clarin – Education Policy Analysis Archives, 2012
The SAS Educational Value-Added Assessment System (SAS® EVAAS®) is the most widely used value-added system in the country. It is also self-proclaimed as "the most robust and reliable" system available, with its greatest benefit to help educators improve their teaching practices. This study critically examined the effects of SAS® EVAAS® as…
Descriptors: Evidence, Urban Schools, Private Schools, Program Effectiveness
Maederer, Jennifer L. – ProQuest LLC, 2011
The primary purpose of the current research was to determine whether low-income, high-risk young children's emergent literacy skills, including measures of oral language and letter knowledge, were related to their social competence. Other goals included determining the reliability of a social competence rating scale, the Social Competence…
Descriptors: Disadvantaged Youth, At Risk Students, Emergent Literacy, Reading Skills
Shattuck, Dominick; Corbell, Kristen A.; Osbourne, Jason W.; Knezek, Gerald; Christensen, Rhonda; Grable, Lisa Leonor – Computers in the Schools, 2011
In this article the authors present a confirmatory factor analysis of the Teachers' Attitudes Toward Computers (TAC) and the Teachers' Attitudes Toward Information Technology (TAT) scales by Christensen and Knezek (1996, 1998) using large samples from three states. The TAC was reduced from 98 items and nine factors to 35 items and eight factors,…
Descriptors: Computer Attitudes, Information Technology, Educational Technology, Computer Uses in Education
Glesser, Andrea L. – ProQuest LLC, 2010
This study provided a preliminary analysis of concurrent and discriminative validity for the "Early Literacy Progress Monitoring Assessment Tool" (ELP-MAT; Kaderavek, 2009). Sixty preschool students between the ages of 3 years, 6 months and 5 years of age, from early childhood programs in Northwest Ohio, participated in the study. The…
Descriptors: Early Reading, Early Childhood Education, Language Impairments, Phonological Awareness
Anderson, Daniel; Lai, Cheng-Fei; Nese, Joseph F. T.; Park, Bitnara Jasmine; Saez, Leilani; Jamgochian, Elisa; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010
In the following technical report, we present evidence of the technical adequacy of the easyCBM® math measures in grades K-2. In addition to reliability information, we present criterion-related validity evidence, both concurrent and predictive, and construct validity evidence. The results represent data gathered throughout the 2009/2010 school…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Test Reliability, Test Validity
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests, that is, tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability and score consistency can be addressed, but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Graham, Aislin R.; Sherry, Simon B.; Stewart, Sherry H.; Sherry, Dayna L.; McGrath, Daniel S.; Fossum, Kristin M.; Allen, Stephanie L. – Journal of Counseling Psychology, 2010
Perfectionistic concerns (i.e., negative reactions to failures, concerns over others' criticism and expectations, and nagging self-doubts) are a putative risk factor for depressive symptoms. This study proposes and supports the existential model of perfectionism and depressive symptoms (EMPDS), a conceptual model aimed at explaining why…
Descriptors: Foreign Countries, Risk, Depression (Psychology), Models
Coker, David L., Jr.; Ritchey, Kristen D. – Exceptional Children, 2010
Despite the growing body of research on writing assessment, little attention has been devoted to developing and validating measures for beginning writers. This study examined the technical adequacy of a Sentence Writing measure with 233 students in kindergarten and first grade. The reliability, validity, and sensitivity to growth were investigated…
Descriptors: Writing Evaluation, Curriculum Based Assessment, Writing Tests, Test Validity
Branscum, Paul; Sharma, Manoj; Kaye, Gail; Succop, Paul – Journal of Nutrition Education and Behavior, 2010
Objective: The objective of this study was to report the construct validity and internal consistency reliability of the Food Behavior Checklist modified for children (FBC-MC), with low-income, Youth Expanded Food and Nutrition Education Program (EFNEP)-eligible children. Methods: Using a cross-sectional research design, construct validity was…
Descriptors: Check Lists, Research Design, Nutrition, Construct Validity
Lew, Magdeleine D. N.; Alwis, W. A. M.; Schmidt, Henk G. – Assessment & Evaluation in Higher Education, 2010
The purpose of the two studies presented here was to evaluate the accuracy of students' self-assessment ability, to examine whether this ability improves over time and to investigate whether self-assessment is more accurate if students believe that it contributes to improving learning. To that end, the accuracy of the self-assessments of 3588…
Descriptors: Self Evaluation (Individuals), Beliefs, Learning Processes, Correlation
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

