Publication Date
| In 2026 | 2 |
| Since 2025 | 469 |
| Since 2022 (last 5 years) | 1948 |
| Since 2017 (last 10 years) | 4520 |
| Since 2007 (last 20 years) | 7005 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10011 |
| Test Construction | 4371 |
| Foreign Countries | 3834 |
| Psychometrics | 2429 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 839 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 130 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Sucuoglu, Bülbin; Bakkaloglu, Hatice; Iscen Karasu, Fadime; Demir, Seyda; Akalin, Selma – Educational Sciences: Theory and Practice, 2014
The purpose of this study is to develop the Inclusion Knowledge Test (IKT) for assessing preschool teachers' knowledge of inclusive practices and to examine its psychometric characteristics. To achieve this purpose, the researchers wrote short stories (vignettes) focusing on the various aspects of inclusive practices, such as assessing the…
Descriptors: Preschool Teachers, Knowledge Level, Inclusion, Psychometrics
O'Reilly, Tenaha; Weeks, Jonathan; Sabatini, John; Halderman, Laura; Steinberg, Jonathan – Educational Psychology Review, 2014
When designing a reading intervention, researchers and educators face a number of challenges related to the focus, intensity, and duration of the intervention. In this paper, we argue there is another fundamental challenge--the nature of the reading outcome measures used to evaluate the intervention. Many interventions fail to demonstrate…
Descriptors: Intervention, Reading Instruction, Apprenticeships, Summative Evaluation
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Schoen, Robert C.; LaVenia, Mark; Champagne, Zachary M.; Farina, Kristy; Tazaz, Amanda M. – Grantee Submission, 2017
The following report describes an assessment instrument called the Mathematics Performance and Cognition (MPAC) interview. The MPAC interview was designed to measure two outcomes of interest. It was designed to measure first and second graders' mathematics achievement in number, operations, and equality, and it was also designed to gather…
Descriptors: Interviews, Test Construction, Psychometrics, Elementary School Mathematics
Schoen, Robert C.; LaVenia, Mark; Champagne, Zachary M.; Farina, Kristy – Grantee Submission, 2017
This report provides an overview of the development, implementation, and psychometric properties of a student mathematics interview designed to assess first- and second-grade student achievement and thinking processes. The student interview was conducted with 622 first- or second-grade students in 22 schools located in two public school districts…
Descriptors: Interviews, Test Construction, Psychometrics, Elementary School Mathematics
Skinner, Rebecca R.; Lomax, Erin – Congressional Research Service, 2017
Federal education legislation continues to emphasize the role of assessment in elementary and secondary schools. Perhaps most prominently, the Elementary and Secondary Education Act (ESEA), as amended by the Every Student Succeeds Act (ESSA; P.L. 114-95), requires the use of test-based educational accountability systems in states and specifies the…
Descriptors: Educational Assessment, Educational Legislation, Elementary Secondary Education, Federal Legislation
Partnership for Assessment of Readiness for College and Careers, 2018
The purpose of this technical report is to describe the third operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) assessments in the 2016-2017 academic year. PARCC is a state-led consortium creating next-generation assessments that, compared to traditional K-12 assessments, more accurately…
Descriptors: College Readiness, Career Readiness, Common Core State Standards, Language Arts
Furlong, Michael J.; Dowdy, Erin; Nylund-Gibson, Karen – Grantee Submission, 2018
This manual reports on the development and validation of the original Social Emotional Health Survey-Secondary (carried out between 2012 and 2017). We shared the first version of the SEHS-S because it had sufficient validation evidence based on research completed by 2015; hence, the form reported on in this manual is called the SEHS-S (2015)…
Descriptors: Surveys, Psychometrics, Test Validity, Mental Health
Yesil, Rustu – Education, 2012
The objective of this study is to develop a scale in order to determine the reasons why students delay academic tasks and the levels that they are affected from these reasons. The study group was composed of a total of 447 students from the faculty of education. The KMO value of this scale composed of 43 items collected under six factors was…
Descriptors: Factor Analysis, Validity, Measurement, Test Reliability
Tien, Hsiu-Lan Shelley; Wang, Yu-Chen; Chu, Hui-Chuang; Huang, Tsu-Lun – Journal of Vocational Behavior, 2012
The present study tested the reliability and validity of the Career Adapt-Ability Scale--Taiwan Form (CAAS-Taiwan Form). The CAAS consists of four scales, each with six items, which measure concern, control, curiosity, and confidence as psychosocial resources for managing occupational transitions, developmental tasks, and work traumas. Internal…
Descriptors: Foreign Countries, Vocational Adjustment, Measures (Individuals), Psychometrics
Davies, Alan – Language Testing, 2012
In this article, the author begins by discussing four challenges on the concept of validity. These challenges are: (1) the appeal to logic and syllogistic reasoning; (2) the claim of reliability; (3) the local and the universal; and (4) the unitary and the divisible. In language testing validity cannot be achieved directly but only through a…
Descriptors: Language Tests, Test Validity, Test Reliability, Testing
Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A. – International Journal of Rehabilitation Research, 2012
The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
Descriptors: Head Injuries, Neurological Impairments, Patients, Life Satisfaction
Erford, Bradley T.; Alsamadi, Silvana C. – Measurement and Evaluation in Counseling and Development, 2012
Score reliability and validity of parent responses concerning their 10- to 17-year-old students were analyzed using the Screening Test for Emotional Problems-Parent Report (STEP-P), which assesses a variety of emotional problems classified under the Individuals with Disabilities Education Improvement Act. Score reliability, convergent, and…
Descriptors: Screening Tests, Emotional Problems, Children, Adolescents
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Ghirardelli, Alyssa; Quinn, Valerie; Sugerman, Sharon – Journal of Nutrition Education and Behavior, 2011
Objective: To develop a retail grocery instrument with weighted scoring to be used as an indicator of the food environment. Participants/Setting: Twenty six retail food stores in low-income areas in California. Intervention: Observational. Main Outcome Measure(s): Inter-rater reliability for grocery store survey instrument. Description of store…
Descriptors: Interrater Reliability, Marketing, Scoring, Correlation

Peer reviewed
Direct link
