Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Descriptors: Test Bias, Test Reliability, Performance, Scores
Stefan, Catrinel A.; Miclea, Mircea – School Mental Health, 2017
The emotional competence screening and the social competence screening for parents and teachers were developed in Romania as brief, multiinformant, strength-focused assessment tools to identify children at risk of underdeveloped social-emotional competencies. The objective of the current study was to gather further reliability and validity…
Descriptors: Social Development, Emotional Development, Preschool Children, Screening Tests
Brasher, Casey F. – ProQuest LLC, 2017
Reading comprehension assessments often lack instructional utility because they do not accurately pinpoint why a student has difficulty. The varying formats, directions, and response requirements of comprehension assessments lead to differential measurement of underlying skills and contribute to noted amounts of unshared variance among tests. Maze…
Descriptors: Progress Monitoring, Grade 4, Reading Comprehension, Reading Instruction
Sides, Meredith Louise Carr – ProQuest LLC, 2017
The study adapted Astin's I-E-O model and utilized multiple regression analyses to predict faculty attitudes toward developmental education. The study utilized a cross-sectional survey design to survey faculty members at 27 different higher education institutions in the state of Alabama. The survey instrument was a self-designed questionnaire that…
Descriptors: Teacher Attitudes, Teacher Characteristics, Multiple Regression Analysis, Case Studies
McLaughlin, Tara W.; Snyder, Patricia A.; Algina, James – Grantee Submission, 2017
The Learning Target Rating Scale (LTRS) is a measure designed to evaluate the quality of teacher-developed learning targets for embedded instruction for early learning. In the present study, we examined the measurement dependability of LTRS scores by conducting a generalizability study (G-study). We used a partially nested, three-facet model to…
Descriptors: Generalizability Theory, Scores, Rating Scales, Evaluation Methods
Türkel, Ali; Özdemir, Eylem Ezgi; Akbulut, Serdar – Online Submission, 2017
In this study, a reading culture scale was developed that can be used to determine the reading cultures of teacher candidates studying in education faculties. A review of the literature showed that there were scales measuring concepts such as reading attitudes, habits, perception, and self-efficacy, but not a scale that…
Descriptors: Preservice Teachers, Test Construction, Test Validity, Test Reliability
Kirbas, Abdulkadir – Online Submission, 2017
The aim of this study is to determine the realization levels of values education implementations in teaching Turkish by taking the opinions of Turkish teachers. The sample of this study conducted in the survey model comprises 108 Turkish teachers employed at different secondary schools in Erzurum, Bayburt, Gümüshane and Trabzon in the Spring…
Descriptors: Foreign Countries, Secondary School Teachers, Teacher Attitudes, Opinions
Li, Haiying; Gobert, Janice; Dickler, Rachel – International Educational Data Mining Society, 2017
Scientific explanations, which include a claim, evidence, and reasoning (CER), are frequently used to measure students' deep conceptual understandings of science. In this study, we developed an automated scoring approach for the CER that students constructed as a part of virtual inquiry (e.g., formulating questions, analyzing data, and warranting…
Descriptors: Automation, Science Instruction, Inquiry, Educational Assessment
Schoen, Robert C.; Bray, Wendy; Wolfe, Christopher; Tazaz, Amanda M.; Nielsen, Lynne – Grantee Submission, 2017
This study reports on the development and field study of K-TEEM, a web-based assessment instrument designed to measure mathematical knowledge for teaching (MKT) at the early elementary level. The development process involved alignment with early elementary curriculum standards, expert review of items and scoring criteria, cognitive interviews with…
Descriptors: Teacher Evaluation, Elementary School Teachers, Mathematics, Pedagogical Content Knowledge
Reid, Tingting – AERA Online Paper Repository, 2017
Children's development of attitudes about science begins early. However, little is known regarding how children feel about science in the preschool years. The lack of a psychologically sound instrument that can appropriately capture young children's attitude towards science is another challenge faced by early childhood science researchers. The…
Descriptors: Preschool Children, Preschool Education, Student Attitudes, Test Construction
Mamaril, Natasha A.; Usher, Ellen L.; Li, Caihong R.; Economy, D. Ross; Kennedy, Marian S. – Journal of Engineering Education, 2016
Background: Self-efficacy has been shown to be positively related to undergraduate engineering students' achievement. Designing self-efficacy measures to assess the multifaceted skills required of engineers could improve the predictive relationship between efficacy beliefs and performance. Purpose: This study evaluates the factor structure,…
Descriptors: Undergraduate Students, Engineering Education, Self Efficacy, Academic Achievement
Sulz, Lauren; Temple, Viviene; Gibbons, Sandra – Physical Educator, 2016
The aim of this research was to develop measures to provide valid and reliable representation of the motivational states and psychological needs proposed by the self-determination theory (Deci & Ryan, 1985, 2000) within a physical education context. Based on theoretical underpinnings of self-determination theory, two questionnaires were…
Descriptors: Self Determination, Physical Education, Student Motivation, Test Validity
Hammer, Hugo Lewi; Habib, Laurence – EURASIA Journal of Mathematics, Science & Technology Education, 2016
The most common way to grade students in courses at university and university college level is to use final written exams. The aim of final exams is generally to provide a reliable and a valid measurement of the extent to which a student has achieved the learning outcomes for the course. A source of uncertainty in grading students based on an exam…
Descriptors: Grading, Mathematics Tests, Science Tests, Physics
Padilla-Walker, Laura Maria; Jensen, Lene Arnett – International Journal of Behavioral Development, 2016
Moral psychology has been moving toward consideration of multiple kinds of moral concepts and values, such as the Ethics of Autonomy, Community, and Divinity. While these three ethics have commonly been measured qualitatively, the current study sought to validate the long and short forms of the Ethical Values Assessment (EVA), which is a…
Descriptors: Ethics, Questionnaires, Error of Measurement, Moral Values
Godor, Brian P. – Teaching in Higher Education, 2016
Student learning approaches research has been built upon the notions of deep and surface learning. Despite its status as part of the educational research canon, the dichotomy of deep/surface has been critiqued as constraining the debate surrounding student learning. Additionally, issues of content validity have been expressed concerning…
Descriptors: Higher Education, Q Methodology, Academic Achievement, Masters Programs

