Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Boston, Melissa D.; Candela, Amber G. – ZDM: The International Journal on Mathematics Education, 2018
The Instructional Quality Assessment (IQA) identifies the nature and quality of classroom instruction by considering students' opportunities to engage in cognitively demanding mathematical work and discussions. The IQA assesses ambitious mathematics instruction based on the following dimensions: potential of the task, task implementation, rigor of…
Descriptors: Mathematics Instruction, Educational Assessment, Educational Quality, Scoring Rubrics
Kamis, Ömer; Dogan, C. Deha – Journal of Education and Learning, 2018
This research aimed to compare the G and Phi coefficients estimated in Decision studies in Generalizability theory and obtained in actual cases for the same conditions of similar facets by using crossed design. The research was conducted as pure research on 120 individuals (students), six items and 12 raters. An achievement test composed of six…
Descriptors: Generalizability Theory, Decision Making, Reliability, Computation
Durham, Mary F.; Knight, Jennifer K.; Bremers, Emily K.; DeFreece, Jameson D.; Paine, Alex R.; Couch, Brian A. – International Journal of STEM Education, 2018
Background: The Scientific Teaching (ST) pedagogical framework encompasses many of the best practices recommended in the literature and highlighted in national reports. Understanding the growth and impact of ST requires instruments to accurately measure the extent to which practitioners implement ST in their courses. Researchers have typically…
Descriptors: Science Instruction, Teaching Models, Interrater Reliability, Measures (Individuals)
Carr, W. David; Volberding, Jennifer – Athletic Training Education Journal, 2018
Context: Measurements of the opinions of alumni and employers are utilized by many athletic training education programs (ATEPs). Information obtained from such measurements can be useful in determining the strengths and weaknesses of a program. Objective: To describe the development of two instruments designed to elicit the opinions of recent…
Descriptors: Alumni, Employer Attitudes, Employers, Surveys
García-Ros, Rafael; Fuentes, María C.; Hernàndez i Dobon, Francisco; Villar-Aguilés, Alícia; Pérez-González, Francisco – Electronic Journal of Research in Educational Psychology, 2018
Introduction: This study focuses on the development and validation of the Mentoring Processes Assessment Questionnaire (MPAQ), designed to assess mentoring processes in peer-mentoring programs aimed at first-year university students. Method: Participants in the study were 354 first-year students from a broad set of degrees at the University of…
Descriptors: Mentors, College Freshmen, Peer Relationship, Test Construction
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Watkins, Marley W.; Dombrowski, Stefan C.; Canivez, Gary L. – International Journal of School & Educational Psychology, 2018
The reliability and factorial validity of the Wechsler Intelligence Scale for Children--Fifth Edition: Canadian (WISC-V^CDN) were investigated. The higher-order model preferred by Wechsler (2014b) contained five group factors but lacked discriminant validity. An alternative bifactor model with four group factors and one general factor,…
Descriptors: Foreign Countries, Children, Intelligence Tests, Test Validity
Ziegler, Laura; Garfield, Joan – Statistics Education Research Journal, 2018
The purpose of this study was to develop the Basic Literacy In Statistics (BLIS) assessment for students in an introductory statistics course, at the postsecondary level, that includes, to some extent, simulation-based methods. The definition of statistical literacy used in the development of the assessment was the ability to read, understand, and…
Descriptors: Statistics, Literacy, Introductory Courses, College Students
Hamby, Tyler – Journal of Psychoeducational Assessment, 2018
In this study, the author examined potential mediators of the negative relationship between the absolute difference in items' lengths and their inter-item correlation size. Fifty-two randomly ordered items from five personality scales were administered to 622 university students, and 46 respondents from a survey website rated the items'…
Descriptors: Correlation, Personality Traits, Undergraduate Students, Difficulty Level
Gencer, Muharrem; Tok, Türkay Nuri; Ordu, Aydan – International Journal of Assessment Tools in Education, 2018
While organizations engage in power struggles with their environment and with other organizations in a globalised world, employees, who are an organization's most important resource, also struggle for power among themselves. To be successful in this power struggle, employees, especially managers, use a number of political games in the…
Descriptors: Foreign Countries, Principals, Power Structure, Measures (Individuals)
Yelpaze, Ismail; Güler, Deniz – International Journal of Assessment Tools in Education, 2018
The aim of this study is to adapt the attitude scale towards asylum seekers and refugees to Turkish culture and to examine the relation of university students' attitudes towards asylum seekers to certain personality traits. The study was conducted with the participation of 340 university students. The attitude scales for both asylum seekers and…
Descriptors: College Students, Student Attitudes, Refugees, Altruism
Joo, Seang-Hwane; Lee, Philseok; Stark, Stephen – Journal of Educational Measurement, 2018
This research derived information functions and proposed new scalar information indices to examine the quality of multidimensional forced choice (MFC) items based on the RANK model. We also explored how GGUM-RANK information, latent trait recovery, and reliability varied across three MFC formats: pairs (two response alternatives), triplets (three…
Descriptors: Item Response Theory, Models, Item Analysis, Reliability
Barbosa, Miguel; Beeghly, Marjorie; Moreira, João; Tronick, Edward; Fuertes, Marina – Developmental Psychology, 2018
This study examined the stability of three patterns of infant regulatory behavior identified in the face-to-face still-face (FFSF) paradigm at 3 and 9 months (social-positive oriented, distressed-inconsolable, and self-comfort oriented) and whether variations in infants' heart rate were correlated with them. Although some studies have examined the…
Descriptors: Infants, Infant Behavior, Behavior Patterns, Age Differences
Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…
Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing
Karairmak, Özlem – International Journal for the Advancement of Counselling, 2018
Counseling self-efficacy is defined as a counselor's beliefs regarding their ability to counsel a client effectively. Larson et al. (1992) developed the Counseling Self-Estimate Inventory (COSE) to determine counselors' self-efficacy in the dimensions of microskills, counseling process, difficult client behavior, cultural competence, and awareness…
Descriptors: Counselors, Self Efficacy, Counselor Qualifications, Test Validity

