Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Saux, Gaston; Ros, Christine; Britt, M. Anne; Stadtler, Marc; Burin, Debora I.; Rouet, Jean-François – Discourse Processes: A Multidisciplinary Journal, 2018
In two experiments, undergraduate students read short texts containing two embedded sources that could either agree or disagree with each other. Participants' memory for the sources' identity (i.e., occupation) and features (i.e., the source's access to knowledge and the source's physical appearance) was examined as a function of the consistency…
Descriptors: Recall (Psychology), Reading, Undergraduate Students, Information Sources
Chrismas, Bryna; Taylor, Lee; Smith, Alexander; Pemberton, Philip; Siegler, Jason Charles; Midgley, Adrian Wayne – Measurement in Physical Education and Exercise Science, 2018
To examine the reproducibility of three measurement techniques used to determine creatine kinase, interleukin-6 and high-sensitivity C-reactive protein, 50 participants had blood samples taken on two occasions. Fingertip plasma samples were analysed using the Reflotron for CK determination. Venous blood samples collected into serum separator tubes…
Descriptors: Measurement Techniques, Reliability, Biochemistry, Correlation
Looney, Marilyn A. – Measurement in Physical Education and Exercise Science, 2018
The purpose of this article was two-fold (1) provide an overview of the commonly reported and under-reported absolute agreement indices in the kinesiology literature for continuous data; and (2) present examples of these indices for hypothetical data along with recommendations for future use. It is recommended that three types of information be…
Descriptors: Interrater Reliability, Evaluation Methods, Kinetics, Indexes
Akbay, Lokman; Kilinç, Mustafa – International Journal of Assessment Tools in Education, 2018
Measurement models need to properly delineate the real aspect of examinees' response processes for measurement accuracy purposes. To avoid invalid inferences, fit of examinees' response data to the model is studied through "person-fit" statistics. Misfit between the examinee response data and measurement model may be due to invalid…
Descriptors: Reliability, Goodness of Fit, Cognitive Measurement, Models
Tang, Xiaodan; Yin, Yue; Lin, Qiao; Hadad, Roxana – AERA Online Paper Repository, 2018
Computational thinking (CT) has been recognized as an essential part of every child's education (Wing, 2006) and Bebras contest items have been frequently used to measure CT. Think alouds has emerged as a prominent method for identifying thought processes and correcting problems with assessments. As little research has examined validity evidence…
Descriptors: Computation, Thinking Skills, Measures (Individuals), Psychometrics
Center on Standards and Assessments Implementation, 2018
Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…
Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias
Benton, Tom – Cambridge Assessment, 2018
One of the questions with the longest history in educational assessment is whether it is possible to increase the reliability of a test simply by altering the way in which scores on individual test items are combined to make the overall test score. Most usually, the score available on each item is communicated to the candidate within a question…
Descriptors: Test Items, Scoring, Predictive Validity, Test Reliability
Kaldo, Indrek; Õun, Kandela – Problems of Education in the 21st Century, 2019
This research reports learning strategies of the first-year Estonian university students in mathematics. The data were collected during two years from 440 university students of different disciplines. The respondents were among students who take at least one compulsory mathematics course during their first study year. The participants filled out a…
Descriptors: Factor Structure, Learning Strategies, Foreign Countries, College Freshmen
Wright, Jason Leonard; Caldarella, Paul; Sudweeks, Richard R.; Anderson, Darlene H.; Heath, Melissa A.; Williams, Leslie – Education, 2019
Social validity focuses on a program's goals, procedures, and outcome, which helps determine if an intervention is socially acceptable and valued. This study represents a conceptual replication of the investigation by Lane and colleagues (2009) regarding the psychometric properties of the Primary Intervention Rating Scale (PIRS), a teacher survey…
Descriptors: Psychometrics, Rating Scales, Positive Behavior Supports, Factor Analysis
Saraiva, Renan Benigno; van Boeijen, Inger Mathilde; Hope, Lorraine; Horselenberg, Robert; Sauerland, Melanie; van Koppen, Peter J. – Applied Cognitive Psychology, 2019
Metamemory can be defined as the knowledge about one's memory capabilities and about strategies that can aid memory. In this paper, we describe the development and validation of the Eyewitness Metamemory Scale (EMS), tailored specifically for use in face memory and eyewitness identification settings. Participants (N = 800) completed the EMS and…
Descriptors: Metacognition, Memory, Recognition (Psychology), Human Body
Grapin, Sally L.; Benson, Nicholas F. – Contemporary School Psychology, 2019
The Every Student Succeeds Act (ESSA) aims to ensure that all students are college- and career-ready by requiring all schools to implement high-quality accountability systems and services for students. The ESSA impacts assessment practices in schools by requiring staff to account for a broader range of variables related to student well-being,…
Descriptors: Educational Legislation, Federal Legislation, Elementary Secondary Education, Evaluation Methods
Ghanem, Bassam O.; Awwad, Ferial M. Abu – International Education Studies, 2019
The goal of this study is to determine the degree of leadership skills practiced by principals at UNRWA schools. In order to achieve this goal a questionnaire consisting of 56 items was developed, which included two domains: Administrative and Technical skills, and personal and social skills. This questionnaire had been verified in terms of its…
Descriptors: Principals, Leadership Qualities, Teacher Administrator Relationship, Teacher Attitudes
Cowan, John – Active Learning in Higher Education, 2019
This article presents the case for the use of the 'think-aloud protocol' by teachers who engage in action-research as a source of constructive information about their students' cognitive learning processes. This method calls upon learners to talk their thoughts out aloud, during engagement in some learning activity regarding which the researching…
Descriptors: Protocol Analysis, Educational Research, Action Research, Learning Processes
Sideridis, Georgios D.; Tsaousis, Ioannis; Al-Sadaawi, Abdullah – Educational and Psychological Measurement, 2019
The purpose of the present study was to apply the methodology developed by Raykov on modeling item-specific variance for the measurement of internal consistency reliability with longitudinal data. Participants were a randomly selected sample of 500 individuals who took on a professional qualifications test in Saudi Arabia over four different…
Descriptors: Test Reliability, Test Items, Longitudinal Studies, Foreign Countries
Renshaw, Tyler L.; Cook, Clayton R. – Journal of Psychoeducational Assessment, 2019
This brief report presents preliminary psychometrics of responses to the Youth Externalizing Problems Screener (YEPS), which is a 10-item self-report rating scale intended for use as a screening instrument. The YEPS was designed to function as a companion measure to the Youth Internalizing Problems Screener (YIPS), facilitating the screening of…
Descriptors: Screening Tests, Psychometrics, Rating Scales, High School Students

Peer reviewed
Direct link
