Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Jensen, Todd M.; Brigham, Rebecca B.; Rosenfeld, Larry J. – Journal of Social Work Education, 2019
There is a dearth of research on the evaluation of the psychometric performance of instruments designed to measure students' generalist-level social work competencies. There is also uncertainty on the performance of various response option formats used to measure students' competencies in assessment instruments. Using a sample of 198 master of…
Descriptors: Social Work, Job Skills, Masters Programs, Graduate Students
Abdelsamea, Mohammed Abdelhady; Bart, William – International Journal of Teaching and Learning in Higher Education, 2019
Although there is a robust body of research that has addressed the psychometric properties of the Learning and Study Strategies Inventory (LASSI) in different populations, no study has yet investigated the factor structure and congeneric reliability of the Arabic version of the Learning and Study Strategies Inventory, 2nd edition (LASSI-II) among…
Descriptors: Semitic Languages, Undergraduate Students, Factor Analysis, Factor Structure
Østerlie, Ove; Løhre, Audhild; Haugan, Gørill – Scandinavian Journal of Educational Research, 2019
One of the main aims of the school subject physical education (PE) is to promote a lifelong healthy lifestyle. The expectancy-value theory represents an essential theoretical perspective to examine and understand adolescents' learning and motivation in PE. Based on this theory, the Expectancy-Value Questionnaire (EVQ) measures students'…
Descriptors: Physical Education, Student Attitudes, Construct Validity, Foreign Countries
Briggs, Derek C.; Alzen, Jessica L. – Educational and Psychological Measurement, 2019
Observation protocol scores are commonly used as status measures to support inferences about teacher practices. When multiple observations are collected for the same teacher over the course of a year, some portion of a teacher's score on each occasion may be attributable to the rater, lesson, and the time of year of the observation. All three of…
Descriptors: Observation, Inferences, Generalizability Theory, Scores
Kleijn, Suzanne; Pander Maat, Henk; Sanders, Ted – Language Testing, 2019
Although there are many methods available for assessing text comprehension, the cloze test is not widely acknowledged as one of them. Critiques on cloze testing center on its supposedly limited ability to measure comprehension beyond the sentence. However, these critiques do not hold for all types of cloze tests; the particular configuration of a…
Descriptors: Cloze Procedure, Language Tests, Semantics, Scoring
Wesolowski, Brian C. – Journal of Educational Measurement, 2019
The purpose of this study was to build a Random Forest supervised machine learning model in order to predict musical rater-type classifications based upon a Rasch analysis of raters' differential severity/leniency related to item use. Raw scores (N = 1,704) from 142 raters across nine high school solo and ensemble festivals (grades 9-12) were…
Descriptors: Item Response Theory, Prediction, Classification, Artificial Intelligence
Shavelson, Richard J.; Zlatkin-Troitschanskaia, Olga; Beck, Klaus; Schmidt, Susanne; Marino, Julian P. – International Journal of Testing, 2019
Following employers' criticisms and recent societal developments, policymakers and educators have called for students to develop a range of generic skills such as critical thinking ("twenty-first century skills"). So far, such skills have typically been assessed by student self-reports or with multiple-choice tests. An alternative…
Descriptors: Critical Thinking, Cognitive Tests, Performance Based Assessment, Student Evaluation
Niksadat, Negin; Rakhshanderou, Sakineh; Negarandeh, Reza; Ramezankhani, Ali; Vasheghani Farahani, Ali; Ghaffari, Mohtasham – American Journal of Health Education, 2019
Background: The existing literature supports the application of the principles of andragogy on patient education. But there is a lack of a suitable tool for assessing patient education's conformity with these principles. Purpose: This study was conducted to develop and evaluate the psychometric properties of a questionnaire that measures the…
Descriptors: Test Construction, Questionnaires, Psychometrics, Andragogy
Borowiec, Katrina; Castle, Courtney – Practical Assessment, Research & Evaluation, 2019
Rater cognition or "think-aloud" studies have historically been used to enhance rater accuracy and consistency in writing and language assessments. As assessments are developed for new, complex constructs from the "Next Generation Science Standards (NGSS)," the present study illustrates the utility of extending…
Descriptors: Evaluators, Scoring, Scoring Rubrics, Protocol Analysis
Esendemir, Ozan; Bindak, Recep – International Journal of Educational Methodology, 2019
"Mathematical knowledge for teaching" is a concept indicating the requirement for a specific kind of knowledge required to teach mathematics. Mathematical knowledge for teaching necessitates a more complex structure than what is required to carry out mathematical tasks and the knowledge to do that. The purpose of this study is to realize…
Descriptors: Knowledge Base for Teaching, Mathematics Instruction, Knowledge Level, Geometry
Al-Hoorie, Ali H.; Vitta, Joseph P. – Language Teaching Research, 2019
This report presents a review of the statistical practices of 30 journals representative of the second language field. A review of 150 articles showed a number of prevalent statistical violations including incomplete reporting of reliability, validity, non-significant results, effect sizes, and assumption checks as well as making inferences from…
Descriptors: Periodicals, Second Language Learning, Second Language Instruction, Reliability
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Assessment for Effective Intervention, 2019
This study describes the development and initial psychometric evaluation of the Recognizing Effective Special Education Teachers (RESET) observation instrument. The study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of performance levels,…
Descriptors: Teacher Evaluation, Special Education Teachers, Scoring Rubrics, Observation
Neumann, Michelle M.; Worrall, Sheena; Neumann, David L. – Journal of Research on Technology in Education, 2019
Touch-screen tablets are used in the classroom for assessment. Little is known about the psychometric properties of tablet-based assessments. This study examined the validity and reliability of an expressive and receptive assessment app designed to measure literacy skills. Children (N = 45; 3-5 years) completed the app assessments for alphabet and…
Descriptors: Handheld Devices, Emergent Literacy, Computer Assisted Testing, Test Validity
Alfrey, Laura; O'Connor, Justen; Phillipson, Sivanes; Penney, Dawn; Jeanes, Ruth; Phillipson, Shane – European Physical Education Review, 2019
Healthism is both an ideological and a regulative discourse that manifests as a tendency to conceive health as a product of individual choice. Healthism represents a collection of taken-for-granted assumptions, positioned at the intersection of morality, blame and health, that can lead to a privileging of 'healthy' and 'productive' individuals. It…
Descriptors: Preservice Teachers, Physical Education Teachers, Student Attitudes, Test Construction
Chalmers, Kerry A.; Freeman, Emily E. – Journal of Psychoeducational Assessment, 2019
Low working memory (WM) capacity has been linked to poor academic performance and problem behavior. Availability of easy-to-administer screening tests would facilitate early detection of WM deficits. This study investigated the psychometric properties of the Working Memory Power Test for Children (WMPT) in 170 Australian schoolchildren (8½-11…
Descriptors: Short Term Memory, Academic Achievement, Behavior Problems, Correlation

Peer reviewed
Direct link
