Publication Date
In 2025 | 2 |
Since 2024 | 10 |
Since 2021 (last 5 years) | 34 |
Since 2016 (last 10 years) | 103 |
Since 2006 (last 20 years) | 205 |
Descriptor
Predictive Validity | 466 |
Test Reliability | 466 |
Test Validity | 297 |
Test Construction | 78 |
Foreign Countries | 71 |
Factor Analysis | 65 |
Psychometrics | 63 |
Correlation | 60 |
Screening Tests | 60 |
Predictive Measurement | 49 |
Measures (Individuals) | 47 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
Turkey | 8 |
Netherlands | 7 |
Canada | 6 |
Australia | 5 |
China | 5 |
United Kingdom | 5 |
Florida | 4 |
Taiwan | 4 |
California | 3 |
Indiana | 3 |
New York | 3 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Bilingual Education Act 1968 | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024
The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…
Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures
Canan Keles Ertürk; Kezban Tepeli – International Journal of Assessment Tools in Education, 2024
This study aims to conduct the Turkish adaptation, validity, and reliability study of the Theory of Mind Inventory-2 (TOMI-2) developed by Hutchins and Prelock (2016) for 3-5-year-old children. The study group consists of 310 mothers with children in the 3-5 age group in Konya city center. Personal Information Form and Theory of Mind Inventory-2…
Descriptors: Foreign Countries, Theory of Mind, Measures (Individuals), Test Validity
Juan Li; Jingyao Wang; Bowen Xiao; Yan Li; Hui Li – Early Education and Development, 2024
"Research Findings": The Problematic Media Use Measure (PMUM) assesses young children's excessive or problematic media use. The current study aims to translate the original English version of PMUM into a Chinese version and validate it with Chinese preschoolers (ages 3 to 6). The instrument was translated, back-translated, pretested, and…
Descriptors: Translation, Test Validity, Chinese, Measures (Individuals)
Richmond, Aaron S. – College Teaching, 2022
I created the 15-item Learner-Centered Syllabus Scale (LCSS) based on the work of Cullen and Harris. The purpose of this study was to assess the factor structure, reliability, and validity of the LCSS. Four blind coders assessed 175 syllabi using the LCSS with 92% inter-rater agreement. To establish concurrent validity, each blind coder rated the…
Descriptors: Student Centered Learning, Course Descriptions, Measures (Individuals), Test Reliability
Schrodt, Katie; FitzPatrick, Erin; Brown, Megan; Hover, Ashlee – Reading & Writing Quarterly, 2023
Motivation impacts student academic performance. A performance task to directly assess writing motivation in young children is needed. The purpose of this investigation was to evaluate the validity of the Writing Challenge Task (WCT), a task-oriented assessment created to measure writing motivation with 106 kindergarten students in the rural…
Descriptors: Writing Assignments, Evaluation Methods, Student Motivation, Kindergarten
Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024
Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…
Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students
Lee, Ji Young; Sung, Jihyun – Early Education and Development, 2022
This study aims to validate a Korean version of the Brief Version of the Child Abuse Potential Inventory (BCAP-K) for use with childcare providers in South Korea. By employing a stratified sampling method, 808 childcare providers in charge of infants' classes were selected for participation. Participants completed a questionnaire that included…
Descriptors: Child Abuse, Test Validity, Test Reliability, Measures (Individuals)
Sudina, Ekaterina; Vernon, Tony; Foster, Henry; Del Villano, Heather; Hernandez, Shoshannah; Beck, Daniel; Plonsky, Luke – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2021
Grit--"perseverance and passion for long-term goals" (Duckworth, Peterson, Matthews, & Kelly, 2007, p. 1087)--has attracted the attention of researchers in fields ranging from psychology to business to education (e.g., Robertson-Kraft & Duckworth, 2014; Robins, 2019). Continuing the line of research that explores the domain…
Descriptors: Test Construction, Test Validity, English (Second Language), Language Teachers
Fraser, Barry J.; McLure, Felicity I.; Koul, Rekha B. – Learning Environments Research, 2021
In an attempt to engage more students in Science, Technology, Engineering and Mathematics (STEM) subjects, schools are encouraged by STEM educators and professionals to introduce students to STEM through projects which integrate skills from each of the STEM disciplines. Because little is known about the learning environment of STEM classrooms, we…
Descriptors: Classroom Environment, Psychological Patterns, STEM Education, Test Construction
Yongtian Cheng; K. V. Petrides – Educational and Psychological Measurement, 2025
Psychologists are emphasizing the importance of predictive conclusions. Machine learning methods, such as supervised neural networks, have been used in psychological studies as they naturally fit prediction tasks. However, we are concerned about whether neural networks fitted with random datasets (i.e., datasets where there is no relationship…
Descriptors: Psychological Studies, Artificial Intelligence, Cognitive Processes, Predictive Validity
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Gary A. Troia; Frank R. Lawrence; Julie S. Brehmer; Kaitlin Glause; Heather L. Reichmuth – Grantee Submission, 2023
Much of the research that has examined the writing knowledge of school-age students has relied on interviews to ascertain this information, which is problematic because interviews may underestimate breadth and depth of writing knowledge, require lengthy interactions with participants, and do not permit a direct evaluation of a prescribed array of…
Descriptors: Writing Tests, Writing Evaluation, Knowledge Level, Elementary School Students
Sutherland, Marah; Clarke, Ben; Nese, Joseph F. T.; Cary, Mari Strand; Shanley, Lina; Furjanic, David; Durán, Lillian – Grantee Submission, 2020
Drawing from the developmental and cognitive mathematics literature, the purpose of this study was to investigate the reliability, validity, and diagnostic utility of a widely-researched number line task in kindergarten. Specifically, the Number Line Assessment 0-100 (NLA 0-100) as compared to an established kindergarten screening measure was…
Descriptors: Mathematics Tests, Screening Tests, Test Reliability, Test Validity