Osman Birgin; Elif Seval Peker – Psychology in the Schools, 2025
The aim of this study was to develop an instrument for assessing sixth-grade students' number sense skills in fractions and decimals. This study was conducted on 452 sixth graders (10-11 years old) from the western region of Turkey. The construct validity of the number sense test (NST) was examined via exploratory factor analysis (EFA) and…
Descriptors: Foreign Countries, Grade 6, Test Construction, Mathematics Education
Haixiang Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Mediation analysis is an important statistical tool in many research fields, where the joint significance test is widely utilized for examining mediation effects. Nevertheless, the limitation of this mediation testing method stems from its conservative Type I error, which reduces its statistical power and imposes certain constraints on its…
Descriptors: Structural Equation Models, Statistical Significance, Robustness (Statistics), Comparative Testing
Ioannis Vourletsis; Panagiotis Politis – Education and Information Technologies, 2025
Computational thinking (CT) is regarded as a valuable skill set for the students of the 21st century, fostering problem-solving skills applicable to academic disciplines and everyday problems. Assessing CT involves evaluating the development of its concepts, practices, and perspectives. However, establishing comprehensive and validated assessments…
Descriptors: Computation, Thinking Skills, Elementary School Students, Translation
Montserrat Cubillos; Mónica Zegers; Himilcon Inciarte – Reading Research Quarterly, 2025
This study aimed to design and validate the Teacher-Reported Reading Engagement Survey (TRRES) to complement self-reported measures and comprehensively assess reading engagement among adolescents. Drawing insights from literature and expert feedback, a new 10-item Likert scale instrument was created, capturing three facets of reading engagement:…
Descriptors: Adolescents, Reading Attitudes, Test Construction, Test Validity
Haokun Liu – International Journal of Multilingualism, 2025
Countries and regions from east to west, such as Hong Kong, Macao, Taiwan, Singapore, the United Kingdom, and the United States, have incorporated language questions in their censuses. Assessing the advantages and disadvantages of such designs is crucial for academic investigation. Despite ongoing discussions, there is a noticeable…
Descriptors: Language Usage, Demography, Surveys, Questionnaires
Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025
This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…
Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics
Laura M. Crothers; Taylor Steeves; Jered B. Kolbert; James B. Schreiber; Ara J. Schmitt; Brianna Drischler; Kelly Paulson; Jessica Cowley; Amelia Klass; Athena Vafiadis; Kayla Perfetto – Contemporary School Psychology, 2025
In this exploratory study, we adapted items from a previously developed measure of job satisfaction, the Measure of Job Satisfaction (MJS), an instrument first developed for use with community nurses in the UK, to create a brief, 15-item instrument (Job Satisfaction--Brief) applicable to practitioners of school psychology from Pennsylvania (N =…
Descriptors: Job Satisfaction, School Psychology, School Psychologists, Factor Structure
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Mehmet Emin Ören; Servet Atik – International Journal of Assessment Tools in Education, 2025
This study aimed to adapt the DigiFuehr 2.0 Scale developed by Claassen et al. (2023) to Turkish and to conduct validity and reliability studies on three groups of teachers. Exploratory and confirmatory factor analyses were performed in line with translation study, linguistic application, and…
Descriptors: Test Reliability, Test Validity, Test Construction, Translation
Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
Joseph F. Mirabelli; Eileen M. Johnson; Sara R. Vohra; Jeanne L. Sanders; Karin J. Jensen – International Journal of STEM Education, 2025
Background: Undergraduate engineering students report increased rates of mental health distress. Evidence suggests that these students experience high stress, which can perpetuate mental health challenges. Further, engineering students may engage in help-seeking and self-care activities more rarely than students in other disciplines. We…
Descriptors: Undergraduate Students, Engineering Education, Mental Health, Stress Variables
Ed Harris; Katherine Curry; Jentre Olsen; Ashlyn Fiegener; Jam Khojasteh – Current Issues in Education, 2025
Wide agreement exists about the value and power of learning in social contexts, and social influences on learning have been studied from multiple perspectives. However, before this study, no known measure of the value of learning that happens in social spaces had been developed. This study introduces a scale to measure value created through…
Descriptors: Foreign Countries, Professional Development, Communities of Practice, Test Validity
Richard G. Kunkel – Journal of Legal Studies Education, 2024
For many professors, testing is primarily a tool for assessing the learning of students. However, research into the "testing effect" has established the value of testing also as a learning tool, not just as an assessment tool. This article provides an overview of this research and also of my own experiences in using a variety of testing…
Descriptors: Testing, Test Construction, College Students, Student Evaluation
Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024
Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…
Descriptors: Item Response Theory, Sample Size, Models, Classification
Turhan, A.; Roest, J. J.; Delforterie, M. J.; Van der Helm, G. H. P.; Neimeijer, E. G.; Didden, R. – Journal of Applied Research in Intellectual Disabilities, 2024
Background: In secure residential facilities, group climate perceptions of clients with mild intellectual disability or borderline intellectual functioning are systematically assessed for quality improvement. A valid and reliable measure may ensure that this process is consistent. The Group Climate Inventory--Revised (GCI-R) is a new measure to…
Descriptors: Psychometrics, Adults, Test Construction, Mild Intellectual Disability