Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
Erdemir, Mustafa; Akyuz, Halil Ibrahim – Journal on School Educational Technology, 2020
The purpose of this study is to reduce ethics violations such as tricking and cheating that may occur in the offline assessment of undergraduate level Physics-II (Electricity) course subjects. The study is significant for the reliable and ethical evaluation of the Internet and computer-based educational process. Thirty-eight pre-service teachers…
Descriptors: Test Reliability, Ethics, Cheating, Undergraduate Students
Ruben Trigueros; Alejandro García-Mas – British Journal of Educational Psychology, 2025
Introduction: In recent years, the incorporation of novelty as a psychological need and the study of the frustration of needs have become a recurring theme in the research on psychological needs in the educational environment. Currently, there are two scales available to assess the frustration of basic psychological needs (FBN) in the context of…
Descriptors: Psychological Patterns, Well Being, Resilience (Psychology), Self Determination
Abdullah Alamer; Ahmed Al Khateeb; Abdulrahman Alshabeb – Language Assessment Quarterly, 2025
This study introduces the first Arabic Vocabulary Levels Test (Arabic-VLT), created for foreign learners of Arabic. We present compelling evidence to substantiate its validity and reliability. The Arabic-VLT was developed according to five levels, beginning with the most frequently used words (Level 1) to the least frequently used ones (Level 5),…
Descriptors: Arabic, Vocabulary Development, Test Construction, Second Language Learning
Lijun Shen; Zitsi Mirakhur; Sarah LaCour – Computer Science Education, 2025
Background and Context: Educators and researchers are interested in building the computational thinking (CT) skills of K-12 students. However, the availability of language-agnostic assessments for lower elementary graders remains limited. Objective: We present preliminary insights into the reliability and validity of the Computational Thinking…
Descriptors: Thinking Skills, Gender Differences, Computer Science Education, Elementary School Students
Tao Guo; Tianxin Li; Zhanyong Qi – European Journal of Education, 2025
This study examined the relationship between school service quality and student learning satisfaction in public and private high schools in China, considering the influence of students' socioeconomic background and household registration location. A comparative study was conducted using a questionnaire administered to 22,588 students in 20…
Descriptors: Foreign Countries, High School Students, High Schools, Public Schools
Emily B. Goldberg; Sheila R. Pratt; Malcolm R. McNeil; Neil Szuminsky; Kenneth DeHaan; Leslie Q. Zhen – Journal of Speech, Language, and Hearing Research, 2025
Purpose: The present study assessed the test-retest reliability of the American Sign Language (ASL) version of the Computerized Revised Token Test (CRTT-ASL) and compared the differences and similarities between ASL and English reading by Deaf and hearing users of ASL. Method: Creation of the CRTT-ASL involved filming, editing, and validating CRTT…
Descriptors: American Sign Language, Reliability, Validity, Test Construction
Julie Shi; Mike Nason; Marco Tullney; Juan Pablo Alperin – College & Research Libraries, 2025
Metadata are crucial for discovery and access by providing contextual, technical, and administrative information in a standard form. Yet metadata are also sites of tension between sociocultural representations, resource constraints, and standardized systems. Formal and informal interventions may be interpreted as quality issues, political acts to…
Descriptors: Metadata, Quality Control, Problems, Cross Cultural Studies
Min-Ying Tsai – SAGE Open, 2025
The study aimed to verify the psychometric properties of the emotional-style scale and explore different clusters of emotional styles. An emotional style scale was completed in three districts of western Taiwan using stratified random sampling. The study confirmed the reliability and convergent validity of each subscale in a sample of 712…
Descriptors: Foreign Countries, Psychometrics, Classification, Psychological Patterns
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Ahmad Goodarzi; Afsheen Rezai – Education and Information Technologies, 2025
Although examining teachers' various areas of knowledge in classroom practices has attracted considerable attention during recent decade, a short multidimensional questionnaire to assess the interactions of the knowledge domains with teachers' technological knowledge is unavailable in the literature. Thus, this research presents a self-assessment…
Descriptors: Technological Literacy, Pedagogical Content Knowledge, Language Teachers, English (Second Language)
Anatri Desstya; Ika Candra Sayekti; Muhammad Abduh; Sukartono – Journal of Turkish Science Education, 2025
This study aimed to develop a standardised instrument for diagnosing science misconceptions in primary school children. Following a developmental research approach using the 4-D model (Define, Design, Develop, Disseminate), 100 four-tier multiple choice items were constructed. Content validity was established through expert evaluation by six…
Descriptors: Test Construction, Science Tests, Science Instruction, Diagnostic Tests
Fabiola Ndayiragije; Arcade Nduwimana; Elvis Nizigama – Journal of English Teaching, 2025
The present study was undertaken to investigate the use of contextual situations in teaching English tenses to university students. To achieve this objective, the study examined (1) whether there was a statistically significant difference between students' scores on the pre-test and post-test on present tenses based on contextual situations, and…
Descriptors: Morphemes, Second Language Learning, Second Language Instruction, Teaching Methods
Derya Kaltakci-Gurel – Physical Review Physics Education Research, 2025
This research investigated how freshman students' epistemological beliefs in physics are impacted by gender and academic performance in a general physics course. Data were collected from 1220 university freshman students from 22 different programs in Türkiye. In this causal-comparative research, the well-known Colorado Learning Attitudes about…
Descriptors: College Freshmen, Gender Differences, Physics, Science Instruction

Peer reviewed
Direct link
