Publication Date
In 2025 | 2 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 32 |
Since 2006 (last 20 years) | 71 |
Descriptor
Data Analysis | 172 |
Test Reliability | 172 |
Test Validity | 86 |
Test Construction | 51 |
Data Collection | 33 |
Evaluation Methods | 30 |
Research Methodology | 26 |
Foreign Countries | 24 |
Tables (Data) | 24 |
Comparative Analysis | 23 |
Measurement Techniques | 23 |
More ▼ |
Source
Author
Green, Donald Ross | 2 |
Hines, Constance V. | 2 |
Ho, Andrew D. | 2 |
Mandeville, Garrett K. | 2 |
Reardon, Sean F. | 2 |
Rowe, Wayne | 2 |
Stallings, Jane A. | 2 |
Walker, Debbie Klein | 2 |
Acar, Selcuk | 1 |
Adams, David R. | 1 |
Aiken, Lewis R. | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 4 |
Practitioners | 3 |
Community | 1 |
Counselors | 1 |
Teachers | 1 |
Location
Turkey | 4 |
Australia | 3 |
Spain | 3 |
United Kingdom (England) | 3 |
United States | 3 |
California | 2 |
Denmark | 2 |
Ohio | 2 |
Taiwan | 2 |
Asia | 1 |
Brazil | 1 |
More ▼ |
Laws, Policies, & Programs
Bilingual Education Act 1968 | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Minju Hong – ProQuest LLC, 2022
Reliability indicates the internal consistency of a test. In educational studies, reliability is a key feature for a test. Researchers have proposed many traditional reliability estimates, such as coefficient alpha and coefficient omega. However, traditional reliability indices do not deal with the data hierarchy, even though the multilevel…
Descriptors: Hierarchical Linear Modeling, Factor Analysis, Factor Structure, Test Reliability
Tapprich, William E.; Reichart, Letitia; Simon, Dawn M.; Duncan, Garry; McClung, William; Grandgenett, Neal; Pauley, Mark A. – Biochemistry and Molecular Biology Education, 2021
The lack of an instructional definition of bioinformatics delays its effective integration into biology coursework. Using an iterative process, our team of biologists, a mathematician/computer scientist, and a bioinformatician together with an educational evaluation and assessment specialist, developed an instructional definition of the…
Descriptors: Scoring Rubrics, Definitions, Genetics, Biology
Philip E. Kearney; Niamh Curran; Frank J. Nugent – Journal of Motor Learning and Development, 2025
Manipulation checks are an essential component of quality experimental design in motor learning. Guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses framework, this methodological systematic review examined the utilization of manipulation checks in focus of attention research. Seventy-eight protocols from four…
Descriptors: Attention Control, Attention Span, Motor Development, Psychomotor Skills
Grajzel, Katalin; Dumas, Denis; Acar, Selcuk – Journal of Creative Behavior, 2022
One of the best-known and most frequently used measures of creative idea generation is the Torrance Test of Creative Thinking (TTCT). The TTCT Verbal, assessing verbal ideation, contains two forms created to be used interchangeably by researchers and practitioners. However, the parallel forms reliability of the two versions of the TTCT Verbal has…
Descriptors: Test Reliability, Creative Thinking, Creativity Tests, Verbal Ability
Sickler, Jessica; Bardar, Erin; Kochevar, Randy – Journal of College Science Teaching, 2021
Data literacy, or students' abilities to understand, interpret, and think critically about data, is an increasing need in K-16 science education. Ocean Tracks College Edition (OTCE) sought to address this need by creating a set of learning modules that engage students in using large-scale, professionally collected animal migration and physical…
Descriptors: Information Literacy, Data Analysis, Undergraduate Students, Scoring Rubrics
Isolda Margarita Castillo-Martínez; Davis Velarde-Camaqui; María Soledad Ramírez-Montoya; Jorge Sanabria-Z – Journal of Social Studies Education Research, 2024
Reasoning for complexity is a fundamental competency in these complex times for solutions to social problems and decision-making. The purpose of this paper is to demonstrate the validity and reliability of the eComplexity instrument by presenting its psychometric properties. The instrument consists of a Likert-type scale questionnaire designed to…
Descriptors: Psychometrics, Test Validity, Test Reliability, Difficulty Level
Öz, Serap; Özdemir, Ali – International Journal of Contemporary Educational Research, 2022
The purpose of this study is to develop a valid and reliable Likert-type scale that can be used to measure the data literacy skills of educators. In the development process of the scale, after reviewing the relevant literature, a pool of 130 items was designed and presented to the experts for their view. After the evaluation of experts, the…
Descriptors: Likert Scales, Test Construction, Construct Validity, Test Reliability
Fürst, Guillaume – Journal of Creative Behavior, 2020
This paper introduces a method for the assessment of creativity that relies on creativity tasks, a subjective evaluation procedure, and a planned missing data design that offers a drastic reduction in the overall implementation costs (administration time and scoring procedure). This method was tested on a sample of 149 people, using three…
Descriptors: Creativity, Creativity Tests, Task Analysis, Creative Thinking
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Elturki, Eman – English Teaching Forum, 2020
Accrediting agencies for English language programs, such as the Commission on English Language Program Accreditation (CEA), require a plan in writing for monitoring and reviewing assessment practices. Nonetheless, web-search queries such as "assessing assessment," "how to assess assessment," "assessing assessment…
Descriptors: College Second Language Programs, English (Second Language), Student Evaluation, Test Reliability
Sun, Xiao-Ming – Journal of Speech, Language, and Hearing Research, 2016
Purpose: The purpose of this study was to present normative data of tympanometric measurements of wideband acoustic immittance and to characterize wideband tympanograms. Method: Data were collected in 84 young adults with strictly defined normal hearing and middle ear status. Energy absorbance (EA) was measured using clicks for 1/12-octave…
Descriptors: Acoustics, Test Reliability, Adults, Hearing (Physiology)
Stirk, Steven; Field, Bryony; Black, Jessica – Journal of Applied Research in Intellectual Disabilities, 2018
Background: The Learning Disability Screening Questionnaire (LDSQ) has been shown to have high sensitivity and specificity to identify those who are likely to meet intellectual disability diagnostic criteria (McKenzie, et al. [McKenzie K., 2015]). However, there is no independent research to date to support these findings. Materials and Methods:…
Descriptors: Learning Disabilities, Questionnaires, Screening Tests, Diagnostic Tests
Vaske, Jerry J. – Sagamore-Venture, 2019
Data collected from surveys can result in hundreds of variables and thousands of respondents. This implies that time and energy must be devoted to (a) carefully entering the data into a database, (b) running preliminary analyses to identify any problems (e.g., missing data, potential outliers), (c) checking the reliability and validity of the…
Descriptors: Surveys, Theories, Hypothesis Testing, Effect Size
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability