Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Rujun Xu; James Soland – International Journal of Testing, 2024
International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…
Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries
Timothy Lycurgus; Ben B. Hansen – Society for Research on Educational Effectiveness, 2022
Background: Efficacy trials in education often possess a motivating theory of change: how and why should the desired improvement in outcomes occur as a consequence of the intervention? In scenarios with repeated measurements, certain subgroups may be more or less likely to manifest a treatment effect; the theory of change (TOC) provides guidance…
Descriptors: Educational Change, Educational Research, Intervention, Efficiency
Demirtas, Zülfü; Çaçan, Hanifi; Uslukaya, Alper – International Journal of Contemporary Educational Research, 2023
This work is intended to develop a measuring tool for determining teacher perception of informal relationships. The pool of items created by researchers through a literature review has been presented with expert assessment of the validity of the content, face, and meaning, and a draft scale has been created by making necessary revisions to the…
Descriptors: Foreign Countries, Teacher Attitudes, Likert Scales, Test Construction
Maïano, Christophe; Thibault, Isabelle; Dreiskämper, Dennis; Henning, Lena; Tietjens, Maike; Aimé, Annie – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of the French and German versions of the Physical Self-Concept Questionnaire for Elementary School Children-Revised (PSCQ-C-R). A sample of 519 children participated in this study. Of those, 197 were French-Canadian and 322 were German. Results support the factor validity and…
Descriptors: Elementary School Students, Self Concept, Human Body, Questionnaires
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Barrett, Michelle D.; van der Linden, Wim J. – Journal of Educational Measurement, 2017
Linking functions adjust for differences between identifiability restrictions used in different instances of the estimation of item response model parameters. These adjustments are necessary when results from those instances are to be compared. As linking functions are derived from estimated item response model parameters, parameter estimation…
Descriptors: Item Response Theory, Error of Measurement, Programming, Evaluation Methods
Alatli, Betul – International Journal of Assessment Tools in Education, 2020
This study aims to reveal the trends in the related field by examining the researches evaluating the measurement invariance in education and psychology between 2008-2019. Accordingly, 99 articles published in three journals that were selected using the purposive sampling method among the journals indexed on Social Sciences Citation Index (SSCI)…
Descriptors: Educational Research, Social Science Research, Psychology, Journal Articles
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
Curby, Timothy; McKnight, Patrick; Alexander, Lisa; Erchov, Simone – Assessment & Evaluation in Higher Education, 2020
Evaluation of college instructors often centers on course ratings; however, there is little evidence that these ratings only reflect teaching. The purpose of this study was to assess the relative importance of three facets of course ratings: instructor, course and occasion. We sampled 2,459 fully-crossed dyads from a large university where two…
Descriptors: Student Evaluation of Teacher Performance, Course Evaluation, Error of Measurement, Teacher Effectiveness
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Ramadhan, Syahrul; Sumiharsono, Rudy; Mardapi, Djemari; Prasetyo, Zuhdan Kun – International Journal of Instruction, 2020
The analysis of the Test Instruments' quality is a crucial thing needs to be conducted. The test instruments made by teachers must fulfil the requirements (validity, reliability, and standard error of measurement) until the measurement result obtained can describe the students' actual abilities. This research aims to analyse the content validity…
Descriptors: Foreign Countries, Teacher Made Tests, Content Validity, Test Reliability
Priemer, Burkhard; Hellwig, Julia – International Journal of Science and Mathematics Education, 2018
Estimating measurement uncertainties is important for experimental scientific work. However, this is very often neglected in school curricula and teaching practice, even though experimental work is seen as a fundamental part of teaching science. In order to call attention to the relevance of measurement uncertainties, we developed a comprehensive…
Descriptors: Measurement, Error of Measurement, Secondary School Students, Models
Tourangeau, Roger – Quality Assurance in Education: An International Perspective, 2018
Purpose: This paper aims to examine the cognitive processes involved in answering survey questions. It also briefly discusses how the cognitive viewpoint has been challenged by other approaches (such as conversational analysis). Design/methodology/approach: The paper reviews the major components of the response process and summarizes work…
Descriptors: Surveys, Cognitive Processes, Error of Measurement, Accuracy
Mangione, Kathleen K.; Macropol, Kathy; Jia, Yanxia; Tevald, Michael; Harris, Shane; Wolff, Edward; Craik, Rebecca – Measurement in Physical Education and Exercise Science, 2018
Heart rate (HR) by time curves could be useful as a measure of treatment fidelity (TF). The purposes were to describe the frequency of common recording irregularities (e.g. errors) observed during exercise, validate a process to correct those errors, and determine whether there is a clinically meaningful benefit to data correction. In total, 1895…
Descriptors: Exercise, Older Adults, Metabolism, Injuries
Greifer, Noah – ProQuest LLC, 2018
There has been some research in the use of propensity scores in the context of measurement error in the confounding variables; one recommended method is to generate estimates of the mis-measured covariate using a latent variable model, and to use those estimates (i.e., factor scores) in place of the covariate. I describe a simulation study…
Descriptors: Evaluation Methods, Probability, Scores, Statistical Analysis