Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 45 |
| Since 2017 (last 10 years) | 124 |
| Since 2007 (last 20 years) | 306 |
Descriptor
| Models | 643 |
| Test Validity | 643 |
| Test Reliability | 243 |
| Test Construction | 172 |
| Foreign Countries | 123 |
| Evaluation Methods | 109 |
| Factor Analysis | 103 |
| Psychometrics | 90 |
| Test Items | 83 |
| Higher Education | 68 |
| Questionnaires | 60 |
| More ▼ | |
Source
Author
| Hambleton, Ronald K. | 6 |
| French, Brian F. | 4 |
| Kane, Michael T. | 4 |
| Baker, Eva L. | 3 |
| Bejar, Isaac I. | 3 |
| Goh, Pauline Swee Choo | 3 |
| Hansen, Duncan N. | 3 |
| Huff, Kristen | 3 |
| Miller, M. David | 3 |
| Rock, Donald A. | 3 |
| Wong, Kung-Teck | 3 |
| More ▼ | |
Publication Type
Education Level
Location
| Australia | 12 |
| China | 12 |
| Canada | 9 |
| Malaysia | 8 |
| United Kingdom | 8 |
| Indonesia | 6 |
| Spain | 6 |
| Germany | 5 |
| Netherlands | 5 |
| Texas | 5 |
| Turkey | 5 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Funda Ugurlu; Filiz Evran Acar – Journal of Pedagogical Research, 2025
The aim of this study is to develop a valid and reliable measurement tool to identify teachers' tendencies towards professional development models. In line with the purpose of scale development, a survey model was preferred. The scale was designed to be applicable to teachers from various disciplines currently working in any institution…
Descriptors: Measures (Individuals), Test Reliability, Test Validity, Faculty Development
Ahmet Ayaz; Metin Piskin – Turkish Journal of Education, 2025
This research aims to develop the Career Renewal Power (CRP) Model, which explains career transition processes through the concepts of need perception and need satisfaction, based on Glasser's Choice Theory. To investigate the structure and functioning of the model, two qualitative studies were conducted. Subsequently, the CRP Scale was developed,…
Descriptors: Career Development, Models, Need Gratification, Measures (Individuals)
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Sean N. Weeks; Tyler L. Renshaw; Allysia A. Rainey; Aubrey Hiatt – Journal of Emotional and Behavioral Disorders, 2024
Internalizing and externalizing problems are common targets for school mental health screening. Prior research supports the interpretation of scores from the Youth Internalizing Problems Screener (YIPS) and the Youth Externalizing Problems Screener (YEPS), which were developed separately yet intended as companion measures. We extended previous…
Descriptors: Adolescents, Screening Tests, Behavior Problems, Mental Health
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Joanna Williamson – Cambridge University Press & Assessment, 2023
There is a lot of interest in providing detailed reports to schools indicating which skills pupils have mastered and which still need development -- and, more broadly, the knowledge, skills and understanding that pupils have acquired and not yet acquired. Cognitive diagnostic assessment is an approach designed to provide this kind of insight.…
Descriptors: Intelligence Tests, Diagnostic Tests, Test Construction, Mastery Learning
Subarkah, Edi; Kartowagiran, Badrun; Sumarno; Hamdi, Syukrul; Rahim, Abdul – International Journal of Educational Methodology, 2022
This research aims to develop the product of the life skill education program (LSEP) which is accurate, credible, and effective. This research used the Plomp model. The model covers the input, process, output, outcome and consists of instrument, scoring guidance, and good or bad criteria. The instruments used in the model are the questionnaire,…
Descriptors: Daily Living Skills, Questionnaires, Observation, Test Validity
Peter Elsborg; Paulina S. Melby; Mette Kurtzhals; Helene Kirkegaard; Johannes Carl; Steffen Rask; Peter Bentsen; Glen Nielsen – Measurement in Physical Education and Exercise Science, 2024
This study aimed to develop and test MyPL, a questionnaire that measures self-reported physical literacy (PL) among children and adolescents. First, the item pool was developed and adapted, and face validity was tested with cognitive interviewing. Then, factor structures were identified through multidimensional scaling and exploratory factor…
Descriptors: Questionnaires, Children, Early Adolescents, Models
Yan Jin; Jason Fan – Language Assessment Quarterly, 2023
In language assessment, AI technology has been incorporated in task design, assessment delivery, automated scoring of performance-based tasks, score reporting, and provision of feedback. AI technology is also used for collecting and analyzing performance data in language assessment validation. Research has been conducted to investigate the…
Descriptors: Language Tests, Artificial Intelligence, Computer Assisted Testing, Test Format
Sternberg, Robert J. – Journal of Creative Behavior, 2020
Creativity testing as it is now done is often based on a defective assumption that different kinds of creativity can be compressed into a single unidimensional scale. There is no reason to believe that the different kinds of creativity represent, simply, different amounts of a single unidimensional construct. The article shows how three different…
Descriptors: Creativity Tests, Test Validity, Misconceptions, Models
Xie, Chen; Song, Pingping; Hu, Huimin – Asia-Pacific Education Researcher, 2021
Echoing research interests in recent concepts and models of teacher leadership, the focus of existing scales needs to be updated, and their quality needs to be improved. This study summarizes common ground of influential definitions, models, and frameworks for teacher leadership, proposes a six-factor model (association, professional learning,…
Descriptors: Teacher Leadership, Test Construction, Test Validity, Measures (Individuals)
Chen Zong; Mirian Howland Cummings; Carolyn Haug; Nancy L. Leech – Research in the Schools, 2025
Faculty at institutions of higher education work longer hours and many are burned out and experience low satisfaction and low coping ability. To investigate this phenomenon, this study validated the instrument of Core Self-Evaluations Scale with a sample of higher education faculty members in the U.S., and three different theoretical models (Gu et…
Descriptors: Teacher Attitudes, Beliefs, Self Concept, College Faculty

Peer reviewed
Direct link
