Nazli Uygun Emil – ProQuest LLC, 2020
Validity of a measurement refers to appropriate test score meanings, uses, and interpretations (Messick, 1989; Kane, 1992). There are different approaches to validity: an evidentiary aspect of validity is one requiring gathering statistical evidence to evaluate test score meaning. A common approach to validation is comparisons of test score equity…
Descriptors: Educational Quality, Mathematics Tests, Test Validity, Test Reliability
Jean-Yves Bégin; Luc Touchette; Caroline Couture; Cassandre Blais – International Journal of Nurture in Education, 2020
The Boxall Profile provides a framework for the structured observation of children in nurture groups. It is a detailed and rigorously trialled normative diagnostic instrument developed for teachers and teaching assistants to measure children's levels of emotional and behavioural functioning. Moreover, it highlights specific targets for…
Descriptors: Psychometrics, French, Observation, Children
Development and Application of the Social Justice Teacher Leader Self-Assessment (SJTLSA) Instrument
Smith, Cathryn – International Journal of Leadership in Education, 2023
This article describes the processes employed in developing the Social Justice Teacher Leader Self-Assessment (SJTLSA), a tool designed to foster teacher leader self-reflection, stimulate collegial dialogue, assess school culture, and direct social justice initiatives. Tool development procedures included examining precedents, developing a…
Descriptors: Social Justice, Teacher Leadership, Self Evaluation (Individuals), Measures (Individuals)
Whitaker, Douglas; Barss, Joseph; Drew, Bailey – Online Submission, 2022
Challenges to measuring students' attitudes toward statistics remain despite decades of focused research. Measuring the expectancy-value theory (EVT) Cost construct has been especially challenging owing in part to the historical lack of research about it. To measure the EVT Cost construct better, this study asked university students to respond to…
Descriptors: Statistics Education, College Students, Student Attitudes, Likert Scales
Maïano, Christophe; Thibault, Isabelle; Dreiskämper, Dennis; Henning, Lena; Tietjens, Maike; Aimé, Annie – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of the French and German versions of the Physical Self-Concept Questionnaire for Elementary School Children-Revised (PSCQ-C-R). A sample of 519 children participated in this study. Of those, 197 were French-Canadian and 322 were German. Results support the factor validity and…
Descriptors: Elementary School Students, Self Concept, Human Body, Questionnaires
Chen, Michelle Y.; Flasko, Jennifer J. – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2020
Seeking evidence to support content validity is essential to test validation. This is especially the case in contexts where test scores are interpreted in relation to external proficiency standards and where new test content is constantly being produced to meet test administration and security demands. In this paper, we describe a modified…
Descriptors: Foreign Countries, Reading Tests, Language Tests, English (Second Language)
Scribner, Emily D.; Harris, Sara E. – Journal of Geoscience Education, 2020
The Mineralogy Concept Inventory (MCI) is a statistically validated 18-question assessment that can be used to measure learning gains in introductory mineralogy courses. Development of the MCI was an iterative process involving expert consultation, student interviews, assessment deployment, and statistical analysis. Experts at the two universities…
Descriptors: Undergraduate Students, Mineralogy, Introductory Courses, Science Tests
Johnson, Stacey; Vuillemin, Anne; Geidne, Susanna; Kokko, Sami; Epstein, Jonathan; Van Hoye, Aurélie – Health Education & Behavior, 2020
Settings-based approaches have become an increasing health promotion focus since the World Health Organization's 1986 Ottawa Charter. While schools, cities, and prisons have implemented this approach, its development within sports environments is recent. Sports are a popular leisure-time activity, requiring validated tools to measure health…
Descriptors: Clubs, Health Promotion, Athletics, Test Construction
Hoffmann, Matt D.; Loughead, Todd M. – Measurement in Physical Education and Exercise Science, 2019
Using a multiphase approach, the purpose of the present study was to develop a psychometrically sound questionnaire to measure protégés' perceptions of peer athlete mentoring functions. Phase 1 consisted of three stages: (a) item development, (b) assessment of content validity via think-aloud interviews with peer mentored athletes, and (c)…
Descriptors: Athletes, Mentors, Questionnaires, Test Construction
Buono, Stephanie; Jang, Eunice Eunhee – Educational Assessment, 2021
Increasing linguistic diversity in classrooms has led researchers to examine the validity and fairness of standardized achievement tests, specifically concerning whether test score interpretations are free of bias and score use is fair for all students. This study examined whether mathematics achievement test items that contain complex language…
Descriptors: English Language Learners, Standardized Tests, Achievement Tests, Culture Fair Tests
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2016
To measure the response style of acquiescence, researchers recommend the use of at least 15 items with heterogeneous content. Such an approach is consistent with its theoretical definition and is a substantial improvement over traditional methods. Nevertheless, measurement of acquiescence can be enhanced by two additional considerations: first, to…
Descriptors: Test Items, Response Style (Tests), Test Content, Measurement
Kam, Chester Chun Seng; Zhou, Mingming – Educational and Psychological Measurement, 2015
Previous research has found the effects of acquiescence to be generally consistent across item "aggregates" within a single survey (i.e., essential tau-equivalence), but it is unknown whether this phenomenon is consistent at the "individual item" level. This article evaluated the often assumed but inadequately tested…
Descriptors: Test Items, Surveys, Criteria, Correlation
Liu, Yan; Zumbo, Bruno D.; Gustafson, Paul; Huang, Yi; Kroc, Edward; Wu, Amery D. – Practical Assessment, Research & Evaluation, 2016
A variety of differential item functioning (DIF) methods have been proposed and used for ensuring that a test is fair to all test takers in a target population in the situations of, for example, a test being translated to other languages. However, once a method flags an item as DIF, it is difficult to conclude that the grouping variable (e.g.,…
Descriptors: Test Items, Test Bias, Probability, Scores
Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014
Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…
Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests
Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013
The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…
Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests