Anani Sarab, Mohammad Reza; Rahmani, Simindokht – International Journal of Language Testing, 2023
Language testing and assessment have grown in popularity and gained significance in the last few decades, and there is a rising need for assessment literate stakeholders in the field of language education. As teachers play a major role in assessing students, there is a need to make sure they have the right level of assessment knowledge and skills…
Descriptors: Language Tests, Literacy, Second Language Learning, Factor Analysis
Spurgeon, Shawn L. – Measurement and Evaluation in Counseling and Development, 2017
Construct irrelevance (CI) and construct underrepresentation (CU) are 2 major threats to validity, yet they are rarely discussed within the counseling literature. This article provides information about the relevance of these threats to internal validity. An illustrative case example will be provided to assist counselors in understanding these…
Descriptors: Construct Validity, Evaluation Criteria, Evaluation Methods, Evaluation Problems
Holmes, Stephen D.; He, Qingping; Meadows, Michelle – Research in Mathematics Education, 2017
The relationship between the characteristics of 33 mathematical problem-solving questions answered by 16-year-old students in England and the quality of problem-solving elicited was investigated in two studies. The first study used comparative judgement (CJ) to estimate the quality of the problem-solving elicited by each question, involving 33…
Descriptors: Foreign Countries, Mathematics Skills, Problem Solving, Mathematical Logic
Avsec, Stanislav; Jamšek, Janez – International Journal of Technology and Design Education, 2016
Technological literacy is identified as a vital achievement of technology- and engineering-intensive education. It guides the design of technology and technical components of educational systems and defines competitive employment in technological society. Existing methods for measuring technological literacy are incomplete or complicated,…
Descriptors: Technological Literacy, Elementary School Students, Secondary School Students, Evaluation Methods
Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017
We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…
Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods
Perry, John L.; Clough, Peter J.; Crust, Lee; Nabb, Sam L.; Nicholls, Adam R. – Research Quarterly for Exercise and Sport, 2015
Purpose: A new measure of sportspersonship, which differentiates between compliance and principled approaches, was developed and initially validated in 3 studies. Method: Study 1 developed items, assessed content validity, and proposed a model. Study 2 tested the factorial validity of the model on an independent sample. Study 3 further tested the…
Descriptors: Program Development, Program Validation, Physical Education, Compliance (Legal)
Kirschner, Sophie; Borowski, Andreas; Fischer, Hans E.; Gess-Newsome, Julie; von Aufschnaiter, Claudia – International Journal of Science Education, 2016
Teachers' professional knowledge is assumed to be a key variable for effective teaching. As teacher education has the goal to enhance professional knowledge of current and future teachers, this knowledge should be described and assessed. Nevertheless, only a limited number of studies quantitatively measures physics teachers' professional…
Descriptors: Evaluation Methods, Tests, Test Format, Science Instruction
Dolan, Robert P.; Burling, Kelly; Harms, Michael; Strain-Seymour, Ellen; Way, Walter; Rose, David H. – Pearson, 2013
The increased capabilities offered by digital technologies offer new opportunities to evaluate students' deeper knowledge and skills and on constructs that are difficult to measure using traditional methods. Such assessments can also incorporate tools and interfaces that improve accessibility for diverse students, as well as inadvertently…
Descriptors: Educational Technology, Technology Uses in Education, Access to Education, Evaluation Methods
Liu, Xiufeng; Waight, Noemi; Gregorius, Roberto; Smith, Erica; Park, Mihwa – Journal of Computers in Mathematics and Science Teaching, 2012
This paper reports a feasibility study on developing computer model-based assessments of chemical reasoning at the high school level. Computer models are Flash and NetLogo environments to make simultaneously available three domains in chemistry: macroscopic, submicroscopic, and symbolic. Students interact with computer models to answer assessment…
Descriptors: Validity, Chemistry, Units of Study, Construct Validity
Petry, Katja; Maes, Bea; Vlaskamp, Carla – Research in Developmental Disabilities: A Multidisciplinary Journal, 2009
Because of a shortage of valid instruments to measure the QOL of people with profound multiple disabilities (PMD), the QOL-PMD was developed. In the present study, possibilities for item reduction as well as the psychometric properties of the questionnaire were examined. One hundred and forty-seven informants of people with PMD participated in the…
Descriptors: Multiple Disabilities, Quality of Life, Construct Validity, Questionnaires
Penfield, Randall D.; Giacobbi, Peter R., Jr.; Myers, Nicholas D. – Research Quarterly for Exercise and Sport, 2007
One aspect of construct validity is the extent to which the measurement properties of a rating scale are invariant across the groups being compared. An increasingly used method for assessing between-group differences in the measurement properties of items of a scale is the framework of differential item functioning (DIF). In this paper we…
Descriptors: Physical Education, Test Items, Construct Validity, Test Validity
Hagtvet, Knut A.; Nasser, Fadia M. – Structural Equation Modeling, 2004
This article presents a methodology for examining the content and nature of item parcels as indicators of a conceptually defined latent construct. An essential component of this methodology is the 2-facet measurement model, which includes items and parcels as facets of construct indicators. The 2-facet model tests assumptions required for…
Descriptors: Evaluation Methods, Validity, Test Anxiety, Content Validity
Emmerich, Walter; And Others – 1991
The aim of this research was to identify, develop, and evaluate empirically new reasoning item types that might be used to broaden the analytical measure of the Graduate Record Examinations (GRE) General Test and to strengthen its construct validity. Six item types were selected for empirical evaluation, including the two currently used in the GRE…
Descriptors: Construct Validity, Correlation, Evaluation Methods, Sex Differences

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Geisinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity

Bejar, Isaac I.; Yocom, Peter – Applied Psychological Measurement, 1991
An approach to test modeling is illustrated that encompasses both response consistency and response difficulty. This generative approach makes validation an ongoing process. An analysis of hidden figure items with 60 high school students supports the feasibility of the method. (SLD)
Descriptors: Construct Validity, Difficulty Level, Evaluation Methods, High School Students