Tyrone B. Pretorius; P. Paul Heppner; Anita Padmanabhanunni; Serena Ann Isaacs – SAGE Open, 2023
In previous studies, problem solving appraisal has been identified as playing a key role in promoting positive psychological well-being. The Problem Solving Inventory is the most widely used measure of problem solving appraisal and consists of 32 items. The length of the instrument, however, may limit its applicability to large-scale surveys…
Descriptors: Problem Solving, Measures (Individuals), Test Construction, Item Response Theory
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Different Analyses, Different Conclusions? Validity Evidence from the EGMA Spatial Reasoning Subtask
Perry, Lindsey – Global Education Review, 2018
As the global development community shifts its focus from improving access to education to improving learning and instruction, the need for instruments that accurately measure student achievement in mathematics and meet technical standards is increasing. This paper explores the importance of collecting high-quality validity evidence that aligns…
Descriptors: Mathematics Tests, Test Validity, Spatial Ability, Foreign Countries
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Gomez, Laura E.; Arias, Benito; Verdugo, Miguel Angel; Navas, Patricia – Journal of Intellectual & Developmental Disability, 2012
Background: Most instruments that assess quality of life have been validated by means of the classical test theory (CTT). However, CTT limitations have resulted in the development of alternative models, such as the Rasch rating scale model (RSM). The main goal of this paper is testing and improving the psychometric properties of the INTEGRAL…
Descriptors: Evidence, Models, Mental Retardation, Quality of Life
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012
The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…
Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
Cox, Judith Ellen – ProQuest LLC, 2010
Emotional intelligence in relationships can be developed and enhanced through the use of an assessment instrument within a mentoring or counseling relationship. The Relationship Skills Map (RSM) has been created for this purpose. This study concerns the validation of the Relationship Skills Map. Participants in this study included members of a…
Descriptors: Stress Management, Graduate Students, Emotional Intelligence, Time Management
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Kettler, Ryan J.; Elliott, Stephen N.; Beddow, Peter A. – Peabody Journal of Education, 2009
Federal regulations allow up to 2% of the student population of a state to achieve proficiency for adequate yearly progress by taking an alternate assessment based on modified academic achievement standards (AA-MAS). Such tests are likely to be easier, but as long as a test is considered a valid measure of grade level content, it is allowable as…
Descriptors: Test Items, Alternative Assessment, Academic Achievement, Test Validity
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Sainty, Geoffrey E. – Journal of Vocational Behavior, 1974
An empirical validation of the 114 Worker Trait Groups of the Dictionary of Occupational Titles was performed by comparing the factor structure of the worker trait components of the 114 WTG's with the factor structure of a random sample of 800 of the 4000 jobs used as the basis for DOT. (Author)
Descriptors: Employment, Item Analysis, Occupations, Test Theory