Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 18 |
Descriptor
Item Analysis | 236 |
Test Interpretation | 236 |
Test Construction | 91 |
Test Validity | 76 |
Test Reliability | 68 |
Test Items | 59 |
Achievement Tests | 47 |
Criterion Referenced Tests | 41 |
Test Results | 38 |
Statistical Analysis | 33 |
Testing | 32 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 8 |
Higher Education | 6 |
Elementary Education | 3 |
Postsecondary Education | 3 |
Adult Education | 1 |
Grade 6 | 1 |
High Schools | 1 |
Secondary Education | 1 |
Location
Pennsylvania | 6 |
Australia | 3 |
Canada | 3 |
Israel | 3 |
Michigan | 3 |
California | 2 |
South Carolina | 2 |
Alaska | 1 |
Brazil | 1 |
Finland | 1 |
Illinois | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
National Defense Education Act | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Chiavaroli, Neville – Practical Assessment, Research & Evaluation, 2017
Despite the majority of MCQ writing guides discouraging the use of negatively-worded multiple choice questions (NWQs), they continue to be regularly used both in locally produced examinations and commercially available questions. There are several reasons why the use of NWQs may prove resistant to sound pedagogical advice. Nevertheless, systematic…
Descriptors: Multiple Choice Tests, Test Construction, Test Items, Validity
Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017
The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…
Descriptors: Test Items, Test Bias, Item Response Theory, Surveys
Reynolds, Matthew R.; Niileksela, Christopher R. – Journal of Psychoeducational Assessment, 2015
"The Woodcock-Johnson IV Tests of Cognitive Abilities" (WJ IV COG) is an individually administered measure of psychometric intellectual abilities designed for ages 2 to 90+. The measure was published by Houghton Mifflin Harcourt-Riverside in 2014. Frederick Shrank, Kevin McGrew, and Nancy Mather are the authors. Richard Woodcock, the…
Descriptors: Cognitive Tests, Testing, Scoring, Test Interpretation
Chang, Wen-Chia Claire – ProQuest LLC, 2017
Preparing and supporting teachers to enact teaching practice that responds to diversity, challenges educational inequities, and promotes social justice is a pressing yet daunting and complex task. More research is needed to understand how and to what extent teacher education programs prepare and support teacher candidates to enhance the…
Descriptors: Test Construction, Educational Practices, Equal Education, Item Response Theory
Irby, Sarah M.; Floyd, Randy G. – Canadian Journal of School Psychology, 2013
The Wechsler Abbreviated Scale of Intelligence, Second Edition (WASI-II; Wechsler, 2011) is a brief intelligence test designed for individuals aged 6 through 90 years. It is a revision of the Wechsler Abbreviated Scale of Intelligence (WASI; Wechsler, 1999). During revision, there were three goals: enhancing the link between the Wechsler…
Descriptors: Test Reviews, Intelligence Tests, Psychometrics, Item Analysis
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Simek, Amber N.; Wahlberg, Andrea C. – Journal of Psychoeducational Assessment, 2011
This article reviews Autism Spectrum Rating Scales (ASRS) which are designed to measure behaviors in children between the ages of 2 and 18 that are associated with disorders on the autism spectrum as rated by parents/caregivers and/or teachers. The rating scales include items related to behaviors associated with Autism, Asperger's Disorder, and…
Descriptors: Autism, Mental Disorders, Test Reviews, Behavior Rating Scales
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Carretero-Dios, Hugo; Macarena, De los Santos-Roig; Buela-Casal, Gualberto – Learning and Individual Differences, 2008
This study is an item analysis of the Matching Familiar Figures Test-20. We examined error scores in the Matching Familiar Figures Test-20 to determine the influence of the difficulty of the test on the assessment of reflection-impulsivity. The sample included 700 participants aged between 6 and 12 years. The results obtained from the corrected…
Descriptors: Conceptual Tempo, Individual Differences, Item Analysis, Children
Westbrook, Bert W. – Measurement and Evaluation in Guidance, 1974
Descriptive behavioral statements were written for each of the 609 items found on six career development tests in order to compare the tests in terms of their coverage of the major career development components and in terms of the specific learner behaviors included in the test outline. (Author)
Descriptors: Career Development, Career Education, Item Analysis, Test Interpretation