Emons, Wilco H. M. – Applied Psychological Measurement, 2009
For valid decision making, it is essential to both the person being measured and the person or organization that is having the person measured that the observed scores adequately represent the underlying trait. This study deals with person-fit analysis of polytomous item scores to detect unusual patterns of sum scores on subsets of items. This…
Descriptors: Personality Theories, Personality Measures, Scores, Test Items
Wuang, Yee-Pay; Lin, Yueh-Hsien; Su, Chwen-Yng – Research in Developmental Disabilities: A Multidisciplinary Journal, 2009
The Bruininks-Oseretsky Test of Motor Proficiency-Second Edition (BOT-2) is widely used to assess motor skills for both clinical and research purposes; however, its validity has not been adequately assessed in intellectual disabilities (ID). This study used the partial credit Rasch model to examine the measurement properties of the BOT-2 among 446…
Descriptors: Mental Retardation, Item Response Theory, Ability, Test Items
Weitzman, R. A. – Educational and Psychological Measurement, 2009
Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Test Bias
Lee, Young-Sun; Wollack, James A.; Douglas, Jeffrey – Educational and Psychological Measurement, 2009
The purpose of this study was to assess the model fit of a 2PL through comparison with the nonparametric item characteristic curve (ICC) estimation procedures. Results indicate that the three nonparametric procedures implemented produced ICCs similar to those of the 2PL for items simulated to fit the 2PL. However, for misfitting items,…
Descriptors: Nonparametric Statistics, Item Response Theory, Test Items, Simulation
Qian, Hong – ProQuest LLC, 2013
This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…
Descriptors: Teacher Education Programs, Pedagogical Content Knowledge, Preservice Teacher Education, Preservice Teachers
Maynard, Jennifer Leigh – ProQuest LLC, 2012
Emphasis on regular mathematics skill assessment, intervention, and progress monitoring under the RTI model has created a need for the development of assessment instruments that are psychometrically sound, reliable, universal, and brief. Important factors to consider when developing or selecting assessments for the school environment include what…
Descriptors: Response to Intervention, Mathematics Skills, Student Evaluation, Progress Monitoring
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
Wang, Hsuan-Po; Kuo, Bor-Chen; Tsai, Ya-Hsun; Liao, Chen-Huei – Turkish Online Journal of Educational Technology - TOJET, 2012
In the era of globalization, learning Chinese as a foreign language (CFL) has become increasingly popular worldwide. The increasing demand for learning CFL has raised the profile of the Chinese proficiency test (CPT). This study will analyze in depth the inadequacy of current CPTs utilizing the Common European Framework of…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Global Approach
Lakin, Joni M.; Gambrell, James L. – Intelligence, 2012
Measures of broad fluid abilities including verbal, quantitative, and figural reasoning are commonly used in the K-12 school context for a variety of purposes. However, differentiation of these domains is difficult for young children (grades K-2) who lack basic linguistic and mathematical literacy. This study examined the latent factor structure…
Descriptors: Evidence, Validity, Item Response Theory, Numeracy
Wilson, Jackson – ProQuest LLC, 2010
Dysfunctional voluntary employee turnover is an issue that leads to major direct and indirect costs (e.g., Sagie, Birati, & Tziner, 2002). Although job satisfaction has classically been the predominant construct used to explain turnover, recently a new construct, job embeddedness, has been relatively successful at helping explain additional…
Descriptors: Adventure Education, Job Satisfaction, Labor Turnover, Test Construction
Dumas, Helene M. – International Journal of Rehabilitation Research, 2010
The PEDI-CAT is a new computer adaptive test (CAT) version of the Pediatric Evaluation of Disability Inventory (PEDI). Additional PEDI-CAT items specific to postacute pediatric hospital care were recently developed using expert reviews and cognitive interviewing techniques. Expert reviews established face and construct validity, providing positive…
Descriptors: Hospitals, Adaptive Testing, Content Validity, Construct Validity
DiStefano, Christine; Morgan, Grant B. – School Psychology Quarterly, 2010
This study examined the Behavioral and Emotional Screening System Teacher Rating System for Children and Adolescents (BESS TRS-CA; Kamphaus & Reynolds, 2007) screener using Rasch Rating Scale model (RSM) methodology to provide additional information about psychometric properties of items. Data from the Behavioral Assessment System for Children…
Descriptors: Rating Scales, Item Response Theory, Children, Adolescents
Clary, Renee M.; Wandersee, James H. – Science & Education, 2010
Archive-based, historical research of materials produced during the Golden Age of Geology (1788-1840) uncovered scientific caricatures (SCs) which may serve as a unique form of knowledge representation for students today. SCs played important roles in the past, stimulating critical inquiry among early geologists and fueling debates that addressed…
Descriptors: Test Items, Student Evaluation, Student Reaction, Alternative Assessment
Randall, Jennifer; Engelhard, George, Jr. – Applied Measurement in Education, 2010
The psychometric properties and multigroup measurement invariance of scores across subgroups, items, and persons on the "Reading for Meaning" items from the Georgia Criterion Referenced Competency Test (CRCT) were assessed in a sample of 778 seventh-grade students. Specifically, we sought to determine the extent to which score-based…
Descriptors: Testing Accommodations, Test Items, Learning Disabilities, Factor Analysis
Item Equivalence in English and Chinese Translation of a Cognitive Development Test for Preschoolers
He, Wei; Wolfe, Edward W. – International Journal of Testing, 2010
This article reports the results of a study of potential sources of item nonequivalence between English and Chinese language versions of a cognitive development test for preschool-aged children. Items were flagged for potential nonequivalence through statistical and judgment-based procedures, and the relationship between flag status and item…
Descriptors: Preschool Children, Mandarin Chinese, Cognitive Development, Item Analysis