NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)3
Since 2006 (last 20 years)42
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 77 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jin, Kuan-Yu; Wang, Wen-Chung – Journal of Educational Measurement, 2018
The Rasch facets model was developed to account for facet data, such as student essays graded by raters, but it accounts for only one kind of rater effect (severity). In practice, raters may exhibit various tendencies such as using middle or extreme scores in their ratings, which is referred to as the rater centrality/extremity response style. To…
Descriptors: Scoring, Models, Interrater Reliability, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Stuart, Nichola J.; Connelly, Vincent; Dockrell, Julie E. – Reading and Writing: An Interdisciplinary Journal, 2020
Verb use and the production of verb argument structure in the written texts of children in elementary school is a key stepping stone towards academic writing success that has remained relatively unexplored and is a notable gap in our understanding of writing development. To evaluate the role of verbs in the written narrative texts of children, we…
Descriptors: Verbs, Academic Language, Written Language, Elementary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013
Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…
Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Beauducel, Andre; Leue, Anja – Practical Assessment, Research & Evaluation, 2013
In several studies unit-weighted sum scales based on the unweighted sum of items are derived from the pattern of salient loadings in confirmatory factor analysis. The problem of this procedure is that the unit-weighted sum scales imply a model other than the initially tested confirmatory factor model. In consequence, it remains generally unknown…
Descriptors: Factor Analysis, Structural Equation Models, Goodness of Fit, Personality Measures
Peer reviewed Peer reviewed
Direct linkDirect link
Lopez, Francesca; Olson, Amy; Bansal, Naveen – Journal of Psychoeducational Assessment, 2011
Individually administered tests are often normed on small samples, a process that may result in irregularities within and across various age or grade distributions. Test users often smooth distributions guided by Thurstone assumptions (normality and linearity) to result in norms that adhere to assumptions made about how the data should look. Test…
Descriptors: Age Groups, Sampling, Sample Size, Raw Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Flipsen, Peter, Jr. – Volta Review, 2011
This study examines use of the Goldman-Fristoe Test of Articulation-Second Edition (GFTA-2) with children who use cochlear implants to evaluate whether or not it would be appropriate to use this test with this population. Participants included 15 children with cochlear implants who ranged in age of implantation and amount of implant experience.…
Descriptors: Children, Assistive Technology, Articulation (Speech), Speech Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012
The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…
Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Puhan, Gautam; von Davier, Alina A.; Gupta, Shaloo – Educational and Psychological Measurement, 2010
Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…
Descriptors: Statistical Distributions, Raw Scores, Equated Scores, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Costrell, Robert M. – Journal of School Choice, 2015
District costs for teachers' health insurance are, on average, higher then employer costs for private-sector professionals. How much of this is attributable to collective bargaining? This article examines the question using data from the National Compensation Survey (NCS) of the Bureau of Labor Statistics (BLS) and the state of Wisconsin. In…
Descriptors: Collective Bargaining, Health Insurance, Teacher Employment Benefits, National Surveys
Peer reviewed Peer reviewed
Direct linkDirect link
Garcia-Perez, Miguel A. – Journal of Educational and Behavioral Statistics, 2010
A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…
Descriptors: Computation, Statistical Analysis, True Scores, Raw Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Facon, Bruno; Nuchadee, Marie-Laure – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
Standardized tests are widely used in intellectual disability research, either as dependent or control variables. Yet, it is not certain that their items give rise to the same performance in various groups under study. In the present work, 48 participants with Down syndrome were matched on their raw score on Raven's Colored Progressive Matrices…
Descriptors: Test Items, Standardized Tests, Down Syndrome, Item Analysis
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Van Herwegen, Jo; Farran, Emily; Annaz, Dagmara – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Raven's Coloured Progressive Matrices (RCPM) is a standardised test that is commonly used to obtain a non-verbal reasoning score for children. As the RCPM involves the matching of a target to a pattern it is also considered to be a visuo-spatial perception task. RCPM is therefore frequently used in studies in Williams Syndrome (WS), in order to…
Descriptors: Control Groups, Raw Scores, Spatial Ability, Cognitive Processes
Peer reviewed Peer reviewed
Direct linkDirect link
Hennessey, Stephen – International Journal of Disability, Development and Education, 2011
This article describes a method for identifying test items as disability neutral for children with vision and motor disabilities. Graduate students rated 130 items of the Preschool Language Scale and obtained inter-rater correlation coefficients of 0.58 for ratings of items as disability neutral for children with vision disability, and 0.77 for…
Descriptors: Graduate Students, Test Items, Physical Disabilities, Multiple Disabilities
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6