NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)1
Since 2006 (last 20 years)6
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Martin, Jeremy P. – Change: The Magazine of Higher Learning, 2015
Rankings are a powerful force in higher education, swaying the enrollment decisions of prospective students and affecting the opinions of parents, board members, and policymakers. In the words of one provost, "The rankings matter to our university because they matter to people who matter to us." Rankings are also a business--one that is…
Descriptors: Higher Education, Achievement Rating, Institutional Characteristics, Reputation
Northwest Evaluation Association, 2016
Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…
Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores
Northwest Evaluation Association, 2015
Concordance tables have been used for decades to relate scores on different tests measuring similar but distinct constructs. These tables, typically derived from statistical linking procedures, provide a direct link between scores on different tests and serve various purposes. Aside from describing how a score on one test relates to performance on…
Descriptors: Outcome Measures, Tables (Data), Language Arts, English Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Runco, Mark A.; Acar, Selcuk – Creativity Research Journal, 2012
Divergent thinking (DT) tests are very often used in creativity studies. Certainly DT does not guarantee actual creative achievement, but tests of DT are reliable and reasonably valid predictors of certain performance criteria. The validity of DT is described as reasonable because validity is not an all-or-nothing attribute, but is, instead, a…
Descriptors: Creativity, Creative Activities, Creative Thinking, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Schochet, Peter Z.; Chiang, Hanley S. – National Center for Education Evaluation and Regional Assistance, 2010
This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Student Evaluation, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Baldi, Stephane, Ed.; Kutner, Mark; Greenberg, Elizabeth; Jin, Ying; Baer, Justin; Moore, Elizabeth; Dunleavy, Eric; Berlin, Martha; Mohadjer, Leyla; Binzer, Greg; Krenzke, Thomas; Hogan, Jacqueline; Amsbary, Michelle; Forsyth, Barbara; Clark, Lyn; Annis, Terri; Bernstein, Jared; White, Sheida – National Center for Education Statistics, 2009
The 2003 National Assessment of Adult Literacy (NAAL) assessed the English literacy skills of a nationally representative sample of more than 19,000 U.S. adults (age 16 and older) residing in households and correctional institutions. NAAL is the first national assessment of adult literacy since the 1992 National Adult Literacy Survey (NALS). The…
Descriptors: Correctional Institutions, Scaling, Numeracy, Field Tests
Peer reviewed Peer reviewed
Albanese, Mark A. – Journal of Educational Measurement, 1988
Estimates of the effects of use of formula scoring on the individual examinee's score are presented. Results for easy, moderate, and hard tests are examined. Using test characteristics from several studies shows that some examinees would increase scores substantially if they were to answer items omitted under formula directions. (SLD)
Descriptors: Difficulty Level, Guessing (Tests), Scores, Scoring Formulas
Peer reviewed Peer reviewed
Drasgow, Fritz; And Others – Applied Psychological Measurement, 1989
Multilinear formula scoring (MFS) is reviewed, with emphasis on estimating option characteristic curves (OCSs). MFS was used to estimate OCSs for the arithmetic reasoning subtest of the Armed Services Vocational Aptitude Battery for 2,978 examinees. A second analysis obtained OCSs for simulated data. The use of MFS is discussed. (SLD)
Descriptors: Estimation (Mathematics), Mathematical Models, Multiple Choice Tests, Scores
Livingston, Samuel A. – 1981
The standard error of measurement (SEM) is a measure of the inconsistency in the scores of a particular group of test-takers. It is largest for test-takers with scores ranging in the 50 percent correct bracket; with nearly perfect scores, it is smaller. On tests used to make pass/fail decisions, the test-takers' scores tend to cluster in the range…
Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Formulas, Pass Fail Grading
Peer reviewed Peer reviewed
Direct linkDirect link
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
Cole, Nancy S. – 1982
The advantages and disadvantages of grade equivalent (GE) scores are explored, including appropriate uses for GE type scores and how to bring current GE scales closer to the type of information educators appear to desire. Although GE scores are not an equal interval scale, not comparable across school subjects, and do not indicate the grade level…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Methods, Formative Evaluation
Nevo, David – 1989
The purpose of this study was to develop a testing method for the assessment of various types of writing at the elementary school level that would meet acceptable standards for educational measurement instruments as well as standards of utility and feasibility within a given educational system. The study was conducted within the framework of an…
Descriptors: Elementary School Students, Essay Tests, Expressive Language, Foreign Countries