ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Scoring Formulas	10
Test Reliability	10
Evaluation Criteria	3
Testing Problems	3
Cutting Scores	2
Error of Measurement	2
Evaluation Methods	2
Grading	2
Guessing (Tests)	2
Measurement Techniques	2
Scores	2
Test Validity	2
Accuracy	1
Achievement Rating	1
Administration	1
Behavioral Objectives	1
Benchmarking	1
Capacity Building	1
Classification	1
College Entrance Examinations	1
Computer Assisted Testing	1
Creative Activities	1
Creative Thinking	1
Creativity	1
Criterion Referenced Tests	1

Source

Educational Leadership	2
Educational and Psychological…	2
Assessment & Evaluation in…	1
Creativity Research Journal	1
International Review of…	1
National Center for Research…	1

Author

Acar, Selcuk	1
Albanese, Mark A.	1
Burton, Richard F.	1
Feldman, Jo	1
Griffin, Noelle	1
Guskey, Thomas R.	1
Jung, Lee Ann	1
Kambal, M. Osman	1
Kroc, Edward	1
Niemi, David	1
Olvera Astivia, Oscar L.	1
Raju, Nambury S.	1
Runco, Mark A.	1
Vallone, Julia	1
Wang, Haiwen	1
Wang, Jia	1
Yen, Wendy M.	1
Zughoul, Muhammad R.	1

Publication Type

Reports - Evaluative	10
Journal Articles	7
Speeches/Meeting Papers	2
Guides - Classroom - Teacher	1
Reports - Research	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education

Location

Mississippi

Assessments and Surveys

Graduate Management Admission…	1
SAT (College Admission Test)	1

Showing all 10 results

The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores

Peer reviewed

Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022

Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…

Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement
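
The abstract's remark that subscale-by-subscale cutoffs reduce the overall probability of "passing" can be illustrated with a minimal sketch. It is not the authors' method: the function names are hypothetical, and the first function assumes the subscales are statistically independent, which real subscales rarely are. The second function gives the general upper bound that holds under any dependence.

```python
def joint_pass_probability_independent(marginal_pass_rates):
    """Probability of passing every subscale, ASSUMING independent subscales.

    Under independence the joint probability is the product of the
    marginal pass rates, so it shrinks as subscales are added.
    """
    joint = 1.0
    for rate in marginal_pass_rates:
        if not 0.0 <= rate <= 1.0:
            raise ValueError("each pass rate must lie in [0, 1]")
        joint *= rate
    return joint


def joint_pass_probability_upper_bound(marginal_pass_rates):
    """Upper bound on passing every subscale under ANY dependence.

    Passing all subscales can never be more likely than passing the
    hardest single one.
    """
    return min(marginal_pass_rates)


if __name__ == "__main__":
    # Illustrative numbers only: three subscales, each passed by 80% of examinees.
    rates = [0.8, 0.8, 0.8]
    print(joint_pass_probability_independent(rates))  # about 0.512
    print(joint_pass_probability_upper_bound(rates))
```

With correlated subscales the true joint pass rate falls somewhere between these two values, which is one way to see why the choice of cutoffs interacts with the dependence between subscales.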

The End of Points

Feldman, Jo – Educational Leadership, 2018

Have teachers become too dependent on points? This article explores educators' dependency on their points systems, and the ways that points can distract teachers from really analyzing students' capabilities and achievements. Feldman argues that using a more subjective grading system can help illuminate crucial information about students and what…

Descriptors: Grading, Evaluation Methods, Evaluation Criteria, Achievement Rating

Grading: Why You Should Trust Your Judgment

Guskey, Thomas R.; Jung, Lee Ann – Educational Leadership, 2016

Many educators consider grades calculated from statistical algorithms more accurate, objective, and reliable than grades they calculate themselves. But in this research, the authors first asked teachers to use their professional judgment to choose a summary grade for hypothetical students. When the researchers compared the teachers' grade with the…

Descriptors: Grading, Computer Assisted Testing, Interrater Reliability, Grades (Scholastic)

Divergent Thinking as an Indicator of Creative Potential

Peer reviewed

Runco, Mark A.; Acar, Selcuk – Creativity Research Journal, 2012

Divergent thinking (DT) tests are very often used in creativity studies. Certainly DT does not guarantee actual creative achievement, but tests of DT are reliable and reasonably valid predictors of certain performance criteria. The validity of DT is described as reasonable because validity is not an all-or-nothing attribute, but is, instead, a…

Descriptors: Creativity, Creative Activities, Creative Thinking, Test Validity

The Reliability of a Criterion-Referenced Composite with the Parts of the Composite Having Different Cutting Scores.

Peer reviewed

Raju, Nambury S. – Educational and Psychological Measurement, 1982

Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)

Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Recommendations for Building a Valid Benchmark Assessment System: Second Report to the Jackson Public Schools. CRESST Report 724

Niemi, David; Wang, Jia; Wang, Haiwen; Vallone, Julia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007

There are usually many testing activities going on in a school, with different tests serving different purposes, thus organization and planning are key in creating an efficient system in assessing the most important educational objectives. In the ideal case, an assessment system will be able to inform on student learning, instruction and…

Descriptors: School Administration, Educational Objectives, Administration, Public Schools

Objective Evaluation of EFL Composition.

Peer reviewed

Zughoul, Muhammad R.; Kambal, M. Osman – International Review of Applied Linguistics in Language Teaching, 1983

Based on the responses of 50 ESL instructors to a composition-scoring exercise, a detailed method of scoring compositions was developed that divides the writing into basic components (structure, content, vocabulary, organization, and mechanics) and provides a scoring mechanism for each component for each of three competency levels. (MSE)

Descriptors: English (Second Language), Evaluation Criteria, Evaluation Methods, Measurement Techniques

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
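
As background to the abstract's mention of confidence limits, the following is a minimal sketch of the standard classical-test-theory relationship between the standard error of measurement (SEM), the score standard deviation, and a reliability coefficient. The formula is textbook material rather than anything taken from Burton's article, and the function names and the default z value of 1.96 (an approximate 95% band under a normality assumption) are illustrative choices.

```python
import math


def standard_error_of_measurement(score_sd, reliability):
    """Classical SEM: SD * sqrt(1 - reliability)."""
    if score_sd < 0:
        raise ValueError("score_sd must be non-negative")
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return score_sd * math.sqrt(1.0 - reliability)


def confidence_limits(observed_score, score_sd, reliability, z=1.96):
    """Approximate confidence band around an observed score.

    Assumes normally distributed measurement error; z=1.96 gives
    roughly a 95% band.
    """
    sem = standard_error_of_measurement(score_sd, reliability)
    return (observed_score - z * sem, observed_score + z * sem)


if __name__ == "__main__":
    # Illustrative numbers only: SD of 10 points, reliability of 0.84.
    print(standard_error_of_measurement(10.0, 0.84))  # about 4.0
    print(confidence_limits(60.0, 10.0, 0.84))
```

Because the reliability coefficient enters together with the score spread, the same SEM can correspond to different reliability values in groups with different spreads of attainment, which is the dependence the abstract points to.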

Some Comments on the Correction for Guessing. A Further Analysis of Angoff and Schrader.

Albanese, Mark A. – 1985

This study reexamines results reported by Angoff and Schrader regarding formula directions and rights directions for standardized tests. In that study, it was concluded that the two scoring directions were essentially equivalent. In this study, methodological concerns are discussed and additional data analyses undertaken. Among various…

Descriptors: College Entrance Examinations, Data Interpretation, Fatigue (Biology), Guessing (Tests)
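
For readers unfamiliar with the two scoring directions, the sketch below contrasts rights scoring (count of correct answers) with the conventional correction-for-guessing formula score, R - W/(k - 1), where R is the number right, W the number wrong, and k the number of options per item. This is the standard textbook formula, assumed here for illustration; the abstract does not state which variant the study analyzed, and the function names are hypothetical.

```python
def rights_score(num_right):
    """Rights scoring: the score is simply the number of correct answers."""
    if num_right < 0:
        raise ValueError("num_right must be non-negative")
    return num_right


def formula_score(num_right, num_wrong, options_per_item):
    """Conventional correction for guessing: R - W / (k - 1).

    Omitted items are neither rewarded nor penalized, so they do not
    appear in the formula.
    """
    if num_right < 0 or num_wrong < 0:
        raise ValueError("counts must be non-negative")
    if options_per_item < 2:
        raise ValueError("items need at least two options")
    return num_right - num_wrong / (options_per_item - 1)


if __name__ == "__main__":
    # Illustrative numbers only: 40 right, 10 wrong, 5-option items.
    print(rights_score(40))          # 40
    print(formula_score(40, 10, 5))  # 37.5
```

Under this formula an examinee who guesses blindly on five-option items gains nothing on average, since one expected correct answer is offset by the penalty for four expected wrong ones.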