Showing 316 to 330 of 582 results
Peer reviewed
McGaw, Barry; Glass, Gene V. – American Educational Research Journal, 1980
There are difficulties in expressing effect sizes on a common metric when some studies use transformed scales to express group differences, or use factorial designs or covariance adjustments to obtain a reduced error term. A common metric on which effect sizes may be standardized is described. (Author/RL)
Descriptors: Control Groups, Error of Measurement, Mathematical Models, Research Problems
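The standardization the abstract describes can be sketched in a few lines. This is an illustrative reconstruction, not the authors' procedure: the function names and the choice between a control-group and a pooled standard deviation are assumptions for the example.

```python
import math

def glass_delta(mean_exp, mean_ctrl, sd_ctrl):
    """Group difference standardized by the control-group SD (Glass's delta)."""
    return (mean_exp - mean_ctrl) / sd_ctrl

def pooled_effect_size(mean_exp, mean_ctrl, sd_exp, sd_ctrl, n_exp, n_ctrl):
    """Group difference standardized by the pooled SD (Cohen's d).

    Pooling weights each group's variance by its degrees of freedom, so
    studies reporting different group sizes land on one common metric.
    """
    pooled_var = ((n_exp - 1) * sd_exp**2 + (n_ctrl - 1) * sd_ctrl**2) \
                 / (n_exp + n_ctrl - 2)
    return (mean_exp - mean_ctrl) / math.sqrt(pooled_var)
```

With equal group SDs the two conventions coincide; they diverge exactly in the situations the abstract flags, where covariance adjustments or factorial designs shrink the error term used for standardization.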
Peer reviewed
Kleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979
Assuming different values of the standard error of measurement, the relation of scale coarseness to the total amount of error is studied on the basis of the probability distribution of error. The analyses are performed within two models of error and with two criteria of amount of error. (Editor/SJL)
Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading
Berger, Peter N. – Teaching and Learning Literature with Children and Young Adults, 1997
Discusses problems with scoring reliability of the Vermont Education Department's writing portfolio test, particularly the difficulties teachers face in agreeing upon scoring criteria. (PA)
Descriptors: Elementary Secondary Education, Interrater Reliability, Portfolio Assessment, Portfolios (Background Materials)
Peer reviewed
Attali, Yigal – ETS Research Report Series, 2007
Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…
Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring
Frary, Robert B.; And Others – 1985
Students in an introductory college course (n=275) responded to equivalent 20-item halves of a test under number-right and formula-scoring instructions. Formula scores of those who omitted items averaged about one point lower than their comparable (formula-adjusted) scores on the test half administered under number-right instructions. In contrast,…
Descriptors: Guessing (Tests), Higher Education, Multiple Choice Tests, Questionnaires
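The two scoring instructions the abstract contrasts can be sketched as follows. This is a generic illustration of conventional formula scoring, not the study's exact computation; the four-option example is an assumption.

```python
def number_right_score(n_correct):
    """Number-right scoring: wrong answers and omits both count zero."""
    return n_correct

def formula_score(n_correct, n_wrong, n_choices):
    """Formula scoring R - W/(k-1): subtracts the expected gain from blind
    guessing on k-choice items, so omitting and guessing at random have the
    same expected score. Omitted items contribute nothing."""
    return n_correct - n_wrong / (n_choices - 1)
```

For example, 12 right and 4 wrong on 4-option items yields a formula score of 12 - 4/3, about 10.67, versus a number-right score of 12.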
Kazelskis, Richard; And Others – 1987
Numerous techniques are available for determining cutoff scores for distinguishing between proficient and non-proficient examinees. One of the more commonly cited techniques for standard setting is the Nedelsky Method. In response to criticism of this method, Gross (1985) presented a revised Nedelsky technique. However, no research beyond that…
Descriptors: Competence, Cutting Scores, Measurement Techniques, Scoring Formulas
Peer reviewed
Brannigan, Gary G. – Psychology in the Schools, 1975
Several studies concerning scoring difficulties on the Wechsler intelligence scales were reviewed. Since scoring of responses on the comprehension, similarities and vocabulary subtests of the Wechsler scales demands judgements by the examiner, the possibility of poor interscorer reliability increases. More thorough scoring standards and revision…
Descriptors: Intelligence Differences, Intelligence Tests, Measurement Techniques, Psychological Testing
Peer reviewed
Baskin, David – Journal of Educational Measurement, 1975
Traditional test scoring does not allow the examination of differences among subjects obtaining identical raw scores on the same test. A configuration scoring paradigm for identical raw scores, which provides for such comparisons, is developed and illustrated. (Author)
Descriptors: Elementary Secondary Education, Individual Differences, Mathematical Models, Multiple Choice Tests
Moy, Raymond H. – 1981
The problem of standard setting on language proficiency tests is often approached by the use of norms derived from the group being tested, a process commonly known as "grading on the curve." One particular problem with this ad hoc method of standard setting is that it will usually result in a fluctuating standard dependent on the particular group…
Descriptors: Cutting Scores, Higher Education, Language Proficiency, Norm Referenced Tests
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
Modu, Christopher C. – 1981
The effects of applying different methods of determining different sets of subscore weights on the composite score ranking of examinees were investigated. Four sets of subscore weights were applied to each of three examination results. The scores were from Advanced Placement (AP) Examinations in History of Art, Spanish Language, and Chemistry. One…
Descriptors: Advanced Placement Programs, Correlation, Equated Scores, Higher Education
Boldt, Robert F. – 1974
One formulation of confidence scoring requires the examinee to report, as a number, his personal probability that each alternative of a multiple-choice item is correct. Under this formulation, a linear transformation of the logarithm of the probability assigned to the correct response is maximized in expectation when the examinee accurately reports his personal probability. To equate…
Descriptors: Confidence Testing, Guessing (Tests), Multiple Choice Tests, Probability
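The property the abstract describes, that honest probability reporting maximizes the expected logarithmic score, can be sketched directly. The function names are illustrative, and the linear transformation is omitted since it does not change which report is optimal.

```python
import math

def log_confidence_score(reported_probs, correct_index):
    """Logarithmic confidence score: log of the probability the examinee
    assigned to the alternative that turned out to be correct."""
    return math.log(reported_probs[correct_index])

def expected_score(true_probs, reported_probs):
    """Expected log score under the examinee's own beliefs: the average of
    log(reported p) weighted by the examinee's true personal probabilities."""
    return sum(p * math.log(r) for p, r in zip(true_probs, reported_probs))
```

By Gibbs' inequality, `expected_score(p, p) >= expected_score(p, q)` for any other report `q`, which is what makes the logarithmic rule "proper": misreporting one's personal probabilities can only lower the expected score.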
Jacobs, Stanley S. – 1974
Investigated were the effects of two levels of penalty for incorrect responses on two dependent variables (a measure of risk-taking or confidence, based on nonsense items, and the number of response-attempts to legitimate items) for three treatment groups in a 2x3, multi-response repeated measures, multivariate ANOVA (Analysis of Variance) design.…
Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests
Felsenthal, Norman A.; Felsenthal, Helen – 1972
A computer program called TEXAN (Textual Analysis of Language Samples) was developed for use in calculating frequency of characters, words, punctuation units, and stylistic variables. Its usefulness in determining readability levels was examined in an analysis of language samples from 20 elementary tradebooks used as supplementary reading…
Descriptors: Automatic Indexing, Comparative Analysis, Computational Linguistics, Information Processing
Peer reviewed
Kane, Michael; Moloney, James – Applied Psychological Measurement, 1978
The answer-until-correct (AUC) procedure requires that examinees respond to a multiple-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests
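The two scoring procedures the abstract compares can be sketched as follows. The linearly declining AUC credit is one common illustrative scheme, not necessarily the one analyzed in the paper.

```python
def zero_one_score(n_attempts):
    """Conventional zero-one scoring: full credit only if the first
    response is correct."""
    return 1 if n_attempts == 1 else 0

def auc_score(n_attempts, n_choices):
    """Answer-until-correct scoring with credit declining linearly in the
    number of attempts: 1 for a first-try success, 0 if every distractor
    was tried first (one illustrative scheme)."""
    return (n_choices - n_attempts) / (n_choices - 1)
```

On a 4-option item, a first-try success scores 1.0 under both procedures, while a second-try success scores 2/3 under AUC but 0 under zero-one scoring; this partial credit is what changes how guessing feeds into item reliability.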