ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	7

Descriptor

Correlation	21
Test Reliability	21
Weighted Scores	21
Test Validity	13
Factor Analysis	6
Scores	6
Scoring	6
Statistical Analysis	6
Item Analysis	5
Scoring Formulas	5
Second Language Learning	5
Achievement Tests	4
Comparative Analysis	4
High School Students	4
Measurement Techniques	4
Multiple Choice Tests	4
Test Construction	4
Test Items	4
Aptitude Tests	3
English (Second Language)	3
Foreign Countries	3
Guessing (Tests)	3
Language Tests	3
Mathematical Models	3
Prediction	3
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
College Board	1
ETS Research Report Series	1
International Association for…	1
Journal of Educational and…	1
Language Learning in Higher…	1
Measurement in Physical…	1
PROFILE: Issues in Teachers'…	1

Publication Type

Reports - Research	8
Journal Articles	7
Collected Works - Proceedings	1
Non-Print Media	1
Reference Materials - General	1
Reports - Descriptive	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	4
Postsecondary Education	4
Adult Education	1
Elementary Secondary Education	1
High Schools	1
Secondary Education	1

Audience

Location

Asia	1
Australia	1
Brazil	1
Colombia	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Ireland (Dublin)	1
Israel	1
Italy	1
Japan	1
Kazakhstan	1
Michigan	1
Netherlands	1
Norway	1
Ohio	1
Pakistan	1
Pennsylvania	1
Philippines	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
California Achievement Tests	1
Graduate Record Examinations	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

A Guide for Setting the Cut-Scores to Minimize Weighted Classification Errors in Test Batteries

Peer reviewed

Direct link

Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017

In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…

Descriptors: Cutting Scores, Weighted Scores, Classification, Testing

Aligning English Language Testing with Curriculum

Peer reviewed
PDF on ERIC

Download full text

Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016

Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…

Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction

Standardising Assessment to Meet Student Needs in Foreign Language Modules in a University Context: Is Standardisation Possible?

Peer reviewed

Direct link

Nunan, Anna – Language Learning in Higher Education, 2014

The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…

Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards

The Reliability of a Promotional Job Knowledge Examination Scored by Number of Items Right and by Four Confidence Weighting Procedures and Its Corresponding Concurrent Validity Estimates Relative to Performance Criterion Ratings.

Peer reviewed

Friedland, David L.; Michael, William B. – Educational and Psychological Measurement, 1987

A sample of 153 male police officers were subjects in a test validation study with two objectives: (1) to compare reliability estimates of a 16-item objective achievement examination scored by the conventional items right formula and by four different procedures; and (2) to obtain comparative concurrent validity coefficients of scores arising from…

Descriptors: Achievement Tests, Concurrent Validity, Correlation, Police

A Comparison of Empirical Differential Option Weighting Scoring Procedures as a Function of Inter-Item Correlation

Peer reviewed

Bejar, Issac I.; Weiss, David J. – Educational and Psychological Measurement, 1977

The reliabilities yielded by several differential option weighting scoring procedures were compared among themselves as well as against conventional testing. It was found that increases in reliability due to differential option weighting were a function of inter-item correlations. Suggestions for the implementation of differential option weighting…

Descriptors: Correlation, Forced Choice Technique, Item Analysis, Scoring Formulas

The Effect of Differential Option Weighting on Multiple-Choice Objective Tests

Peer reviewed

Hendrickson, Gerry F. – Journal of Educational Measurement, 1971

Descriptors: Correlation, Guessing (Tests), Multiple Choice Tests, Sex Differences

Effects of Empirical Option Weighting on Reliability and Validity of the GRE.

Download full text

Reilly, Richard R.; Jackson, Rex – 1972

Item options of shortened forms of the Graduate Record Examination Verbal and Quantitative tests were empirically weighted by two variants of a method originally attributed to Guttman. The first method assigned to each option of an item the mean standard score on the remaining items of all subjects choosing that option. The second procedure…

Descriptors: Correlation, Factor Analysis, Graduate Study, Scoring

Reliability of a Unilateral Horizontal Leg Power Test to Assess Stretch Load Tolerance

Peer reviewed

Direct link

Simpson, Rhianna Parker; Cronin, John – Measurement in Physical Education and Exercise Science, 2006

Drop jumping has previously been used to measure fast stretch shorten cycle (SSC) ability and stretch load tolerance. To the knowledge of these authors a test does not exist to achieve this in the horizontal direction. The purpose of this study therefore was to estimate the reliability of a new unilateral horizontal leg power test to assess these…

Descriptors: Correlation, Test Reliability, Exercise Physiology, Physical Activity Level

A Procedure for Estimating the Unique Contribution of Each Component of a Composite Test: Uniqueness Analysis of Test 500. Technical Memorandum 76-8.

Download full text

Gavin, Anne T.; Martin, Charles G. – 1976

A procedure for estimating the degree to which a subtest uniquely contributes to total test performance is presented and discussed. Uniqueness analysis may be appropriately applied to any composite measurement instrument such as a multipart test or a multitest battery to assess the unique contribution of each component to the total test. The…

Descriptors: Aptitude Tests, Correlation, Occupational Tests, Scores

Unweighted and Weighted Measures of Children's Giving.

Peer reviewed

Vandenplas-Holper, Christiane; And Others – Educational and Psychological Measurement, 1987

An unweighted and two weighted measures of children's giving were used in two experiments designed to enhance children's prosocial development. This paper describes the rationale for the construction of the measures, presents data on test-retest reliability and validity as well as treatment outcomes for the two experiments. (Author/LMO)

Descriptors: Analysis of Variance, Correlation, Elementary Education, Prosocial Behavior

Effects of Empirical Option Weighting on Reliability and Validity of an Academic Aptitude Test

Peer reviewed

Reilly, Richard R.; Jackson, Rex – Journal of Educational Measurement, 1973

The present study suggests that although the reliability of an academic aptitude test given under formula-score condition can be increased substantially through empirical option weighting, much of the increase is due to the capitalization of the keying procedure on omitting tendencies which are reliable but not valid. (Author)

Descriptors: Aptitude Tests, Correlation, Factor Analysis, Item Sampling

Comparison of the Factor Structure of Guttman-Weighted vs. Rights-Only-Weighted Tests.

Download full text

Hendrickson, Gerry F.; Green, Bert F., Jr. – 1972

It has been shown that Guttman weighting of test options results in marked increases in the internal consistency of a test. However, the effect of this type of weighting on the structure of the test is not known. Hence, the purpose of this study is to compare the factor structure of Guttman-weighted and rights-only-weighted tests and to relate the…

Descriptors: Analysis of Variance, Correlation, Factor Analysis, Item Analysis

Construct Validity of "e-rater"® in Scoring TOEFL® Essays. Research Report. ETS RR-07-21

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2007

This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…

Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)

The Effects of Choice Weights and Item Weights on the Reliability and Predictive Validity of Aptitude-Type Tests. Final Report.

Download full text

Bayuk, Robert J. – 1973

An investigation was conducted to determine the effects of response-category weighting and item weighting on reliability and predictive validity. Response-category weighting refers to scoring in which, for each category (including omit and "not read"), a weight is assigned that is proportional to the mean criterion score of examinees selecting…

Descriptors: Aptitude Tests, Correlation, Predictive Validity, Research Reports

Item Option Weighting of Achievement Tests: Comparative Study of Methods.

Download full text

Downey, Ronald G.

Previous research has studied the effects of different methods of item option weighting on the reliability and concurrent and predictive validity of achievement tests. Increases in reliability are generally found, but with mixed results for validity. Several methods of producing option weights, (i.e., Guttman internal and external weights and…

Descriptors: Achievement Tests, Comparative Analysis, Correlation, Grade Point Average

Previous Page | Next Page »

Pages: 1 | 2

Hendrickson, Gerry F.	3
Jackson, Rex	2
Reilly, Richard R.	2
Attali, Yigal	1
Bayuk, Robert J.	1
Bejar, Issac I.	1
Brown, James Dean	1
Cronin, John	1
Downey, Ronald G.	1
Friedland, David L.	1
Gavin, Anne T.	1
Gaviria, Sandra	1
Grabovsky, Irina	1
Green, Bert F., Jr.	1
Hendrickson, Amy	1
Martin, Charles G.	1
Melican, Gerald	1
Michael, William B.	1
Nunan, Anna	1
Palacio, Marcela	1
Patterson, Brian	1
Rippey, Robert M.	1
Roudabush, Glenn E.	1
Simpson, Rhianna Parker	1
More ▼