Showing 1 to 15 of 21 results
Peer reviewed
Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017
In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…
Descriptors: Cutting Scores, Weighted Scores, Classification, Testing
Peer reviewed
Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016
Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction
Peer reviewed
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Peer reviewed
Friedland, David L.; Michael, William B. – Educational and Psychological Measurement, 1987
A sample of 153 male police officers served as subjects in a test validation study with two objectives: (1) to compare reliability estimates of a 16-item objective achievement examination scored by the conventional items-right formula and by four different procedures; and (2) to obtain comparative concurrent validity coefficients of scores arising from…
Descriptors: Achievement Tests, Concurrent Validity, Correlation, Police
Peer reviewed
Bejar, Isaac I.; Weiss, David J. – Educational and Psychological Measurement, 1977
The reliabilities yielded by several differential option weighting scoring procedures were compared among themselves as well as against conventional testing. It was found that increases in reliability due to differential option weighting were a function of inter-item correlations. Suggestions for the implementation of differential option weighting…
Descriptors: Correlation, Forced Choice Technique, Item Analysis, Scoring Formulas
Peer reviewed
Hendrickson, Gerry F. – Journal of Educational Measurement, 1971
Descriptors: Correlation, Guessing (Tests), Multiple Choice Tests, Sex Differences
Reilly, Richard R.; Jackson, Rex – 1972
Item options of shortened forms of the Graduate Record Examination Verbal and Quantitative tests were empirically weighted by two variants of a method originally attributed to Guttman. The first method assigned to each option of an item the mean standard score on the remaining items of all subjects choosing that option. The second procedure…
Descriptors: Correlation, Factor Analysis, Graduate Study, Scoring
Peer reviewed
Simpson, Rhianna Parker; Cronin, John – Measurement in Physical Education and Exercise Science, 2006
Drop jumping has previously been used to measure fast stretch shorten cycle (SSC) ability and stretch load tolerance. To the knowledge of these authors a test does not exist to achieve this in the horizontal direction. The purpose of this study therefore was to estimate the reliability of a new unilateral horizontal leg power test to assess these…
Descriptors: Correlation, Test Reliability, Exercise Physiology, Physical Activity Level
Gavin, Anne T.; Martin, Charles G. – 1976
A procedure for estimating the degree to which a subtest uniquely contributes to total test performance is presented and discussed. Uniqueness analysis may be appropriately applied to any composite measurement instrument such as a multipart test or a multitest battery to assess the unique contribution of each component to the total test. The…
Descriptors: Aptitude Tests, Correlation, Occupational Tests, Scores
Peer reviewed
Vandenplas-Holper, Christiane; And Others – Educational and Psychological Measurement, 1987
An unweighted and two weighted measures of children's giving were used in two experiments designed to enhance children's prosocial development. This paper describes the rationale for the construction of the measures, presents data on test-retest reliability and validity as well as treatment outcomes for the two experiments. (Author/LMO)
Descriptors: Analysis of Variance, Correlation, Elementary Education, Prosocial Behavior
Peer reviewed
Reilly, Richard R.; Jackson, Rex – Journal of Educational Measurement, 1973
The present study suggests that although the reliability of an academic aptitude test given under formula-score condition can be increased substantially through empirical option weighting, much of the increase is due to the capitalization of the keying procedure on omitting tendencies which are reliable but not valid. (Author)
Descriptors: Aptitude Tests, Correlation, Factor Analysis, Item Sampling
Hendrickson, Gerry F.; Green, Bert F., Jr. – 1972
It has been shown that Guttman weighting of test options results in marked increases in the internal consistency of a test. However, the effect of this type of weighting on the structure of the test is not known. Hence, the purpose of this study is to compare the factor structure of Guttman-weighted and rights-only-weighted tests and to relate the…
Descriptors: Analysis of Variance, Correlation, Factor Analysis, Item Analysis
Peer reviewed
Attali, Yigal – ETS Research Report Series, 2007
This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…
Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)
Bayuk, Robert J. – 1973
An investigation was conducted to determine the effects of response-category weighting and item weighting on reliability and predictive validity. Response-category weighting refers to scoring in which, for each category (including omit and "not read"), a weight is assigned that is proportional to the mean criterion score of examinees selecting…
Descriptors: Aptitude Tests, Correlation, Predictive Validity, Research Reports
Downey, Ronald G.
Previous research has studied the effects of different methods of item option weighting on the reliability and concurrent and predictive validity of achievement tests. Increases in reliability are generally found, but with mixed results for validity. Several methods of producing option weights (i.e., Guttman internal and external weights and…
Descriptors: Achievement Tests, Comparative Analysis, Correlation, Grade Point Average