ERIC - Search Results

Showing 1 to 15 of 41 results

A Competency Model for Process Dynamics and Control and Its Use for Test Construction at University Level

Peer reviewed

Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015

This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…

Descriptors: Models, Engineering Education, Test Items, Outcome Measures

Administration and Scoring Errors of Graduate Students Learning the WISC-IV: Issues and Controversies

Peer reviewed

Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012

A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…

Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring

Parallel Measurements and the Spearman-Brown Formula

Peer reviewed

Burnett, J. Dale – Educational and Psychological Measurement, 1974

The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)

Descriptors: Statistical Analysis, Test Reliability, Testing Problems

Test Use and Test Reliability in a Curriculum for Educable Mentally Retarded Children. Working Paper Number 1.

Smith, Leon I.; Greenberg, Sandra – 1973

A discussion of selected applications of new tests developed within the context of a large-scale curriculum for educable mentally retarded (EMR) children, the Social Learning Curriculum (SLC), is presented in this paper which investigates three types of reliability that need to be demonstrated in order to provide a basis of these applications. The…

Descriptors: Curriculum Evaluation, Educational Research, Evaluation Methods, Measurement Techniques

The Use of Bayes' Estimates in the Law of Comparative Judgment.

Peer reviewed

Kaiser, Henry F. – Educational and Psychological Measurement, 1980

The use of Bayes' estimates for proportions in the Law of Comparative Judgment is suggested to avoid sample proportions of zero and one. (Author)

Descriptors: Bayesian Statistics, Comparative Analysis, Reliability, Statistical Analysis

A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory.

Peer reviewed

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980

Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)

Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability

Regression Effects on Part Scores Based on Whole-Score Selected Samples.

Peer reviewed

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984

Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)

Descriptors: Correlation, Intelligence Tests, Profiles, Scores

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

A Comparison Among Person-Fit Measures.

Frary, Robert B. – 1982

Three measures of person-fit (the extent to which an examinee's response pattern on a multiple-choice test is consistent with his ability as estimated by total score) were computed for students taking classroom tests under 12 different instructors at a comprehensive university. Supplementary questions on each test inquired concerning students'…

Descriptors: Higher Education, Multiple Choice Tests, Predictive Validity, Reliability

Stability of Physical Performance Test Scores

Baumgartner, Ted A. – Res Quart AAHPER, 1969

Descriptors: Measurement, Physical Education, Physical Examinations, Physical Fitness

Faking and Faking Detection on the Minnesota Counseling Inventory

Braun, John R.; Asta, Patricia – Meas Evaluation Guidance, 1969

This report is based on a paper presented at the annual meeting of the Educational Research Association of New York State, Kiamesha Lake, New York, November 7, 1968.

Descriptors: Adjustment (to Environment), College Freshmen, Measurement Instruments, Personality Assessment

Problems, Perspectives, and Practical Issues in Equating.

Peer reviewed

Weiss, David J., Ed. – Applied Psychological Measurement, 1987

Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)

Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis

Pitfalls In Assessing Test Speededness.

Peer reviewed

Rindler, Susan Ellerin – Journal of Educational Measurement, 1979

A sample of the literature on test speededness is reviewed; methods of assessing speededness are presented and criticized; the assumptions that underlie these methods are questioned, and alternate, multiple-administration methods are suggested. The importance of the effect of time limits is discussed. (Author/CTM)

Descriptors: Literature Reviews, Measurement Techniques, Reaction Time, Statistical Analysis

Alternatives to the Design of Manipulating a Variable to Compare Retarded and Nonretarded Subjects

Peer reviewed

Chapman, Loren; Chapman, Jean P. – American Journal of Mental Deficiency, 1975

Descriptors: Difficulty Level, Exceptional Child Research, Mental Retardation, Research Methodology

GATB: Does the Apparatus Make a Difference?

Kapes, Jerome T. – 1975

Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus). As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…

Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores
