ERIC - Search Results

Showing 1 to 15 of 17 results

Reliability and Validity of the Research Methods Skills Assessment

Peer reviewed

Smith, Tamarah; Smith, Samantha – International Journal of Teaching and Learning in Higher Education, 2018

The Research Methods Skills Assessment (RMSA) was created to measure psychology majors' statistics knowledge and skills. The American Psychological Association's Guidelines for the Undergraduate Major in Psychology (APA, 2007, 2013) served as a framework for development. Results from a Rasch analysis with data from n = 330 undergraduates showed…

Descriptors: Psychology, Statistics, Undergraduate Students, Item Response Theory

Misconceptions about the Naglieri Nonverbal Ability Test: A Commentary of Concerns and Disagreements

Peer reviewed

Naglieri, Jack A.; Ford, Donna Y. – Roeper Review, 2015

Black and Hispanic students are undeniably underidentified as gifted and underrepresented in gifted education. The underrepresentation of the two largest groups of "minority" students is long-standing, dating several decades, and is a serious area of contention. Most debates focus on the efficacy of traditional intelligence tests with…

Descriptors: Misconceptions, Nonverbal Ability, Ability, Ability Identification

Replication and Robustness in Developmental Research

Peer reviewed

Duncan, Greg J.; Engel, Mimi; Claessens, Amy; Dowsett, Chantelle J. – Developmental Psychology, 2014

Replications and robustness checks are key elements of the scientific method and a staple in many disciplines. However, leading journals in developmental psychology rarely include explicit replications of prior research conducted by different investigators, and few require authors to establish in their articles or online appendices that their key…

Descriptors: Replication (Evaluation), Robustness (Statistics), Developmental Psychology, Educational Research

Stability of Rasch Scales over Time

Peer reviewed

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…

Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

Reliability Reporting across Studies Using the Buss Durkee Hostility Inventory

Peer reviewed

Vassar, Matt; Hale, William – Journal of Interpersonal Violence, 2009

Empirical research on anger and hostility has pervaded the academic literature for more than 50 years. Accurate measurement of anger/hostility and subsequent interpretation of results requires that the instruments yield strong psychometric properties. For consistent measurement, reliability estimates must be calculated with each administration,…

Descriptors: Research Methodology, Psychometrics, Psychological Patterns, Affective Behavior

Reliability of Marking in Eight GCE Examinations

Peer reviewed

Murphy, R. J. L. – British Journal of Educational Psychology, 1978

Eight recent General Certificate of Education (GCE) examinations, containing mainly free-response questions, were investigated in terms of their marking reliability. The tests of 200 randomly selected candidates from each subject were re-marked by a senior GCE examiner, and these marks were compared with the marks awarded previously as a result of…

Descriptors: Educational Psychology, Examiners, Grading, Item Analysis

Estimating Reliability and Generalizability in Coefficients in Two-Facet Designs.

Peer reviewed

Hopkins, Kenneth D. – Journal of Special Education, 1983

This article illustrates the use of generalizability theory in special education to estimate the reliability of a measure when there is more than one source of error in the universe of inference and how the effects from changing the number of items and/or raters can be evaluated. (Author)

Descriptors: Generalization, Item Analysis, Mathematics, Research Methodology

The Rathus Assertiveness Schedule: Reliability at the Junior High School Level

Peer reviewed

Vaal, Joseph J.; McCullagh, James – Adolescence, 1977

This research was an attempt to determine the usefulness of the Rathus Assertiveness Schedule with pre-adolescent and early adolescent students. Previously it has been used with outpatients, institutionalized adults, or with college students. The RAS is a thirty-item schedule that was developed for measuring assertiveness. (Author/RK)

Descriptors: Adolescents, Assertiveness, Item Analysis, Junior High School Students

New Directions in Matching Familiar Figures Test Research Resulting From Scoring and Item Analyses.

Brinzer, Raymond J. – 1979

The problem engendered by the Matching Familiar Figures (MFF) Test is one of instrument integrity (II). II is delimited by validity, reliability, and utility of MFF as a measure of the reflective-impulsive construct. Validity, reliability and utility of construct assessment may be improved by utilizing: (1) a prototypic scoring model that will…

Descriptors: Conceptual Tempo, Difficulty Level, Item Analysis, Research Methodology

Using Item Data for Evaluating Criterion Reference Measures with an Empirical Investigation of Index Consistency.

Meredith, Keith E.; Sabers, Darrell L. – 1972

Data required for evaluating a Criterion Referenced Measurement (CRM) is described with a matrix. The information within the matrix consists of the "pass-fail" decisions of two CRMs. By differentially defining these two CRMs, different concepts of reliability and validity can be examined. Indices suggested for analyzing the matrix are listed with…

Descriptors: Criterion Referenced Tests, Factor Analysis, Item Analysis, Research Methodology

The Identification of Biased Items.

Sinnott, Loraine T. – 1982

A standard method for exploring item bias is the intergroup comparison of item difficulties. This paper describes a refinement and generalization of this technique. In contrast to prior approaches, the proposed method deletes outlying items from the formulation of a criterion for identifying items as deviant. It also extends the mathematical…

Descriptors: College Entrance Examinations, Difficulty Level, Higher Education, Item Analysis

How Reliable Are Informal Reading Inventories?

Peer reviewed

Spector, Janet E. – Psychology in the Schools, 2005

Informal Reading Inventories (IRI) are often recommended as instructionally relevant measures of reading. However, they have also been criticized for inattention to technical quality. Examination of reliability evidence in nine recently revised IRIs revealed that fewer than half report reliability. Several appear to have sufficient reliability for…

Descriptors: Informal Reading Inventories, Reading Instruction, Reading Difficulties, Reading Research

Development of a Work Sample Criterion for General Vehicle Mechanic.

Engel, John D. – 1970

A work sample criterion test was developed for General Vehicle Repairman, MOS 63C30 and 63C40. Test items covered three task categories: troubleshooting, corrective action, and preventive maintenance. Thirty-eight organizational mechanics were tested at Fort Knox, Kentucky. Data were also collected on the quality of performance, for example, use…

Descriptors: Auto Mechanics, Criterion Referenced Tests, Equivalency Tests, Item Analysis

Linear Discriminant Analysis versus Logistic Regression: A Comparison of Classification Errors in the Two-Group Case

Peer reviewed

Lei, Pui-Wa; Koehly, Laura M. – Journal of Experimental Education, 2003

Classification studies are important for practitioners who need to identify individuals for specialized treatment or intervention. When interventions are irreversible or misclassifications are costly, information about the proficiency of different classification procedures becomes invaluable. This study furnishes information about the relative…

Descriptors: Monte Carlo Methods, Classification, Discriminant Analysis, Regression (Statistics)

Automated Sentence Completion Scoring.

Veldman, Donald J.

A 62-item form of the sentence-completion technique requiring one-word responses was administered to 1718 undergraduates in teacher education. The data were punched on cards and lists of different responses were compiled. Responses indicating evasion, hostility, anxiety and depression were identified for each stem to form a scoring "dictionary." A…

Descriptors: Affective Measures, College Students, Correlation, Data Processing
