NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 6,856 to 6,870 of 15,055 results Save | Export
Peer reviewed Peer reviewed
Wilcox, Rand R. – Educational and Psychological Measurement, 1981
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability
Peer reviewed Peer reviewed
Uebersax, John S. – Educational and Psychological Measurement, 1982
A more general method for calculating the Kappa measure of nominal rating agreement among multiple raters is presented. It can be used across a broad range of rating designs, including those in which raters vary with respect to their base rates and how many subjects they rate in common. (Author/BW)
Descriptors: Mathematical Formulas, Statistical Significance, Test Reliability
Peer reviewed Peer reviewed
Feldt, Leonard S.; Charter, Richard A. – Measurement and Evaluation in Counseling and Development, 2003
Evaluating a test's reliability often requires dividing it into 3 or more unequal parts, which causes violation of the tau equivalence assumption of Cronbach's alpha. This article presents a criterion for abandoning alpha and an approach for computing a more appropriate estimate of reliability, the Gilmer-Feldt coefficient. (Author)
Descriptors: Counseling, Evaluation Methods, Psychometrics, Test Reliability
Peer reviewed Peer reviewed
Blau, Gary J. – Journal of Vocational Behavior, 1988
Examined the reliability and validity of a career commitment measure using employees (N=266) of newspaper and insurance companies. Results showed career commitment could be reliably measured and was operationally distinct from job involvement and organizational commitment. Discusses findings in terms of meaning of career commitment. (Author/ABL)
Descriptors: Careers, Employees, Test Reliability, Test Validity
Peer reviewed Peer reviewed
van der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988
Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programing problem solvable by algorithms that already exist. (TJH)
Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability
Peer reviewed Peer reviewed
Traub, Ross E.; Rowley, Glenn L. – Educational Measurement: Issues and Practice, 1991
The idea of test consistency is illustrated, with reference to two sets of test scores. A mathematical model is used to explain the relative consistency and relative inconsistency of measurements, and a means of indexing reliability is derived using the model. Practical aspects of estimating reliability are considered. (TJH)
Descriptors: Mathematical Models, Test Reliability, True Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Zinbarg, Richard E.; Revelle, William; Yovel, Iftah; Li, Wen – Psychometrika, 2005
We make theoretical comparisons among five coefficients--Cronbach's [alpha], Revelle's [beta], McDonald's [omega][sub h], and two alternative conceptualizations of reliability. Though many end users and psychometricians alike may not distinguish among these five coefficients, we demonstrate formally their nonequivalence. Specifically, whereas…
Descriptors: Psychometrics, Test Reliability, Rating Scales, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Noreau, Luc; Lepage, Celine; Boissiere, Lucie; Picard, Roger; Fougeyrollas, Patrick; Mathieu, Jean; Desmarais, Gilbert; Nadeau, Line – Developmental Medicine & Child Neurology, 2007
The objectives of this study were: (1) to examine the psychometric properties of the Assessment of Life Habits (LIFE-H) for children; and (2) to draw a profile of the level of participation among children of 5 to 13 years of age with various impairments. The research team adapted the adult version of the LIFE-H in order to render it more…
Descriptors: Genetic Disorders, Head Injuries, Neurological Impairments, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Murdock, Linda C.; Cost, Hollie C.; Tieso, Carol – Focus on Autism and Other Developmental Disabilities, 2007
The "Social-Communication Assessment Tool" (S-CAT) was created as a direct observation instrument to quantify specific social and communication deficits of children with autism spectrum disorders (ASD) within educational settings. In this pilot study, the instrument's content validity and interrater reliability were investigated to determine the…
Descriptors: Nonverbal Communication, Autism, Content Validity, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Won-Chan – Applied Psychological Measurement, 2007
This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…
Descriptors: Simulation, Error of Measurement, Scoring, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Stein, L. A. R.; Katz, Brian; Colby, Suzanne M.; Barnett, Nancy P.; Golembeske, C.; Lebeau-Craven, R.; Monti, P. M. – Journal of Child & Adolescent Substance Abuse, 2007
The purpose of this study was to evaluate a brief version of the Alcohol Expectancy Questionnaire-Adolescent (AEQ-A; Brown, Christiansen, & Goldman, 1987). The original AEQ-A was reduced to seven items (called the AEQ-AB). Principal Components Analysis (PCA) was performed and two factors emerged (General Positive Effects and Potential Negative…
Descriptors: Alcohol Abuse, Drinking, Questionnaires, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Mackintosh, N. J. – Intelligence, 2007
Mackintosh and Bennett [Mackintosh, N. J., Bennett, E. S. (2005). What do Raven's Matrices measure? An analysis in terms of sex differences. "Intelligence, 33," 663-674] reported that male students obtained higher scores than females on Raven's items that required for their solution addition/subtraction or distribution of two rules, but…
Descriptors: Gender Differences, Sample Size, Scores, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Ginns, Paul; Ellis, Rob A. – British Journal of Educational Technology, 2009
This paper reports on the development of a scale for determining the quality of the student e-learning experience at the degree level when the student learning context is predominately a campus-based experience. Rapid developments in the use of information and communication technologies (ICT) in higher education require methods for evaluating the…
Descriptors: Undergraduate Students, Quality Control, Psychometrics, Student Experience
Brody, Leslie R.; Hay, Deborah H. – 1991
This paper reports on evaluations of a projective measure of self-esteem adapted from the Tasks of Emotional Development (TED). The evaluations were conducted in 7 studies with a total sample of 416 children and adults. The revised TED uses a five-point scoring system ranging from negative to positive self-esteem. Interrater reliability in the…
Descriptors: Adults, Children, Interrater Reliability, Measurement Techniques
Fitz, Don – 1984
The Client Observation Checklist (COC) was developed to evaluate Project ADAPT's intervention in three behavioral areas: bathing; dressing; and socialization. Project ADAPT is designed to provide services to meet the needs of chronically mentally ill residents of nursing homes. Specifically, the project provides staff trained to work with the…
Descriptors: Client Characteristics (Human Services), Hygiene, Institutionalized Persons, Interrater Reliability
Pages: 1  |  ...  |  454  |  455  |  456  |  457  |  458  |  459  |  460  |  461  |  462  |  ...  |  1004