Showing 751 to 765 of 1,166 results
Peer reviewed
Shermis, Mark D.; And Others – Journal of Developmental Education, 1996
Describes a study to pilot-test a new reading assessment instrument designed to function in a computerized adaptive testing (CAT) environment. Indicates that the measure showed fair internal consistency and correlated well with other tests. Discusses advantages and disadvantages of CAT systems and describes the HyperCAT testing program. (23…
Descriptors: Computer Assisted Testing, Diagnostic Tests, Higher Education, Pilot Projects
Peer reviewed
Ramsay, James O. – Psychometrika, 1989
An alternative to the Rasch model is introduced. It characterizes strength of response according to the ratio of ability and difficulty parameters rather than their difference. Joint estimation and marginal estimation models are applied to two test data sets. (SLD)
Descriptors: Ability, Bayesian Statistics, College Entrance Examinations, Comparative Analysis
Peer reviewed
Cahan, Sorel – Educational and Psychological Measurement, 1989
Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…
Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models
Peer reviewed
Suzuki, Shinobu; Rancer, Andrew S. – Communication Monographs, 1994
Finds that the two-factor solution of the Argumentativeness Scale and the Verbal Aggressiveness Scale was a reasonable overall fit to samples of both U.S. and Japanese college students; orthogonality of the two constructs (argumentativeness and verbal aggressiveness) held for both samples; and the two scales had satisfactory construct validity for…
Descriptors: Communication Research, Construct Validity, Cross Cultural Studies, Evaluation Methods
Peer reviewed
O'Grady, Kevin E.; Medoff, Deborah R. – Multivariate Behavioral Research, 1991
A procedure for evaluating a variety of rater reliability models is presented. A multivariate linear model is used to describe and assess a set of ratings. Parameters are represented in terms of a factor analytic model, and maximum likelihood methods test the model parameters. Illustrative examples are presented. (SLD)
Descriptors: Comparative Analysis, Correlation, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed
Carroll, John B. – Intelligence, 1995
It is argued that the statements and accusations made by Stephen Jay Gould about the use of factor analysis are incorrect and unjustified and that tests properly designed for the purpose can adequately measure a "general" or "g" factor of intelligence, particularly in view of the developments in testing since "The…
Descriptors: Factor Analysis, Intelligence Tests, Measurement Techniques, Nature Nurture Controversy
Peer reviewed
Bachman, Lyle F. – Language Testing, 2000
Reviews developments in language testing research and practice over the last 20 years, and suggests future directions in the areas of professionalizing the field and validation research. Argues that concerns for ethical conduct must be grounded in valid test use, so that professionalization and validation research are inseparable. (Author/VWL)
Descriptors: Ethics, Language Research, Language Tests, Second Language Instruction
Ozsevgec, Tuncay; Cepni, Salih – Online Submission, 2006
In order to determine students' achievement, science teachers have to develop their own assessment tools. This study attempts to find out the relationship between the teachers' assessment tools and students' cognitive development according to the teachers' teaching experiences. Six open-ended survey questions were developed and delivered to 59…
Descriptors: Foreign Countries, Correlation, Science Teachers, Evaluation Methods
Peer reviewed
Graham, James M. – Educational and Psychological Measurement, 2006
Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…
Descriptors: Models, Test Theory, Reliability, Structural Equation Models
Peer reviewed
Wilson, Mark; Allen, Diane D.; Li, Jun Corser – Health Education Research, 2006
This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME "Standards for Educational and…
Descriptors: Health Education, Self Efficacy, Health Behavior, Measures (Individuals)
Peer reviewed
Boman, Peter; Curtis, David; Furlong, Michael J.; Smith, Douglas C. – Journal of Psychoeducational Assessment, 2006
The construct validity of the Australian version of the Multidimensional School Anger Inventory-Revised (MSAI-R) was examined using exploratory factor analysis (EFA), Rasch analysis, and confirmatory factor analysis (CFA) on a sample of 1,400 Australian students enrolled in Years 8 through 12. The EFA revealed a strong replication of the MSAI-R's…
Descriptors: Affective Measures, Psychological Patterns, Construct Validity, Reliability
Kennedy, Lauren Culzean – Online Submission, 2007
This research paper describes the benefits of using an activity-based rhetorical perspective to develop English for specific purposes (ESP) test specifications. This approach expands the potential of ESP test specifications to analyze and describe target language use (TLU) situations, TLU tasks, and ESP test tasks. Multiple activity systems are…
Descriptors: Freshman Composition, Tests, English for Academic Purposes, Rhetorical Theory
Peer reviewed
Wang, Tzu-Hua – Journal of Computer Assisted Learning, 2007
The web-based formative assessment developed in this research is named Formative Assessment Module of the Web-based Assessment and Test Analysis System (FAM-WATA). FAM-WATA is a multiple-choice web-based formative assessment module containing six effective strategies: 'repeat the test', 'correct answers are not given', 'query scores', 'ask…
Descriptors: Foreign Countries, Self Supporting Students, Student Attitudes, Internet
Hayward, Pamela A. – 1995
This review critiques the use of Lev Vygotsky's concept of the zone of proximal development (ZPD) in quantitative research that focuses on the role communication plays in learning. A study that makes claims in terms of the ZPD should include a pretest, a problem-solving activity, and a posttest. Without these minimal elements, researchers are not…
Descriptors: Communication Research, Communication (Thought Transfer), Learning Processes, Pretests Posttests
Lai, Morris K.; Saka, Thomas – 1993
Two studies investigated factors affecting the scores of Hawaii students taking the verbal subtest of the Scholastic Aptitude Test (SAT). For the past several years, the mean verbal scores of Hawaii students have consistently been among the lowest 10% of all states. The first study addressed the identification of items and types of items that have…
Descriptors: Comparative Analysis, High School Seniors, High Schools, Instructional Effectiveness