NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,636 to 1,650 of 3,295 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Helms, Janet E. – American Psychologist, 2009
In defending tests of cognitive abilities, knowledge, or skills (CAKS) from the skepticism of their "family members, friends, and neighbors" and aiding psychologists forced to defend tests from "myth and hearsay" in their own skeptical social networks (p. 215), Sackett, Borneman, and Connelly focused on evaluating validity coefficients, racial or…
Descriptors: Test Validity, Cognitive Ability, Error of Measurement, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Wicherts, Jelte M.; Millsap, Roger E. – American Psychologist, 2009
Sacked, Borne man, and Connelly recently discussed several criticisms that are often raised against the use of cognitive tests in selection. One criticism concerns the issue of measurement bias in cognitive ability tests with respect to specific groups in society. Sacked et AL. (2008) stated that "absent additional information, one cannot…
Descriptors: Prediction, Cognitive Tests, Cognitive Ability, Statistical Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Achenbach, Thomas M. – Journal of Clinical Child and Adolescent Psychology, 2011
The special section articles demonstrate the importance of informant discrepancies. They also illustrate challenges posed by discrepancies, plus opportunities for advancing research and practice. This commentary addresses these cross-cutting issues: (a) Discrepancies affect many kinds of assessment besides ratings of children's problems. (b)…
Descriptors: Measurement, Error of Measurement, Evaluation Methods, Young Children
Stewart, Orbie L. – ProQuest LLC, 2010
The central purpose of this study was to determine if teaching the subject matter of diversity or social psychology to military sponsored college students would change the level of sensitivity (behavior and attitude) to diversity as measured by an instrument called The Inventory of Cross Cultural Sensitivity developed by Kenneth Cushner (1986).…
Descriptors: College Students, Course Content, Cultural Pluralism, Social Psychology
Peer reviewed Peer reviewed
Direct linkDirect link
Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010
"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…
Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Anderson, Trevor R.; Rogan, John M. – Biochemistry and Molecular Biology Education, 2010
Student assessment is central to the educational process and can be used for multiple purposes including, to promote student learning, to grade student performance and to evaluate the educational quality of qualifications. It is, therefore, of utmost importance that assessment instruments are of a high quality. In this article, we present various…
Descriptors: Educational Assessment, Educational Quality, Student Evaluation, Educational Research
Peer reviewed Peer reviewed
Direct linkDirect link
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010
Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…
Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics
Rothman, Robert – School Administrator, 2010
At a time when teacher quality has emerged as a key factor in student learning, a statistical technique that determines the "value added" that teachers bring to student achievement is getting new scrutiny. Value-added measures compare students' growth in achievement to their expected growth, based on prior achievement and demographic…
Descriptors: Teacher Effectiveness, Outcomes of Education, Teaching Methods, Accountability
Peer reviewed Peer reviewed
Direct linkDirect link
Psychological Methods, 2008
Reports an error in "Confidence intervals for gamma-family measures of ordinal association" by Carol M. Woods (Psychological Methods, 2007[Jun], Vol 12[2], 185-204). The note corrects simulation results presented in the article concerning the performance of confidence intervals (CIs) for Spearman's r-sub(s). An error in the author's C++ code…
Descriptors: Intervals, Computation, Error of Measurement, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Ludtke, Oliver; Marsh, Herbert W.; Robitzsch, Alexander; Trautwein, Ulrich; Asparouhov, Tihomir; Muthen, Bengt – Psychological Methods, 2008
In multilevel modeling (MLM), group-level (L2) characteristics are often measured by aggregating individual-level (L1) characteristics within each group so as to assess contextual effects (e.g., group-average effects of socioeconomic status, achievement, climate). Most previous applications have used a multilevel manifest covariate (MMC) approach,…
Descriptors: Statistical Analysis, Sampling, Context Effect, Simulation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2009
A series of resampling studies was conducted to compare the accuracy of equating in a common item design using four different methods: chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating, and the circle-arc method. Four operational test forms, each containing more than 100 items, were used for…
Descriptors: Sampling, Sample Size, Accuracy, Test Items
Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward – Educational Testing Service, 2009
This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic…
Descriptors: Equated Scores, Test Items, Difficulty Level, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Forero, Carlos G.; Maydeu-Olivares, Alberto; Gallardo-Pujol, David – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Factor analysis models with ordinal indicators are often estimated using a 3-stage procedure where the last stage involves obtaining parameter estimates by least squares from the sample polychoric correlations. A simulation study involving 324 conditions (1,000 replications per condition) was performed to compare the performance of diagonally…
Descriptors: Factor Analysis, Models, Least Squares Statistics, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Schochet, Peter Z. – Evaluation Review, 2009
In social policy evaluations, the multiple testing problem occurs due to the many hypothesis tests that are typically conducted across multiple outcomes and subgroups, which can lead to spurious impact findings. This article discusses a framework for addressing this problem that balances Types I and II errors. The framework involves specifying…
Descriptors: Policy, Evaluation, Testing Problems, Hypothesis Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Pages: 1  |  ...  |  106  |  107  |  108  |  109  |  110  |  111  |  112  |  113  |  114  |  ...  |  220