NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20012
What Works Clearinghouse Rating
Showing 391 to 404 of 404 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mulvenon, Sean W.; Stegman, Charles E. – Journal of Educational Research & Policy Studies, 2006
As part of No Child Left Behind (NCLB) legislation, many states are using confidence intervals to determine a range of scores for evaluating a school system. More specifically, the states are employing confidence intervals to help minimize measurement error in determining a school system's performance. The methodology and techniques employed in…
Descriptors: Federal Legislation, Computation, Intervals, Error of Measurement
Cizek, Gregory J.; Husband, Timothy H. – 1997
The contrasting groups method is one of many possible methods for setting passing scores. The most commonly used method is probably that developed by W. H. Angoff (1971), but it has been suggested that the Angoff method may not be appropriate for many standard setting applications in education. The contrasting groups method is explored as an…
Descriptors: Cutting Scores, Educational Research, Educational Testing, Judges
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Pena, Elizabeth D.; Spaulding, Tammie J.; Plante, Elena – American Journal of Speech-Language Pathology, 2006
Purpose: The normative group of a norm-referenced test is intended to provide a basis for interpreting test scores. However, the composition of the normative group may facilitate or impede different types of diagnostic interpretations. This article considers who should be included in a normative sample and how this decision must be made relative…
Descriptors: Guides, Simulation, Language Impairments, Child Language
Peer reviewed Peer reviewed
Direct linkDirect link
Graham, John W.; Taylor, Bonnie J.; Olchowski, Allison E.; Cumsille, Patricio E. – Psychological Methods, 2006
The authors describe 2 efficiency (planned missing data) designs for measurement: the 3-form design and the 2-method measurement design. The 3-form design, a kind of matrix sampling, allows researchers to leverage limited resources to collect data for 33% more survey questions than can be answered by any 1 respondent. Power tables for estimating…
Descriptors: Cost Effectiveness, Structural Equation Models, Psychological Studies, Data Collection
Kim, Seock-Ho; And Others – 1992
Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…
Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)
Peer reviewed Peer reviewed
Direct linkDirect link
Lei, Pui-Wa; Chen, Shu-Ying; Yu, Lan – Journal of Educational Measurement, 2006
Mantel-Haenszel and SIBTEST, which have known difficulty in detecting non-unidirectional differential item functioning (DIF), have been adapted with some success for computerized adaptive testing (CAT). This study adapts logistic regression (LR) and the item-response-theory-likelihood-ratio test (IRT-LRT), capable of detecting both unidirectional…
Descriptors: Evaluation Methods, Test Bias, Computer Assisted Testing, Multiple Regression Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Roussos, Louis A.; Ozbek, Ozlem Yesim – Journal of Educational Measurement, 2006
The development of the DETECT procedure marked an important advancement in nonparametric dimensionality analysis. DETECT is the first nonparametric technique to estimate the number of dimensions in a data set, estimate an effect size for multidimensionality, and identify which dimension is predominantly measured by each item. The efficacy of…
Descriptors: Evaluation Methods, Effect Size, Test Bias, Item Response Theory
Peer reviewed Peer reviewed
Tang, K. Linda; Algina, James – Multivariate Behavioral Research, 1993
Type I error rates of four multivariate tests (Pilai-Bartlett trace, Johansen's test, James' first-order test, and James' second-order test) were compared for heterogeneous covariance matrices in 360 simulated experiments. The superior performance of Johansen's test and James' second-order test is discussed. (SLD)
Descriptors: Analysis of Covariance, Analysis of Variance, Comparative Analysis, Equations (Mathematics)
Peer reviewed Peer reviewed
Direct linkDirect link
Long, Jeffrey D. – Psychological Methods, 2005
Often quantitative data in the social sciences have only ordinal justification. Problems of interpretation can arise when least squares multiple regression (LSMR) is used with ordinal data. Two ordinal alternatives are discussed, dominance-based ordinal multiple regression (DOMR) and proportional odds multiple regression. The Q[superscript 2]…
Descriptors: Simulation, Social Science Research, Error of Measurement, Least Squares Statistics
Tang, Huixing – 1994
This paper describes an item response theory (IRT) based method of differential item functioning (DIF) detection that involves neither separate calibration nor ability grouping. IRT is used to generate residual scores, scores free of the effects of person or group ability and item difficulty. Analysis of variance is then used to test the group…
Descriptors: Ability Grouping, Analysis of Variance, Goodness of Fit, Identification
Ito, Kyoko; Sykes, Robert C. – 1994
Responses to previously calibrated items administered in a computerized adaptive testing (CAT) mode may be used to recalibrate the items. This live-data simulation study investigated the possibility, and limitations, of on-line adaptive recalibration of precalibrated items. Responses to items of a Rasch-based paper-and-pencil licensure examination…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Difficulty Level
Sandler, Andrew B. – 1987
Statistical significance is misused in educational and psychological research when it is applied as a method to establish the reliability of research results. Other techniques have been developed which can be correctly utilized to establish the generalizability of findings. Methods that do provide such estimates are known as invariance or…
Descriptors: Analysis of Covariance, Analysis of Variance, Correlation, Discriminant Analysis
Rasor, Richard E.; Barr, James – 1998
This paper provides an overview of common sampling methods (both the good and the bad) likely to be used in community college self-evaluations and presents the results from several simulated trials. The report begins by reviewing various survey techniques, discussing the negative and positive aspects of each method. The increased accuracy and…
Descriptors: Community Colleges, Comparative Analysis, Cost Effectiveness, Data Collection
« Previous Page | Next Page
Pages: 1  |  ...  |  17  |  18  |  19  |  20  |  21  |  22  |  23  |  24  |  25  |  26  |  27