American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. (1999). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. New York: Holt, Rinehart and Winston.
Deanna, L. M., & Pere, M. (2004). Setting standards in education: Choosing the best method for your assessment and population. Educational Testing Service. Unpublished paper.
Ellis, R. (1994). The study of second language acquisition. Oxford: Oxford University Press.
Ellis, R. (1997). SLA research and language teaching. Oxford: Oxford University Press.
Erwin, T. D., & Wise, S. L. (2003). Standard setting. EBSCO Publishing.
Ferdous, A., & Plake, B. (2003). The use of subsets of test questions in an Angoff standard setting method. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, IL, April 22-24, 2003.
Flanagan, J. C. (1951). Units, scores, and norms. In E. F. Lindquist (Ed.), Educational measurement. Washington, DC: American Council on Education.
Green, D. R. (2001). Interpreting the results of three different standard setting procedures. Paper presented at the annual meeting of the American Educational Research Association, Seattle, WA, April 14, 2001.
Hansen, I. V. (1985). Sex differences in English achievement. Highway One, 8(1-2), 259-73.
Jaeger, R. M. (1989). Certification of student competence. In R. L. Linn (Ed.), Educational measurement. New York: American Council on Education/Macmillan.
Kaplan, D. (1993). The impact of BIB-spiralling induced missing data patterns on goodness-of-fit tests in factor analysis. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA, April 20-24, 1992.
Lewis, D. M., Mitzel, H. C., & Green, D. R. (1996). Standard setting: A bookmark approach. Paper presented at the CCSSO National Conference on Large Scale Assessment.
Linn, J. (2003). The bookmark standard setting procedure: Strengths and weaknesses. Centre for Research in Applied Measurement and Evaluation, University of Alberta, Canada. Unpublished paper.
Linn, R. L., & Gronlund, N. E. (2000). Measurement and assessment in teaching. Upper Saddle River, NJ: Merrill.
MacIntyre, P. D., Baker, S. C., Clement, R., & Donovan, L. A. (2002). Sex and age effects on willingness to communicate, anxiety, perceived competence, and L2 motivation among junior high school French immersion students. Language Learning, 52(3), 537-64.
Messick, S. (1992). Validity of test interpretation and use. In M. C. Alkin (Ed.), Encyclopedia of educational research (pp. 1487-1495). New York: Macmillan.
Phakiti, A. (2003). A closer look at gender and strategy use in L2 reading. Language Learning, 53(4), 649-702.
Skaggs, G. (2001). Item disordinality with the bookmark standard setting procedure. Paper presented at the 2001 annual meeting of the National Council on Measurement in Education, Seattle, WA.
U.S. Department of Education. (1996). Goals 2000: A progress report. Washington, DC: Author.