МОГучие способности новые приемы анализа больших данных


  • T. Barclay et al. Loading databases using dataflow. SIGMOD Record, 23(4), 1994.
  • J. Choi et al. ScaLAPACK: a portable linear algebra library for distributed memory computers – design issues and performance. Computer Physics Communications, 97(1-2), 1996. High-Performance Computing in Science.
  • C.-T. Chu et al. Map-Reduce for machine learning on multicore. In NIPS, pages 281-288, 2006.
  • J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI, pages 137-150, 2004.
  • S. Dubner. Hal Varian answers your questions, February 2008.
  • M. Franklin, A. Halevy, and D. Maier. From databases to dataspaces: a new abstraction for information management. SIGMOD Rec., 34(4), 2005.
  • G. Graefe. Encapsulation of parallelism in the volcano query processing system. SIGMOD Rec., 19(2), 1990.
  • J Gray et al. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. Data Min. Knowl. Discov., 1(1), 1997.
  • Greenplum. A unified engine for RDBMS and MapReduce, 2009.
  • F. R. Hampel et al. Robust Statistics - The Approach Based on Influence Functions. Wiley, 1986.
  • J. M. Hellerstein, P. J. Haas, and H. J. Wang. Online aggregation. In ACM SIGMOD, 1997.
  • W. Holland, February 2009. Downloaded from


  • W. H. Inmon. Building the Data Warehouse. Wiley, 2005.
  • Y. E. Ioannidis et al. Zoo: A desktop experiment management environment. In VLDB, 1996.
  • A. Kaushik. Web Analytics: An Hour a Day. Sybex, 2007.
  • N. Khoussainova et al. A case for a collaborative query management system. In CIDR, 2009.
  • K. Lange. Optimization. Springer, 2004.
  • M. T. Roth and P. M. Schwarz. Don't scrap it, wrap it! A wrapper architecture for legacy data sources. In VLDB, 1997.
  • M. Stonebraker. Inclusion of new types in relational data base systems. In ICDE, 1986.
  • M. Stonebraker et al. C-store: a column-oriented dbms. In VLDB, 2005.
  • M. Stonebraker et al. Requirements for science data bases and SciDB. In CIDR, 2009.
  • A. S. Szalay et al. Designing and mining multi-terabyte astronomy archives: the sloan digital sky survey. SIGMOD Rec., 29(2), 2000.
  • R. Vuduc, J. Demmel, and K. Yelick. Oski: A library of automatically tuned sparse matrix kernels. In SciDAC, 2005.
  • M.J. Zaki and C.-T. Ho. Large-Scale Parallel Data Mining. Springer, 2000.
  • Y. Zhang, H. Herodotou, and J. Yang. Riot: I/O-efficient numerical computing without SQL. In CIDR, 2009.

    Содержание  Назад