МОГучие способности новые приемы анализа больших данных
Литература
T. Barclay et al. Loading databases using dataflow. SIGMOD Record, 23(4), 1994.
J. Choi et al. ScaLAPACK: a portable linear algebra library for distributed memory computers – design issues and performance. Computer Physics Communications, 97(1-2), 1996. High-Performance Computing in Science.
C.-T. Chu et al. Map-Reduce for machine learning on multicore. In NIPS, pages 281-288, 2006.
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI, pages 137-150, 2004.
S. Dubner. Hal Varian answers your questions, February 2008.
M. Franklin, A. Halevy, and D. Maier. From databases to dataspaces: a new abstraction for information management. SIGMOD Rec., 34(4), 2005.
G. Graefe. Encapsulation of parallelism in the volcano query processing system. SIGMOD Rec., 19(2), 1990.
J Gray et al. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. Data Min. Knowl. Discov., 1(1), 1997.
Greenplum. A unified engine for RDBMS and MapReduce, 2009.
F. R. Hampel et al. Robust Statistics - The Approach Based on Influence Functions. Wiley, 1986.
J. M. Hellerstein, P. J. Haas, and H. J. Wang. Online aggregation. In ACM SIGMOD, 1997.