Big data y NoSQL: su rol en la revolución del cloud computing y sus retos hacia la estandarización
DOI:
https://doi.org/10.33304/revinv.v06n2-2015008Palabras clave:
Bases de datos, Big Data, Cloud computing, NoSQLResumen
El auge en la utilización de las tecnologías de información (TI) orientadas a cloud computing ha generado grandes cantidades y variedades de datos, o Big Data, en los centros de datos de las organizaciones proveedoras de estos servicios, generando retos a los desarrolladores y administradores de las TI para el manejo eficiente de esos recursos. Frente a este fenómeno, las bases de datos relacionales son consideradas por los profesionales de las TI como una obstrucción para ofrecer las características esenciales del cloud computing. Esto ha impulsado una serie de nuevos enfoques y tecnologías para la gestión de datos, conocidos como bases de datos no relacionales o NoSQL, diseñadas para proveer rapidez, escalabilidad, alta disponibilidad y elasticidad, facilitando la administración de datos en las soluciones tecnológicas en cloud computing. En este artículo se exponen los conceptos de Big Data, sus principales herramientas tecnológicas para la gestión de datos, el movimiento NoSQL, el rol de estos en la revolución del cloud computing y los retos de estas tecnologías hacia la estandarización.Descargas
Citas
A. El Abbadi, “. D. (2010). Big Data y Cloud Computing : New Wine or just New Bottles? The VLDB Endowment VLDB Endowment Hompage archive, 3(1), 1647-1648.
Abu-libdeh, H., & Weatherspoon, H. (2010). RACS : A Case for Cloud Storage Diversity. RACS: A Case for Cloud Storage Diversity, 229240.
Across, P., & Hardware, H. (2007). The Hadoop Distributed File System: Architecture and Design. 26th Symposium on Mass Storage Systems and Technologies (MSST), 1-14. Obtenido de P. Across y H. Hardware, “The Hadoop Distributed File System: Architecture and Design,” 26th Symposium on Mass Storage Systems and Technologies (MSST). 2007. pp. 114.
Amazon. (09 de 04 de 2013). SimpleDB. O b t e n i d o d e http://aws.amazon.com/simpledbApache. (09 de 04 de 2013).
Apache Cassandra . Obtenido de http://cassandra.apache.org
Apache. (09 de 04 de 2013). ApacheCouchDB. Obtenido de http://couchdb.apache.org
Armbrust, B., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., . . . Rabkin, A. (2010). Above the Clouds: A View of Cloud Computing. Communications of the ACM, 53(4), 50-58.
Armbrust, M., Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., . . . Zaharia, M. (2009). Above the Clouds: A Berkeley View of Cloud Computing Cloud Computing: An Old Idea Whose Time Has ( Finally ) Come. 7-13.
Bartholomew, D. (2010). SQL vs . NoSQL The Case for NoSQL. Linux Journal, 2010, 1-8. Bhardwaj, S., Jain, L., & Jain, S. (2010). An Approach for Investigating Perspective of Cloud Software-as-a-Service (SaaS). International Journal of Computer Applications., 10(2), 44-47.
Bhardwaj, S., Jain, L., & Jain, S. (2010). Cloud computing: A study of infraestructure as a services ( IaaS ). Int. J. Eng. Inf. Technol., 2(1), 60-63.
Bobrowski, S. (2011). Optimal Multitenant Designs for Cloud Apps. 2011 IEEE 4th International Conference on Cloud Computing, 654-659.
Borkar, V., Carey, M., & Li, C. (2012). Inside Big Data Management: Ogres, Onions, or Parfaits? 15th International Conference on Extending Database Technology, 3-14.
Borthakur, D. R., & Gray, J. S. (2011). Apache hadoop goes realtime at Facebook. Proceedings of the 2011 international conference on Management of data - SIGMOD 11, 71. Obtenido de Borthakur, D., Rash, S., Schmidt, R., Aiyer, A.; Gray, J., Sen Sarma, J., Muthukkaruppan, K., Spiegelberg, N., Kuang, H., Ranganathan, K., Molkov, D., Menon, A. “Apache hadoop goes realtime at Facebook,” in Proceedings of the 2011 international conferenc.
Bowers, K., Juels, A., & Oprea, A. (2009). HAI : AHigh-Availability and Integrity Layer for Cloud Storage. The 16th ACM conference on Computer and communications security, 1-12.
Brewer, E. (2000). Towards Robust Distributed Systems. ACM Symposium on Principles of Distributed Computing, 7.
Buyya, R., Yeo, C. V., Broberg, J., & Brandic, I. (2009). “Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems, 25(6), 599-616.
Calder, B., Wang, J., Ogus, A., Nilakantan, N., Skjolsvold, A., Mckelvie, S., . . . Dayanand, S. (2011). Windows Azure Storage : A Highly Available Cloud Storage Service with Strong Consistency. 143-157.
Cattell, R. (2010). Scalable SQL and NoSQL Data Stores. 39, 12-27. Chang, F., Dean, J., Ghemawat, S., Hsieh, W., Wallach, D. B., Chandra, T., . . . Gruber, R. (2006). Bigtable : A Distributed Storage System for Structured Data.
Chang, V., Bacigalupo, D., Wills, G., & De Roure, D. (2010). V. Chang, D. Bacigalupo, G. Wills, y D. De Roure, “ACategorisation of Cloud Computing Business Models,” 10th IEEE. V.
Chang, D. Bacigalupo, G. Wills, y D. De Roure, “A Categorisation of Cloud Computing Busine ss Mode ls, ” ACM International Conference on Cluster, Cloud and Grid Computing, 509-512.
Choo, K. (2010). Cloud computing : Challenges and future directions. Trends & Issues in Crime and Criminal Justice, 1(400), 1-8. Corporation, E. (11 de 04 de 2013). CIO Top o f M i n d f o r 2 0 1 3 . O b t e n i d o d e http://www.idgconnect.com/view_abstract/1333 0/cio-top-mind-2013
Dean, J. A. (2010). MapReduce: A Flexible Data Processing Tool. Commun. ACM, 53(1), 72-77.
El Abbadi, E. (2011). Big Data and Cloud Computing : Current State and Future Opportunities. 14th International Conference on Extending Database Technology, 1-3.
F. Wang, J. Q. (2009). Hadoop high availability through metadata replication . Proceeding of the first international workshop on Cloud data management - CloudDB 09, 37. Foley, M. (11 de 04 de 2013). Microsoft drops Dryad; puts its big-data bets on Hadoop. O b t e n i d o d e http://www.zdnet.com/blog/microsoft/microsoft -drops-dryad-puts-its-big-data-bets-onhadoop/11226
Foster, I., Zhao, Y., & Raicu, I. L. (2008). Cloud Computing and Grid Computing 360- Degr e e Compa r ed. Grid Computing Environments Workshop, 1-10.
Gartner. (05 de 05 de 2013). Big Data, IT G l o s s a r y 2 0 1 2 . O b t e n i d o d e http://www.gartner.com/it-glossary/bigdata/Gartner. (03 de 05 de 2013).
Gartner definition of cloud computing. Obtenido de Gartner definition of cloud computing, 2012: http://www.gartner.com/it-glossary/cloudcomputing/
Ghemawat, S., Gobioff, H., & Leung, S. (2003). The Google file system. ACM SIGOPS Operating Systems Review, 37(5), 29. Godavari, W. (2010). Reviewing Some Platforms in Cloud. International Journal of Engineering, 2(5), 348-353.
Google. (09 de 04 de 2013). MapReduce O v e r v i e w . O b t e n i d o d e https://developers.google.com/appengine/docs/ python/dataprocessing/overview Haerder, T., & Reuter, A. (1983). Principles of transaction-oriented database recovery. ACM Computing Surveys, 15(4), 287-317.
Hu, W., Yang, T., & Matthews, J. (2010). The good, the bad and the ugly of consumer cloud storage. ACM SIGOPS Operating Systems Review, 44(3), 110.
IDC. (10 de 4 de 2013). IDC Latin America Predictions 2013. Obtenido de IDC Latin A m e r i c a P r e d i c t i o n s : http://www.idclatin.com/campaign/predictions/
Kambatla, K. P. (2009). Towards Optimizing Hadoop Provisioning in the Cloud. HotCloud 09, 22.
Konstantinou, I., Angelou, E., Tsoumakos, D., & Koziris, N. (2010). Distributed indexing of web scale datasets for the cloud. Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud - MDAC 10. , 1 -6.
Leavitt, N. (2010). Will NoSQLdatabases live up to their promise? Computer, 3(2), 12-14.
Marks, E., & Lozano, B. (2010). Executives Guide to Cloud Computing. Nueva York: Wisley & Sons, Inc. Mell, P., & Grance, T. (2008). The NIST D e f i n i t i o n o f C l o u d C o m p u t i n g : Recommendations of the National Institute of Standards and Technology. NIST Special Publication,, 145.
Microsoft. (07 de 04 de 2013). Dryad . Obtenido de http://research.microsoft.com/enus/projects/dryad
Mietzner, R., Unger, T., Titze, R., & Leymann, F. (2009). Combining Different Multi-Tenancy Patterns in Service-Oriented Applications. Enterprise Distributed Object Computing Conference, 131-140.
Nurmi, D., Wolski, R., Grzegorczyk, C., Obertelli, G., Soman, S., Youseff, L., & Zagorodnov, D. (2009). The Eucalyptus OpenSource Cloud-Computing System. 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 124-131.
Padhy, R., Patra, M., & Satapathy, S. (2011). “RDBMS to NoSQL: Reviewing Some NextGeneration Non-Relational Databases. International Journal of Advanced Engineering Science and Technologies, 11(1), 15-30.
Pearson, S., Shen, Y., & Mowbray, M. (2009). “A Privacy Manager for Cloud Computing. Lecture Notes in Computer Science, 5931, 90- 106.
Sciore, E., & Hill, C. (2007). SimpleDB : A Simple Java-Based Multiuser System for Teaching Database Internals. 28th SIGCSE Technical Symposium on Computer Science Education, 561565.
Shao, B., & Wang, H. (2012). Managing and Mining Large Graphs : Systems and Impl ement a tions. ACM Int e rna tiona l Conference on Management of Data (SIGMOD), 1, 589592.
Stonebraker, M. (2011). Stonebraker on NoSQL and enterprises. Communications of the ACM, 54(8), 10-11.
Strauch, C. (2011). NoSQL databases: a step to database scalability in web environment. 13th International Conference on Information Integration and Web-based Applications and Services, C. Strauch, “NoSQL databases: a step to database scalability in web environment.” 13th International Co278-283.
Trelles, O., Prins, P., Snir, M., & Jansen, R. (2011). Big data, but are we ready? Nature reviews. Genetics, 12(3), 224.
Vouk, M. (2008). Cloud computing Issues, research and implementations. ITI 2008 - 30th International Conference on Information Technology Interfaces, 31-40.
Wang, F. Q., Yang, J., Dong, B., Li, X., & Li, Y. (2010). Distributed Systems Meet Economics : Pricing in the Cloud. En H. Wang, Q. Jing, R. Chen, B. He, Z. Qian, & L. Zhou, 2nd USENIX conference on Hot topics in cloud computing (pág. 6).
Xiang, P., Hou, R., & Zhou, Z. (2010). Cache and Consistency in NOSQL Consistency hash algorithm. Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on, 2010, 6, 117-120.
Zhou, M., States, U., Grimmer, J., King, G., & Science, Q. (3 de 5 de 2013). The Age of Big D a t a . O b t e n i d o d e http://damfoundation.org/2012/02/the-age-ofbig-data
M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, I. Stoica, y M. Zaharia, “Above the Clouds : A Berkeley View of Cloud Computing Cloud Computing : An Old Idea Whose Time Has ( Finally ) Come,”. 2009. pp. 7–13.