Big data and NoSQL: their role on the cloud computing revolution and their challenges to standardization.
DOI:
https://doi.org/10.33304/revinv.v06n2-2015008Keywords:
Databases, Big Data, cloud computing, NoSQLAbstract
The access and usage boom of cloud computing oriented information technologies (IT) has generated huge amounts, and variety of data, or Big Data into the datacenters of the organizations than provide these kind of services, originating in that way, some challenges for IT managers and developers on handling efficiently those resources. In this phenomenon, relational databases are seen by IT developers as an obstruction to achieve the essential features of cloud computing. This has driven a set of new approaches and technologies for data management, known as no-relational databases or NoSQL, they are designed to provide speed, scalability, and elasticity, making easy the management of cloud computing technological solutions. In this papers are exposed the Big Data and NoSQL concepts, their main technological tools to data management, their role on the cloud computing revolution, and the challenges of this technologies to their standardization.Downloads
References
A. El Abbadi, “. D. (2010). Big Data y Cloud Computing : New Wine or just New Bottles? The VLDB Endowment VLDB Endowment Hompage archive, 3(1), 1647-1648.
Abu-libdeh, H., & Weatherspoon, H. (2010). RACS : A Case for Cloud Storage Diversity. RACS: A Case for Cloud Storage Diversity, 229240.
Across, P., & Hardware, H. (2007). The Hadoop Distributed File System: Architecture and Design. 26th Symposium on Mass Storage Systems and Technologies (MSST), 1-14. Obtenido de P. Across y H. Hardware, “The Hadoop Distributed File System: Architecture and Design,” 26th Symposium on Mass Storage Systems and Technologies (MSST). 2007. pp. 114.
Amazon. (09 de 04 de 2013). SimpleDB. O b t e n i d o d e http://aws.amazon.com/simpledbApache. (09 de 04 de 2013).
Apache Cassandra . Obtenido de http://cassandra.apache.org
Apache. (09 de 04 de 2013). ApacheCouchDB. Obtenido de http://couchdb.apache.org
Armbrust, B., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., . . . Rabkin, A. (2010). Above the Clouds: A View of Cloud Computing. Communications of the ACM, 53(4), 50-58.
Armbrust, M., Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., . . . Zaharia, M. (2009). Above the Clouds: A Berkeley View of Cloud Computing Cloud Computing: An Old Idea Whose Time Has ( Finally ) Come. 7-13.
Bartholomew, D. (2010). SQL vs . NoSQL The Case for NoSQL. Linux Journal, 2010, 1-8. Bhardwaj, S., Jain, L., & Jain, S. (2010). An Approach for Investigating Perspective of Cloud Software-as-a-Service (SaaS). International Journal of Computer Applications., 10(2), 44-47.
Bhardwaj, S., Jain, L., & Jain, S. (2010). Cloud computing: A study of infraestructure as a services ( IaaS ). Int. J. Eng. Inf. Technol., 2(1), 60-63.
Bobrowski, S. (2011). Optimal Multitenant Designs for Cloud Apps. 2011 IEEE 4th International Conference on Cloud Computing, 654-659.
Borkar, V., Carey, M., & Li, C. (2012). Inside Big Data Management: Ogres, Onions, or Parfaits? 15th International Conference on Extending Database Technology, 3-14.
Borthakur, D. R., & Gray, J. S. (2011). Apache hadoop goes realtime at Facebook. Proceedings of the 2011 international conference on Management of data - SIGMOD 11, 71. Obtenido de Borthakur, D., Rash, S., Schmidt, R., Aiyer, A.; Gray, J., Sen Sarma, J., Muthukkaruppan, K., Spiegelberg, N., Kuang, H., Ranganathan, K., Molkov, D., Menon, A. “Apache hadoop goes realtime at Facebook,” in Proceedings of the 2011 international conferenc.
Bowers, K., Juels, A., & Oprea, A. (2009). HAI : AHigh-Availability and Integrity Layer for Cloud Storage. The 16th ACM conference on Computer and communications security, 1-12.
Brewer, E. (2000). Towards Robust Distributed Systems. ACM Symposium on Principles of Distributed Computing, 7.
Buyya, R., Yeo, C. V., Broberg, J., & Brandic, I. (2009). “Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems, 25(6), 599-616.
Calder, B., Wang, J., Ogus, A., Nilakantan, N., Skjolsvold, A., Mckelvie, S., . . . Dayanand, S. (2011). Windows Azure Storage : A Highly Available Cloud Storage Service with Strong Consistency. 143-157.
Cattell, R. (2010). Scalable SQL and NoSQL Data Stores. 39, 12-27. Chang, F., Dean, J., Ghemawat, S., Hsieh, W., Wallach, D. B., Chandra, T., . . . Gruber, R. (2006). Bigtable : A Distributed Storage System for Structured Data.
Chang, V., Bacigalupo, D., Wills, G., & De Roure, D. (2010). V. Chang, D. Bacigalupo, G. Wills, y D. De Roure, “ACategorisation of Cloud Computing Business Models,” 10th IEEE. V.
Chang, D. Bacigalupo, G. Wills, y D. De Roure, “A Categorisation of Cloud Computing Busine ss Mode ls, ” ACM International Conference on Cluster, Cloud and Grid Computing, 509-512.
Choo, K. (2010). Cloud computing : Challenges and future directions. Trends & Issues in Crime and Criminal Justice, 1(400), 1-8. Corporation, E. (11 de 04 de 2013). CIO Top o f M i n d f o r 2 0 1 3 . O b t e n i d o d e http://www.idgconnect.com/view_abstract/1333 0/cio-top-mind-2013
Dean, J. A. (2010). MapReduce: A Flexible Data Processing Tool. Commun. ACM, 53(1), 72-77.
El Abbadi, E. (2011). Big Data and Cloud Computing : Current State and Future Opportunities. 14th International Conference on Extending Database Technology, 1-3.
F. Wang, J. Q. (2009). Hadoop high availability through metadata replication . Proceeding of the first international workshop on Cloud data management - CloudDB 09, 37. Foley, M. (11 de 04 de 2013). Microsoft drops Dryad; puts its big-data bets on Hadoop. O b t e n i d o d e http://www.zdnet.com/blog/microsoft/microsoft -drops-dryad-puts-its-big-data-bets-onhadoop/11226
Foster, I., Zhao, Y., & Raicu, I. L. (2008). Cloud Computing and Grid Computing 360- Degr e e Compa r ed. Grid Computing Environments Workshop, 1-10.
Gartner. (05 de 05 de 2013). Big Data, IT G l o s s a r y 2 0 1 2 . O b t e n i d o d e http://www.gartner.com/it-glossary/bigdata/Gartner. (03 de 05 de 2013).
Gartner definition of cloud computing. Obtenido de Gartner definition of cloud computing, 2012: http://www.gartner.com/it-glossary/cloudcomputing/
Ghemawat, S., Gobioff, H., & Leung, S. (2003). The Google file system. ACM SIGOPS Operating Systems Review, 37(5), 29. Godavari, W. (2010). Reviewing Some Platforms in Cloud. International Journal of Engineering, 2(5), 348-353.
Google. (09 de 04 de 2013). MapReduce O v e r v i e w . O b t e n i d o d e https://developers.google.com/appengine/docs/ python/dataprocessing/overview Haerder, T., & Reuter, A. (1983). Principles of transaction-oriented database recovery. ACM Computing Surveys, 15(4), 287-317.
Hu, W., Yang, T., & Matthews, J. (2010). The good, the bad and the ugly of consumer cloud storage. ACM SIGOPS Operating Systems Review, 44(3), 110.
IDC. (10 de 4 de 2013). IDC Latin America Predictions 2013. Obtenido de IDC Latin A m e r i c a P r e d i c t i o n s : http://www.idclatin.com/campaign/predictions/
Kambatla, K. P. (2009). Towards Optimizing Hadoop Provisioning in the Cloud. HotCloud 09, 22.
Konstantinou, I., Angelou, E., Tsoumakos, D., & Koziris, N. (2010). Distributed indexing of web scale datasets for the cloud. Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud - MDAC 10. , 1 -6.
Leavitt, N. (2010). Will NoSQLdatabases live up to their promise? Computer, 3(2), 12-14.
Marks, E., & Lozano, B. (2010). Executives Guide to Cloud Computing. Nueva York: Wisley & Sons, Inc. Mell, P., & Grance, T. (2008). The NIST D e f i n i t i o n o f C l o u d C o m p u t i n g : Recommendations of the National Institute of Standards and Technology. NIST Special Publication,, 145.
Microsoft. (07 de 04 de 2013). Dryad . Obtenido de http://research.microsoft.com/enus/projects/dryad
Mietzner, R., Unger, T., Titze, R., & Leymann, F. (2009). Combining Different Multi-Tenancy Patterns in Service-Oriented Applications. Enterprise Distributed Object Computing Conference, 131-140.
Nurmi, D., Wolski, R., Grzegorczyk, C., Obertelli, G., Soman, S., Youseff, L., & Zagorodnov, D. (2009). The Eucalyptus OpenSource Cloud-Computing System. 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 124-131.
Padhy, R., Patra, M., & Satapathy, S. (2011). “RDBMS to NoSQL: Reviewing Some NextGeneration Non-Relational Databases. International Journal of Advanced Engineering Science and Technologies, 11(1), 15-30.
Pearson, S., Shen, Y., & Mowbray, M. (2009). “A Privacy Manager for Cloud Computing. Lecture Notes in Computer Science, 5931, 90- 106.
Sciore, E., & Hill, C. (2007). SimpleDB : A Simple Java-Based Multiuser System for Teaching Database Internals. 28th SIGCSE Technical Symposium on Computer Science Education, 561565.
Shao, B., & Wang, H. (2012). Managing and Mining Large Graphs : Systems and Impl ement a tions. ACM Int e rna tiona l Conference on Management of Data (SIGMOD), 1, 589592.
Stonebraker, M. (2011). Stonebraker on NoSQL and enterprises. Communications of the ACM, 54(8), 10-11.
Strauch, C. (2011). NoSQL databases: a step to database scalability in web environment. 13th International Conference on Information Integration and Web-based Applications and Services, C. Strauch, “NoSQL databases: a step to database scalability in web environment.” 13th International Co278-283.
Trelles, O., Prins, P., Snir, M., & Jansen, R. (2011). Big data, but are we ready? Nature reviews. Genetics, 12(3), 224.
Vouk, M. (2008). Cloud computing Issues, research and implementations. ITI 2008 - 30th International Conference on Information Technology Interfaces, 31-40.
Wang, F. Q., Yang, J., Dong, B., Li, X., & Li, Y. (2010). Distributed Systems Meet Economics : Pricing in the Cloud. En H. Wang, Q. Jing, R. Chen, B. He, Z. Qian, & L. Zhou, 2nd USENIX conference on Hot topics in cloud computing (pág. 6).
Xiang, P., Hou, R., & Zhou, Z. (2010). Cache and Consistency in NOSQL Consistency hash algorithm. Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on, 2010, 6, 117-120.
Zhou, M., States, U., Grimmer, J., King, G., & Science, Q. (3 de 5 de 2013). The Age of Big D a t a . O b t e n i d o d e http://damfoundation.org/2012/02/the-age-ofbig-data
M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, I. Stoica, y M. Zaharia, “Above the Clouds : A Berkeley View of Cloud Computing Cloud Computing : An Old Idea Whose Time Has ( Finally ) Come,”. 2009. pp. 7–13.