In place of buying, owning, and maintaining data centers and physical servers, we can get into technology services, such as computing power, storage, and databases from cloud service pro
Trang 2VIETNAM NATIONAL UNIVERSITY, HANOI
Trang 3FORM 3: LETTER OF DECLARATION
I hereby declare that the Graduation Project “Electronic Medical Record (EMR) Database And Big Data Using Microsoft Azure” is the results of my own research and has never been published in any work of others During the implementation process of this project, I have seriously taken research ethics; all findings of this project are results
of my own research and surveys; all references in this project are clearly cited according
Trang 4FORM 4: ACKNOWLEDGEMENT
First of all, I would like to express my sincere thanks to all teachers in International School – Vietnam National University for teaching and imparting valuable knowledge to me during the four years of studying and training at the school I want to express my deep gratitude to the instructor - Assoc.Prof.Dr Nguyen Thanh Tung (and his assistants MR
Vu Khanh Hoan), and my old teacher Mr.Phat, who enthusiastically guided me to implement this thesis as well as creating favorable conditions for me to have real contact, helping me gain more knowledge, practical and future job requirements
With my limited knowledge and research time, my thesis cannot avoid shortcomings I look forward to receiving comments and suggestions from teachers It will be a valuable experience for me to improve myself later
Sincerely thank!
Trang 5FORM 5: TABLE OF CONTENT
CHAPTER 1: OVERVIEW OF BIG DATA ON THE CLOUD COMPUTER PLATFORM 1
I Cloud Computing 1
I.1 Introduction about Cloud Computing 1
I.2 Components and structure of cloud computing 3
I.3 The Services of Cloud Computing 5
I.3.1 SaaS: Software as a Service 5
I.3.2 PaaS: Platform as a Service 7
I.3.3 IaaS: Infrastructure as a Service 9
I.4 Models of Cloud Computing 12
I.4.1 Public Cloud 12
I.4.2 Private Cloud (“Businesses” Cloud) 13
I.4.3 Hybrid Cloud 13
I.4.4 Community Cloud 14
I.5 The advantages/disadvantages of Cloud Computing and introduce some applications of its 14 I.5.1 The advantages and Disadvantages: 14
I.5.2 Some applications of Cloud Computing 17
I.6 Typical cloud applications 17
I.6.1 Google App Engine 17
I.6.3 Amazon Web Services 18
I.6.2 Windows Azure 19
I.7 The situation of apply Cloud Computing in the world and Vietnam 19
Trang 6II Big Data - Overview And Its Impact On Health Care 24
II.1 The definition of Big Data and What is Big Data in Health Care System and Why we need it? 24
II.1.1 The Definition Of Big Data 24
II.1.2 What Is Big Data In Health Care System? And Why We Need It? 24
II.2 The Characteristics Of Big Data 25
II.3 Stored in Big Data Analysis System 26
II.4 The advantages and challenges of using Big Data 31
II.3.1 The advantages of Big Data 31
II.3.2 The challenges of using Bing Data 32
II.5 The application and important of Big Data in Health Care System 34
II.2.2 The important of Big Data in Health Care 34
II.2.3 Examples of Big Data 37
II.6 What is NoSQL System? Why did I choose it? 37
III Overview about Cassandra Cosmos DB 40
III.1 Introduction, Definition, and features 40
III.2 Strength and Weakness 42
CHAPTER 2: APPLICATION AND RESULTS ACHIEVED 43
I Introduction about the tool Microsoft Azure 43
I.1 Introduction 43
I.2 Services of Microsoft Azure 43
I.3 Why did I choose Microsoft Azure? 45
II Application in the project - health care system 51
II.1 About the data of the topic 51
Trang 7II.2 Setup Big Data 54
II.3 Demo the result 63
III Conclusion 65
III.1 Assessment Of Project 65
III.2 Contributions of the project: 65
III.3 The Limitations of the project 67
References 69
Trang 9FORM 6: TABLE OF NOTATIONS AND ABBREVIATIONS
Version
Interface
Trang 11FORM 7: LIST OF TABLE AND FIGURE
1-three types of cloud computing 5
2-comparison between SQL and NoSQL 39
3-the regions serving of Microsoft Azure 47
4- data filed 52
5-data field 2 52
7-create Big Data 1 54
8-create Big Data 2 54
9-create Big Data 3 55
10-create Big Data 5 55
11-create Big Data 6 56
12-create Big Data 7 56
13-create Big Data 8 57
14-create Big Data 9 58
15-create Big Data 10 58
16-Create Table in Big Data 1 59
17-Create Table in Big Data 2 59
18-Input data 60
19-Connect to Cassandra 61
20-Connect to Cassandra 2 62
21-query data 63
22-demo 1 64
23-demo 2 64
Trang 13From such a typical problem, we see that if there is a trusted place to help hospitals manage that data well, they will no longer care about infrastructure and technology but only focus Mainly, their medical examination and treatment will bring them more efficiency and profit
as well as more convenience for patients
The terms "big data" and "cloud computing" came from one of these circumstances The term is also derived from the idea of putting everything such as data, software, calculations, etc on the Internet We will no longer see computers, servers of hospitals alone to store data and software, but only several "virtual servers" gathered on the network "Virtual servers" will provide services that make it easier to manage data; they will only pay for their service usage, without having to invest heavily in infrastructure as well as paying much attention to technology
This trend not only helps hospitals, but also helps many other small and medium-sized enterprises and businesses that do not have sufficient network infrastructure, servers to store and manage data well
However, due to time and knowledge constraints, in this report I will only delve into Big Data in the field of health care on Cloud computing platform Simultaneously, demo application of Microsoft Azure software to manage patient information
Trang 15CHAPTER 1: OVERVIEW OF BIG DATA ON THE CLOUD COMPUTER PLATFORM
I Cloud Computing
I.1 Introduction about Cloud Computing
Cloud computing is the most significant thing in the information technology industry, almost all of the biggest critics agreeing cloud computing is one of the most important paradigm changes in the early 21st century But that's all and where does it come from? And how does cloud computing history take shape?
According to Amazon, “Cloud computing is an on-demand distribution of IT resources over the Internet with a usage-based payment policy In place of buying, owning, and maintaining data centers and physical servers, we can get into technology services, such
as computing power, storage, and databases from cloud service providers like Amazon Web Services (AWS)” [1]
Cloud computing originated in the 1950s when the mainframe was born The cost of these mainframe devices is so large that organizations cannot afford to buy them
economically This is the time when the idea of providing shared access to a single
computer was born to companies to save costs [2]
In the 1970s, an operating system called the VM appeared This operating system can run
on any virtual machine, has its memory and other infrastructure, allowing data sharing This makes the concept of virtualization in computers popular
In the 1990s, telecom researchers and operators began to provide virtual private network equipment, with good service quality and low cost This is a way for telecommunications companies to provide multiple users with access to a single infrastructure
Another factor that also influences cloud computing is grid computing Grid computing allows major issues to be solved through parallel computing Computer utilities facilitate computing resources to be provided as measurement and SaaS services that allow
Trang 16subscribers to be posted on the Internet and applications Since then, cloud computing was officially born Cloud computing was born of all these factors
We can see that within a minute, applications like Facebook have more than 4 million new "likes", 300 hours of YouTube videos are posted, and more than 100 thousand calls are made on the platform Skype So what did they do to be able to store and serve such a huge amount of data? The obvious answer is Cloud Computing
With the development of cloud computing, it has gained a lot of trust from organizations
of all types, sizes, and industries that are using this service for many use cases, such as data backup, disaster recovery, email, virtual desktops, software development, and
testing, big data analytics, and interactive web applications with customers.[3]
For example:
A simple illustration to unravel the definition of cloud computing Before the 2000s, users who wanted to use what services had to buy the copyright, CD to install on personal computers such as accounting software, address management, phone numbers, and staff management This trend at that time proved to be quite effective when just a few taps, users can find out the necessary information about a certain employee or an address and phone number of an individual However, all data is stored on a personal computer so the mobility of these services is not high unless the user owns a laptop Nobody does that anymore With just a PDA or an internet-connected personal computer, individual users can search for essential information stored somewhere on the internet There are many services about managing personal information, email on the internet that users can exploit such as Yahoo mail, Gmail, Facebook; Companies no longer have to buy software
licenses for employee management, financial management, no longer have to invest in powerful computers to host the company's data Instead, the company only needs to pay
to use the service, more specifically, all the software for employee management and financial management, installed by a service provider at a power center certain math Servers with large amounts of memory are also installed by the service provider and the company can store the company's data Or as Health Care Companies are using cloud
Trang 17services to develop more tailored treatments for patients Financial services companies are using cloud services to enhance detection and prevent scams in real-time
Besides that video, game makers are using cloud services to provide online games to millions of players around the world
Thus, in the past, to deploy an application (for example, a Web site), you had to buy/rent one or more servers, and then set the server at data centers ), now cloud computing allows you to streamline the buying renting process You just need to state your
requirements; the system will automatically gather free resources to meet your
requirements
I.2 Components and structure of cloud computing
Cloud computing models include the Front end and Back end These two components are connected via a network, in most cases The front end is the transport vehicle through which the user interacts with the system; the back end is the cloud The back delivers applications computers servers and data storage to create a cloud of services The front consists of a business computer or network and applications used to access the cloud the infrastructure of today's cloud computing is a combination of reliable services delivered
by the world's leading information technology developers such as Microsoft, IBM,
Google based on virtualization technology (virtualized)[4] Cloud computing is divided into 4 basic layers with mutual interaction including:
o Cloud Application
The application layer of the cloud is to distribute software as a service via the Internet Users do not need to install such applications on their devices Applications are easy to modify and users can easily receive support from the service provider
Key features of the application layer include:
- Activities are managed in the cloud (Internet), not on the client-side, Customers access applications remotely via the Internet environment, the main interface is the website
Trang 18- All operations such as version updates, patches are done in the cloud (service provider) users do not need to perform these operations
o Cloud Programing
It provides a platform for the computing environment and solutions for computing
services, governs the infrastructure of cloud computing, and is the fulcrum for the
application layer to allow applications to operate on that platform Reduce the cost of deploying applications These include Web 2.0, Mashups, and Concurrent and
Distributed…
o Cloud infrastructure
Providing infrastructure of computers and devices in the cloud environment
(virtualization) Instead of customers having to invest money to buy servers, software, connected devices or rent physical infrastructure at data storage centers With the cloud application, all customer requirements will be provided immediately with a much lower, even free investment This is also the most prominent feature in the cloud-computing model
o Cloud Resources
Including the entire server, the connected device is designed and built specifically to provide and operate cloud-computing services Often-specialized devices with super-powerful multithreading computing capabilities to meet the entire mining needs of a large number of users and their growing needs
Trang 19I.3 The Services of Cloud Computing
The components of cloud computing, such as front-end, back-end and cloud-based
distribution and network use Together, they form a complete architecture of cloud
computing In addition to the main components, such as SAAS, PAAS and IAAS, there are all 11 components in cloud computing
However, in this report, I will only analyze three main types: IaaS, Paas and Saas
1 -three types of cloud computing
I.3.1 SaaS: Software as a Service
Software is also known as cloud application service, this name represents many popular enterprise services in the cloud market using the internet to contribute applications
Trang 20Third-party management for its users SaaS uses the internet to deliver applications, managed by third parties, to its users Most SaaS applications run directly through the web browser and do not require downloading or installing anything from the user
o The Distribution of SaaS
Because SaaS's distribution model is web-based, SaaS eliminates the need for IT staff to download and install applications on each individual computer With SaaS, vendors will manage all potential technical issues, such as data, middleware, servers, and storage, allowing businesses to manage and support online
o The Benefits of SaaS
SaaS provides numerous benefits for employees and the company by significantly cutting down the time and money for boring tasks such as installing managing and upgrading software security This gives technical staff more time to focus on the company's urgent issues
o The Characteristics of SaaS
There is a way to help you determine when businesses should use SaaS:
• Service management from a central location
• Set on the remote server
• Accessibility via the internet
• Users are not responsible for hardware and software updates
o When to use SaaS?
There are many separate specific situations that SaaS may be the best choice, including:
• If you are a startup or small company, you want to launch e-commerce services quickly without need time and staff for server and software issues
• For short-term projects that require cooperation
Trang 21• If you use applications that do not require frequent use, such as tax software
• For applications that need accessibility from the web and mobile
o The Example of SaaS
Google Apps, Cisco Webex, Drop box
I.3.2 PaaS: Platform as a Service
Cloud-based services, or Platform as a Service (PaaS), provide cloud components to certain software while still being used for core applications Users can use and build custom applications thanks to the framework provided by the PaaS The enterprise or a third-party provider can manage all servers, storage, and networking, while developers can maintain application management
Provides computing platform and a set of multi-layer solutions It supports application deployment regardless of the cost or complexity of equipping and managing the
underlying hardware and software layers, providing all the features needed to support the cycle Run fully by building and delivering an Internet-ready application and web service without any download or installation of software for developers, IT managers, or end-users It is also known by another name as cloud ware
The provision of platform services (PaaS) includes the conditions for application design, development, testing, deployment, and hosting of valuable applications such as
application services such as collaboration and packaging, web service integration and integration, database integration, security, scalability, state management, application version, and community benefits co-development and applied research These services are prepared as a web-based integration solution
o The distribution of PaaS
PaaS's distribution model is similar to SaaS, except that instead of distributing software over the internet, PaaS provides a platform for creating applications The platform is
Trang 22distributed over the web and provides developers with free deployment, able to focus on building software while not having to worry about or worry about operating systems, updates, or updates Software updates, or infrastructure
PaaS enabled businesses to allow us to design and create applications that are integrated into SaaS with special software components, these applications or middleware are highly scalable and available when they have special features Certain cloud points
o The Benefit of PaaS
No matter what size your company is, there are many different benefits to using PaaS:
• Making developing and deploying applications easier and more cost-effective
• Easy scalability
• High availability
• Gives developers the ability to create custom applications without the hassle of
maintaining software
• Significant reduction in the number of lines of code
• Automated business policy
• Allows easy integration with hybrid models
o The Characteristics of PaaS
PaaS has many features that define it as a cloud service, including:
• Built on virtualization technology, meaning resources can easily scale up or down as your business changes
Trang 23• Provides many services to develop, test, and deploy applications
• Multiple users can access the same service application at the same time
• Integrated web services and databases
o When to use PaaS?
There are many cases when using PaaS is advantageous or even necessary if multiple developers are working on the same development project or together with other vendors, PaaS can provide great speed and flexibility to the whole process
You can completely create your custom applications, this cloud service can significantly reduce costs, and it can simplify the challenges that arise if you require rapid
development or application deployment Thanks to the benefits of PaaS
o The Example of PaaS
AWS Elastic Beanstalk, Google App Engine, Windows Azure
I.3.3 IaaS: Infrastructure as a Service
Cloud infrastructure services, known as Infrastructure as a Service (IaaS), are created from highly scalable and automated computing resources IaaS is completely self-serving
to access and monitor everything like computers, networking, storage, and other services, and it allows businesses to buy resources on demand and needs instead of having to buy the entire hard part themselves
Stacking virtual architectures is an example of a trend where everything is served and has something in common Rather than a server for rent, central storage space or network equipment, workstations instead of investing to buy the resources, it is possible to hire full external services These services are usually charged based on functionality and the amount of resources used (and hence the cost) will reflect the level of activity Full of development of web hosting solutions and virtual personal servers
Trang 24The original name used was hardware service (HaaS) and was created by a Nicholas Car economist in March 2006, but this is necessary However, the word was gradually
replaced by the concept of infrastructure services around the end of 2006
o The Distribution of IaaS
IaaS distributes cloud-computing infrastructure, including everything like servers,
networks, operating systems, and storage, through virtualization technology These
specific cloud services are provided to the organization through a control panel or an API, an IaaS customer has complete control over the entire infrastructure
IaaS provides technologies and capabilities as a traditional storage center, without having
to maintain or manage the hardware of all those components IaaS customers may still be able to access their servers and storage directly, but external partners via virtual data centers in the cloud hire all
Unlike SaaS and PaaS, IaaS customers are responsible for application management, runtime, operating system, middleware and data However, IaaS providers manage
servers, hard drives, networks, virtualization, and storage Some vendors provide even more services outside of the virtualization layer, such as databases
o The Benefits of IaaS
There are a large number of benefits to choosing IaaS such as:
• It is the most flexible cloud model
• Easily enable storage deployment, networking, server, and automated processing power
• Additional hardware may be purchased based on usage
• Provide customers with complete control of their infrastructure
• Resources may be purchased as needed
• Easy to expand high
Trang 25o The Characteristics of IaaS
Few features to consider when considering IaaS:
• Resources available as a service
• Costs vary depending on consumption
• Highly scalable services
• There are usually multiple users on a specific hardware
• Provide complete control of the organization's infrastructure
• Flexible
o When to use IaaS
As with SaaS and PaaS, some specific situations are the best time to use IaaS IaaS will
be the best choice because you do not need to spend money and time trying to create hardware and software if you are startup or small company IaaS is also beneficial for large organizations that want complete control over their applications and infrastructure, but they just want to invest in what is needed IaaS may be a good choice because you do not need to upgrade or replace hardware or software to meet development needs for fast-growing company It also helps if you are not sure about the requirements that a new application will require because of its flexibility in scalability or reduction as needed
o The Example of IaaS
Digital Ocean, Google Compute Engine (GCE)
Conclusion: SaaS, PaaS and IaaS
Trang 26In general, each cloud service model provides specific features and functions, and most importantly, your organization needs to understand the differences between these
services Whether you're looking for a cloud-based software for storage purposes, a
platform that allows you to create custom applications, or want complete control of your infrastructure without having to maintain your Physical component, there will be a cloud service for you It does not matter which service you choose, but moving to the cloud is the future for business and technology models, and it is necessary
I.4 Models of Cloud Computing
Cloud Computing has four service models (product models):
o Public Cloud: services on the Cloud Computing platform to rent to individuals and organizations, they share resources
o Private Cloud: used in a business and not shared with users outside the enterprise
o Hybrid Cloud: A hybrid model (hybrid) between the Public Cloud and Private Cloud models
o Community Cloud: is a service based on Cloud computing platform built by companies that work together and provide services to the community
I.4.1 Public Cloud
Definition: a third party (seller) provides the services They exist outside the company's firewall and are managed by the cloud provider It is built for public use; users will
register with the provider and pay a fee based on the provider's pricing policy Public cloud is the most commonly used deployment model of cloud computing
Target users: Including users outside the internet The object of management is the
service provider
Advantages:
Serve more users, not limited by space and time
The saving server system, electricity, and labor for businesses
Disadvantages:
Trang 27 Suppliers-dependent businesses do not have full management rights
Having difficulty storing internal documents and information
However, the Public Cloud has a problem, the data loss, and data security issue All data
on the Cloud service, which is protected and managed by the Cloud service provider in this model This makes customers, especially large companies; feel insecure about their important data when using Cloud services
I.4.2 Private Cloud (“Businesses” Cloud)
Definition: Private cloud is the cloud computing services provided in businesses It is the inevitable trend when these "clouds" exist within the company's firewall and are directly managed by businesses
Target users: Internal use and management by the enterprise
Advantages: Actively using, upgrading, managing, reducing costs, good security,
Disadvantages:
o Difficulties in technology deployment and cost of system maintenance
o Restricting internal use within the enterprise, external users cannot use
I.4.3 Hybrid Cloud
Definition: A combination of private cloud and public cloud It allows us to exploit the strengths of each model as well as offer optimal usage methods for users These "clouds" are often created by businesses and their management will be split between businesses and public cloud providers
Target users: Enterprise and supplier managed by agreement Users can use the service provider and from there start their business service
Advantages: Enterprises can use many services at the same time without limitation Disadvantages: Difficult to deploy and manage It costs a lot of money
Trang 28I.4.4 Community Cloud
Community cloud (are services based on Cloud computing platform built by companies that work together and provide services to the community)
I.5 The advantages/disadvantages of Cloud Computing and introduce some applications of its
I.5.1 The advantages and Disadvantages:
o Advantages:
Thus, in the past, to deploy an application (for example, a website), you had to buy/rent one or more servers, then rent a server at the data centers, now cloud computing allows you to reduce the buying / renting process You just need to state your requirements; the system will automatically gather free resources to meet your requirements Therefore, there are some basic benefits of cloud computing as follows:
+ Use dynamic computing resources: Resources are allocated to businesses exactly what the business wants immediately by mobilizing free resources available on the internet + Reduce costs: Cost reduction: Businesses will be able to cut costs to purchase, install and maintain resources
Instead, they only need to identify their needs and then ask the cloud service providers
to proceed
The amount of savings provided by cloud computing is significant Many businesses report significantly reducing costs when switching to cloud computing Moreover, cloud computing also saves time Many difficult tasks exist in the business world and with different types of software, difficult tasks can be solved somewhat
+ Reduce the complexity in the structure of the business: Outsourcing the work of setting
up and operating the IT apparatus, the business will only focus on the production of its specialized goods and reduce the complexity in the structure
Trang 29+ Increase the use of computing resources, storage and expansion: One of the headache questions of investing resources (ex: servers) is how long it will run out of depreciation, I invest as is it profitable or not, is it out of date on technology or not When using
resources on cloud computing, you no longer need to care about this anymore
Through the provision of software companies, customers are provided free of charge with cloud data storage An extremely large storage space This eliminates the need to
purchase related storage areas Moreover, as this storage exists in cloud computing, sorting and organizing become a simpler task
+ Mobility: Cloud computing exists within the limitations of the Internet As such, access becomes much easier Your programs, files, and photos stored in the cloud can be
accessed anywhere with an Internet connection In addition, these documents can be accessed through methods other than a computer, such as a tablet or a smartphone
Through this mobility, emergency files you can gain access to any needed emergency + Durability: Cloud computing is also extremely reliable Usually, most software
development companies provide excellent customer support When the customer support staff often has a high handling experience, troubleshooting is much less hassle
o Disadvantages:
While there are many advantages, however, cloud computing technology also has some disadvantages such as:
+ Monthly cost: Probably the most prominent and common disadvantage for cloud
computing will be the monthly cost Using cloud computing usually requires bundled software this can cost some extra software With each custom program created, the cost will increase further
Some examples of ways to calculate costs in the Cloud:
Fixed pricing: the supplier will specify the specification of a fixed computing capability (allocated memory capacity, CPU type, and speed, etc.)
Unit pricing: popularly applied to the amount of data transferred, the amount of memory allocated and used, etc This method is more flexible than the above method
Trang 30Subscription pricing: most applications in the software service model (SaaS) users will predict cloud application usage norms (this is often difficult to achieve high accuracy) + Dead time: All the time that needs to be taken care of is when the server downtime This can occur for a variety of reasons such as weather updates, malware, or system malfunctions If the backup server is not working, your job could be severely hampered Providers for cloud computing services will be difficult because of taking care of a large number of customers at a time With a large number, companies or servers can be
sometimes overwhelmed and some technical problems may occur
+ Compatibility: When integrating a completely new system, compatibility issues may or may not This is especially true that companies want to retain a program before exiting Although it may appear fine and specific functions may result in a program failure,
resulting in compatibility or incompatibility issues occurs
+ Difficult to manage and secure: Because these systems often focus on the advantage of sharing, sharing resources on the cloud, or the internet From there, to implement
management, security, and prevention of hackers is quite difficult Hackers can access illegal information and steal data, causing a great loss for the company Moreover, those who have access to the server may use the information for malicious purposes In the situation that the tactics of attack, sabotage of hackers to profit more and more popular
In the future, we are sure that we will focus more on security for this cloud platform + Handling the data type of obsolete data: Data storage in the cloud system for too long is one of the difficult problems Such as at Google driver Some user accounts store lots of files there However, these files have not been accessed or edited and used for many years Combining so many types of data, our storage space will be wasted Since then, many hosting companies often warn customers about the types of data that are outdated, which they will delete in the future to avoid wasting storage resources
Conclusion: Cloud computing offers a multitude of advantages and many disadvantages In most cases, however, companies and businesses will find cloud computing to be extremely useful for their development and maintenance with this advanced technology
Trang 31I.5.2 Some applications of Cloud Computing
- Cloud database
- Testing and development
- Saving
- Big Data Analysis
- Store and share data
- Backup and restore data
- Enterprise management application
I.6 Typical cloud applications
More and more companies participating in the development of typical cloud applications such as Microsoft, Google, Intel, and IBM have been creating a large market of cloud applications The research team would like to offer a few typical cloud applications: I.6.1 Google App Engine
Google App Engine (GAE) allows us to deploy our application on Google's
infrastructure Building applications with App Engine is easy, convenient during
maintenance, easy to expand when there is increased traffic, or additional storage needs GAE supports us with 2 application development environments: the Java runtime
environment and the Python runtime environment
Java execution runtime environment: You can build applications based on Java
technology including the Java virtual machine (JVM), the Java servlet, and the Java programming language - or any language compiled on the JVM such as JavaScript or Ruby
Python Runtime Environment: is a specialized environment including a very fast
interpreter and a standard library
Some features of Google App Engine:
• Support the most popular, most popular web technologies
• Automatic expansion and load balancing
Trang 32• Provide API to authenticate users and send mail using Google accounts
• Provide a development environment with full features
I.6.3 Amazon Web Services
Amazon Web Services is a collection of services that provide programmers with access
to Amazon's ready-to-use computing architecture infrastructure Well-built computers that have been built and refined over the years by Amazon can give anyone access to the Internet Amazon provides several Web services that meet some of the core requirements
of most systems: storage, computing, messaging and data sets
Amazon provides standard interfaces based on SOAP and REST to interact with each service Libraries developed from Amazon or from one of the permissible languages, such as Ruby, Python, Java, Erlangen and PHP, to make exchanges with these services Command-line tools can also perform computational resource management on EC2 The REST interface is relatively easy to use; You can use the client-side program written in any language as the HTTP protocol to allow making requests to Web services
Storage
Amazon Simple Storage Service (S3) provides Web service interfaces for data storage and recovery Data is given in any kind and can be stored and accessed from any location via the Internet You can store an unlimited number of objects in S3 with the size of each object ranging from 1 byte to 5 GB The data warehouse can be located in the United States or European Union countries You can choose the storage location for your objects when you create buckets (similar to the concept of a directory) Data is securely stored using the same Amazon storage infrastructure
According to the leading experts on cloud computing, the development of cloud
computing in the future will focus on three main issues including: Likability (Federated), automation (Automated) and Client aware (terminal aware) These are also new
approaches to IT automation that allow users to respond to user requests in a new, more efficient and cost-effective manner Linked clouds will allow faster arrangement of
resources, while endpoint-aware clouds will take advantage of the unique capabilities of
Trang 33each device in an optimal way Cloud computing will be the most used technology in the future
I.6.2 Windows Azure
Windows Azure enables the deployment of Windows applications and data storage on Microsoft infrastructure via the Internet environment
Windows Azure provides an application development environment using NET
Framework, Native Code Support for common languages like C #, Visual Basic, C ++,
or maybe by java Use Visual Studio or another development tool
Both Windows Azure applications and on-premise applications can access Windows Azure storage services through the REST API The storage service allows storing large data objects via blob, provides queues for communication between components in the application, and provides Table-based data with a simple query language (SQL) For applications that require traditional data storage, the Windows Azure Platform provides SQL Azure database services
Every application has a configuration file in Windows Azure By configuring this file, the application manager can configure the components that Windows Azure applications need to run
I.7 The situation of apply Cloud Computing in the world and Vietnam
Originating from grid computing from the 80s, on-demand computing (Utility
computing) and service software (SaaS), Oracle is a pioneer in deploying this technology
So far, cloud computing is being developed and provided by many vendors, including Amazon, Google, Data Synapse, Sales force as well as traditional vendors Microsoft,
Trang 34IBM, HP Has been a lot of people Personal use to big companies like L’Oreal,
General Electric, EBay, Coca-Cola accepted and used
Research firm Gartner assesses that the chief priority of CIOs will be virtualization and cloud computing enterprise applications to help their companies less worry about
managing their infrastructure believe in; focus more on steering the company's
development process According to the assessment, as of 2012, 80% of enterprises in the list of the top 1,000 companies (according to Fortune magazine - USA) will use at least some types of cloud services and about 20% the business will no longer own IT assets or infrastructure
Intending to solve data problems, calculations, services, for customers, cloud computing has been bringing great profits, providing a playground and a large market for suppliers service, so its rapid growth can be counted from day to day
According to VNUnet newspaper (April 28, 2009), 27.7% were concerned about data storage services in a survey of customers using cloud computing services; 17% on
financial related applications; 12.8% on e-mail
A recent report by the Gartner research team highlights the development of Cloud
Computing in the context of globalization According to the survey, about 50% of
businesses and organizations in the world are applying this modern technology with a growth rate of 17% per year In 2011, global cloud computing revenue reached US $ 2.4 billion Gartner predicts this number will nearly quadruple in 2012
Not out of the trend, Vietnam is gradually approaching cloud services through projects of some foreign enterprises such as Microsoft, Intel as well as from domestic developers and suppliers such as FPT, Biaki IBM is the pioneer enterprise in launching cloud computing center in Vietnam in September 2008 with the first customer is Vietnam Technology and Communication Joint Stock Company (VNTT) It can be said that
Vietnam is one of the first countries in ASEAN to use cloud computing
This technology is considered a solution to the problems that many companies are facing such as lack of IT capacity; limited infrastructure investment costs Currently many companies are wasting resources such as not fully exploiting the public server system
Trang 35capacity, investing too much in human terms Meanwhile, in theory, cloud computing will allow businesses to not focus too much on infrastructure or application upgrades, do not require large human resources and can easily change scale as needed
Although cloud computing is currently widely used in many developed countries around the world because of the significant benefits that it brings, but in Vietnam, businesses are not interested in this technology However, according to experts, this is the optimal
solution for businesses in our country to reduce costs as well as increase productivity at maximum level
Regarding the current situation of cloud computing application in Vietnamese
enterprises, the following conclusions can be drawn: Currently, there are a few large enterprises in Vietnam that put cloud computing into application and business
performance significantly improved However the number is quite small Most still just stop at interested and inquiring
So what are the main reasons for barriers to access and transition to the environment, such delayed cloud computing technology:
The first and most important reason is the perception of the business Each enterprise has
a different scale, organizational model, operating in different fields, but the guidelines and policies must be smoothed from the management team, executive board to the IT team and Relevant departments in the business Because, when selecting and switching to this technology platform, they face huge challenges in terms of capital, budget for initial investment costs (if building the Private Cloud or Hybrid Cloud model ), costs of
modernizing production and business processes, business management and operation processes, etc technical barriers, information security It can change all the traditional processes that businesses operate
The second reason is transmission bandwidth Cloud computing is highly dependent on the speed and bandwidth of the connection, directly affecting the performance and
response time of the application for the end user This is also one of the bottlenecks affecting the quality of service
Trang 36The third reason is the application, data and information security issues when everything
of an enterprise is put to the cloud This is also a matter of great interest and discussion in recent years: legal corridors, mechanisms, policies and methods to prevent and protect these important applications and data
However, it is also undeniable that the role and value of Cloud Computing is a major factor in promoting business transformation process in businesses and at the same time helping businesses operate more efficiently and intelligently More and more cost
savings
Currently, the situation is improving significantly According to a recent survey of
Symantec, a leading software company in the world, about 46% of Vietnamese
enterprises and organizations are deploying cloud computing and other virtualization projects
Security firm Symantec also said that Vietnamese businesses have a special interest in cloud computing applications and the opportunities that this new technology brings Results from the survey show that 39% of domestic enterprises are currently using
private virtual software (VPS) services, while 21% are virtualizing servers and databases The application of cloud computing technology in business as well as in life is an
inevitable development step with the trend of the times It is predicted that this is the 3rd wave of technology, which will affect the habits and thinking of applying technology today The difficult thing is how businesses as well as individuals accept the trend
For most Vietnamese businesses today (mostly small and medium-sized businesses), the language barrier is the biggest obstacle in the search for technologies to assist them Therefore, Vietnamese suppliers will be a bridge for domestic enterprises to follow the world technology trend It requires a certain level of technology, an understanding of the habits and culture of Vietnamese businesses, and above all the ability to train the market associated with technology Cloud Technology Provider in Vietnam does all three things well, so the Vietnamese market will not only be a potential market anymore
Cloud computing technology is a common trend of the era, the application; widespread development is indispensable, especially in the context of globalization But to keep up
Trang 37the trend, to market flourish, there are many urgent things Changing a working mindset,
an operating habit is something that suppliers have to do Vietnamese enterprises see and accept
Trang 38II Big Data - Overview And Its Impact On Health Care
II.1 The definition of Big Data and What is Big Data in Health Care System and
Why we need it?
II.1.1 The Definition Of Big Data
Big Data is the term has meaning a very large and complex data set that the traditional data tools cannot process However, Big Data contains a lot of valuable information that
if successfully extracted, will help a lot for business, scientific research, predicting
upcoming diseases, and even Real-time traffic conditions determination Therefore, these data must be collected, organized, stored, searched, and shared in a different way than normal In this article, invite you to learn about Big Data, the methods people use to exploit it, and how it helps our lives
According to the page SearchDataManagement "Big Data is a combination of structured, semi structured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling, and other
advanced analytics application”
The tremendous progress of science and technology, to the advent of communication channels that require an extremely large global data system such as social networks and advanced technology devices, poses a challenge It is not small for different industries to find other ways to handle data
Big Data (big data) is a collection of data that has a capacity beyond the performance of traditional applications and tools Big Data size is getting bigger and more important in the modern world and especially the current industry 4.0, as of 2012 it can range from a few dozen terabytes to many petabytes (1 petabyte = 1024 terabyte)
II.1.2 What Is Big Data In Health Care System? And Why We Need It?
The application of big data in health care has many impacts It will use specific data of patients at the time of examination and it can help to prevent disease, cure diseases, and reduce costs, etc
Trang 39Doctors always want to know as much as possible about a patient, and any signs of a disease as soon as possible to treat any disease in the early stages it will always be
simpler and less expensive, as well as have a better cure for incurable diseases
With today's improved technology, collecting that huge data has become easier and it is also converting it into important, useful, relevant information, which can then be used to provide for better care
This is the purpose of health-care data analysis: using data-based findings to predict and solve problems before it's too late, but also to assess treatment methods and regimens quickly better, better daily tracking, involving patients more about their own health and empowering them to access tools to do so
II.2 The Characteristics Of Big Data
In 2001, analyst Doug Laney of META Group (now the research firm Gartner) said that the challenges and opportunities in data growth can be described in three
dimensions: increasing volume, increase in velocity (velocity) and increase in variety (variety) They along with many other companies and organizations continue to use this "3V" model By 2012, Gartner added that Big Data in addition to the above three features must "need new forms of processing to help make decisions, delve into
things, and optimize the working process" Here is Gartner's new concept of Big Data
2014 under the "5Vs" model - 5 important properties that say Big Data
o Volume Talk about the amount of data created and stored The size of the data will
be assessed as valuable and potential, and to consider whether it can be considered as big data
With a large amount of daily information constantly updated on the Internet, such as Facebook receives nearly 350 million images, more than 4.5 billion likes, and nearly
10 billion messages and comments every day For that reason, traditional types of data storage and analysis are in no way possible But with the technology we are talking
Trang 40about here, it can easily process and store all the information on separate small branch systems
o Variety (Variety of data types) This concept refers to the type of data (nature of data) This helps analysts effectively use the details of the results They are composed
of text (text), image (image), sound (sound), video; plus it completes the missing part through data aggregation algorithms
o Velocity (High-speed processing capability) In this day and age, data rates are created and processed to meet the needs and challenges that lie in the path of growth and development Big data is usually available in real-time [6]
o The speed of data created and then transferred from one place to another has now reached a staggering rate Like people can chat with each other on Facebook at a fast pace in today's network environment Big data allows us to analyze the generated data without saving them to the database
o Variability: The current data types are so diverse and inconsistent that it can
interfere with its processes and management
o Value of data The quality of the data collected can vary greatly, which will greatly affect the accurate analysis We can see this is nature as well as the concept that
businesses or researchers who want to use and exploit Big Data must hold and
understand it first
II.3 Stored in Big Data Analysis System
II.3.1 Hard disk storage technology
Disk storage often uses low-cost hard drives for long-term storage Disk storage can be done via distributed file systems or databases
Distributed file storage system
Distributed file systems support schema-less data storage, providing redundancy and high availability by copying data to different locations implemented with a distributed file