WHITEPAPER
www.persistent.com
© 2017 Persistent Systems Ltd. All rights reserved.
10.3.2 Azure Components in detail
10.3.2.1 Data Factory
The Azure Data Factory service lets you create data pipelines that move and transform data, and then
run those pipelines on a specified schedule (hourly, daily, weekly, etc.). Its main purpose is to ingest data
from various on-premises and cloud data sources into Azure.
Features
• Easily move data between different Azure cloud-based storage services and various database input
sources.
• Move data from different file systems (e.g. HDFS, Amazon S3 and FTP).
• Transform data using activities (e.g. Hive, Pig and MapReduce).
• Visualize, manage and monitor the entire network of pipelines to identify and track issues.
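The idea of a pipeline made of activities that runs on a schedule can be illustrated with a minimal sketch in plain Python. It does not use the Data Factory API: the names `FREQUENCIES`, `schedule_slices`, `run_pipeline`, `copy_activity` and `uppercase_names` are hypothetical and exist only for this illustration.

```python
from datetime import datetime, timedelta

# Hypothetical schedule names mapped to window lengths (not Data Factory values).
FREQUENCIES = {
    "hourly": timedelta(hours=1),
    "daily": timedelta(days=1),
    "weekly": timedelta(weeks=1),
}


def schedule_slices(start, end, frequency):
    """Return the (window_start, window_end) pairs a scheduled pipeline would cover."""
    step = FREQUENCIES[frequency]
    slices = []
    current = start
    while current + step <= end:
        slices.append((current, current + step))
        current += step
    return slices


def run_pipeline(records, activities):
    """Apply each activity (a plain function) to the data, in order."""
    for activity in activities:
        records = activity(records)
    return records


def copy_activity(records):
    """Stand-in for a data movement step: copy records from the source."""
    return list(records)


def uppercase_names(records):
    """Stand-in for a transformation step (e.g. a Hive or Pig job)."""
    return [{**r, "name": r["name"].upper()} for r in records]


source = [{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta"}]
slices = schedule_slices(datetime(2017, 1, 1), datetime(2017, 1, 2), "hourly")
result = run_pipeline(source, [copy_activity, uppercase_names])
print(len(slices), result)
```

In this sketch each schedule window would trigger one run of the same ordered list of activities; the real service additionally handles connectivity to the data sources, retries and monitoring.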
10.3.2.2 Log Analytics
Log Analytics helps to collect and analyze the log data generated by various cloud or on-premises
resources. All collected logs are stored in the Operations Management Suite
(OMS) repository.
Features
• Ability to gather different types of log information, such as text file logs on Windows, Windows Event
logs, Windows Performance counters, Linux Performance counters, IIS logs, Syslog, Azure Storage etc.
• Advanced searching on gathered log information with the help of supported keywords (e.g. error, computer
name and timeout) and a search query language (e.g. system error | sort ManagementGroupName).
• Ability to schedule alerts based on search criteria.
• The OMS UI allows creating dashboards/insights of log data for better visualization.
• Can export log data to the Power BI tool.
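The shape of a search such as system error | sort ManagementGroupName (keywords first, then a piped stage) can be sketched over in-memory records. This is a toy model, not the OMS search engine: the function `search_logs` and the sample records are hypothetical, and treating space-separated keywords as "all must match" is an assumption made for the illustration.

```python
def search_logs(records, query):
    """Filter records by keywords, then apply any '| sort Field' stages."""
    parts = [p.strip() for p in query.split("|")]
    keywords = parts[0].lower().split()

    def text_of(record):
        # Search across every field value of the record, case-insensitively.
        return " ".join(str(v) for v in record.values()).lower()

    # Assumption: a record matches only if it contains every keyword.
    matches = [r for r in records if all(k in text_of(r) for k in keywords)]

    for stage in parts[1:]:
        tokens = stage.split()
        if len(tokens) == 2 and tokens[0].lower() == "sort":
            field = tokens[1]
            matches = sorted(matches, key=lambda r: str(r.get(field, "")))
    return matches


# Hypothetical log records for the illustration.
logs = [
    {"Computer": "web01", "ManagementGroupName": "Prod", "Message": "System error: timeout"},
    {"Computer": "db01", "ManagementGroupName": "Dev", "Message": "system error on disk"},
    {"Computer": "web02", "ManagementGroupName": "Prod", "Message": "Service started"},
]

hits = search_logs(logs, "system error | sort ManagementGroupName")
print([h["Computer"] for h in hits])
```

A scheduled alert, as described above, could then be modelled as running such a search periodically and raising a notification when the number of hits crosses a threshold.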
10.3.2.3 Document DB (NoSQL database)
Document DB is a fully managed NoSQL database as a service (DBaaS). It is a document-based NoSQL
database that provides fast and predictable performance, high availability, elastic scaling, and global
distribution.
Features
• Supports SQL syntax for querying across multiple documents.
• Document DB automatically indexes all documents.
• The service supports JavaScript, which allows users to write transactional logic, triggers,
user-defined functions, stored procedures etc.
• Integrates easily with the HDInsight service (Hadoop service).
• It provides data access control over multiple databases and their resources with the help of master
keys, read-only keys and resource tokens.
• Automatically replicates all your data across regions worldwide.
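Automatic indexing and SQL-style querying over documents can be illustrated with a small in-memory sketch. It is not the Document DB engine or SDK: `DocumentStore`, `iter_paths` and the sample documents are hypothetical, and the path notation is chosen only for this example. Every property of every inserted document is added to an index, so an equality filter can be answered without scanning the documents.

```python
from collections import defaultdict


def iter_paths(value, prefix=""):
    """Yield (path, scalar) pairs for every leaf in a nested JSON-like value."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from iter_paths(child, f"{prefix}/{key}")
    elif isinstance(value, list):
        for child in value:
            yield from iter_paths(child, f"{prefix}/[]")
    else:
        yield prefix, value


class DocumentStore:
    """Toy store that indexes every property of every document on insert."""

    def __init__(self):
        self.docs = {}
        self.index = defaultdict(set)  # (path, value) -> set of document ids

    def insert(self, doc):
        self.docs[doc["id"]] = doc
        for path, value in iter_paths(doc):
            self.index[(path, value)].add(doc["id"])

    def find(self, path, value):
        """Equality lookup served from the index, without scanning documents."""
        return [self.docs[i] for i in sorted(self.index.get((path, value), ()))]


store = DocumentStore()
store.insert({"id": "1", "city": "Pune", "tags": ["iot", "azure"]})
store.insert({"id": "2", "city": "Pune", "address": {"zip": "411001"}})
store.insert({"id": "3", "city": "Nagpur", "tags": ["azure"]})

# Roughly the result of a query such as: SELECT * FROM c WHERE c.city = "Pune"
print([d["id"] for d in store.find("/city", "Pune")])
```

No index definition is needed before inserting documents with new properties, which is the practical benefit the automatic indexing feature refers to; the real service also supports range and other query types that this sketch omits.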