Apache NiFi is open source software for automating and managing the flow of data between systems. It is a powerful and reliable system for processing and distributing data. It provides a web-based user interface for creating, monitoring, and controlling data flows. Its data flow process is highly configurable and can be modified at runtime.
Apache NiFi is easily extensible through the development of custom components.
Here are reasons for using Apache NiFi:
Apache NiFi has a well-thought-out architecture. Once data is fetched from external sources, it is represented as a FlowFile inside the Apache NiFi architecture.
Here are the key components of the NiFi architecture:
| NiFi Component | Description |
|---|---|
| FlowFile | A FlowFile is the original data with meta-information attached to it. It allows you to process not only CSV or other record-based data, but also pictures, videos, audio, or any other binary data. |
| FlowFile Processor | Performs the actual work and acts as the building block of a data flow in NiFi. |
| Flow Controller | Keeps a record of how processes are connected. It manages the threads, and the allocations thereof, that all processes use. |
| Web Server | Hosts NiFi's HTTP-based command and control API. |
| Extension | There are many types of NiFi extensions, all of which operate and execute within the JVM. |
| Connection | Acts as the link between processors. It contains a queue and one or more relationships, which determine where data is routed. |
| Back Pressure | Prevents the system from being overrun by limiting the number or total data size of FlowFiles that can be stored in the queue. |
| Process Group | A process group is a set of processes and their connections, which receives and sends data through ports. |
| FlowFile Repository | In the FlowFile Repository, NiFi keeps track of the state of what it knows about each FlowFile that is currently active in the flow. |
| Content Repository | The Content Repository is the area where the actual content bytes of a given FlowFile are stored. |
| Provenance Repository | The Provenance Repository is the area where all provenance event data is stored. |
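
To make the relationship between a FlowFile, a processor, a connection, and back pressure more concrete, here is a minimal Python sketch. It is a toy model only and not NiFi's actual API: the class names, the `offer`/`poll` methods, the threshold defaults, and the `processed.by` attribute are all hypothetical and chosen for illustration. In this simplified model a full connection simply rejects new FlowFiles.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class FlowFile:
    """Toy stand-in for a FlowFile: raw content bytes plus attribute metadata."""
    content: bytes
    attributes: dict = field(default_factory=dict)


class Connection:
    """Toy queue between two processors with back pressure thresholds."""

    def __init__(self, max_count=3, max_bytes=1024):
        self.max_count = max_count  # object-count threshold (hypothetical default)
        self.max_bytes = max_bytes  # data-size threshold (hypothetical default)
        self.queue = deque()

    def queued_bytes(self):
        """Total size of the content currently waiting in the queue."""
        return sum(len(ff.content) for ff in self.queue)

    def is_full(self):
        """Back pressure applies once either threshold is reached."""
        return (len(self.queue) >= self.max_count
                or self.queued_bytes() >= self.max_bytes)

    def offer(self, flowfile):
        """Enqueue unless back pressure is active; return True if accepted."""
        if self.is_full():
            return False
        self.queue.append(flowfile)
        return True

    def poll(self):
        """Hand the oldest queued FlowFile to the downstream processor."""
        return self.queue.popleft() if self.queue else None


def uppercase_processor(flowfile):
    """Toy processor: transforms the content and records itself in an attribute."""
    return FlowFile(
        flowfile.content.upper(),
        {**flowfile.attributes, "processed.by": "uppercase_processor"},
    )
```

With `Connection(max_count=2)`, a third `offer` call returns `False` until the downstream side calls `poll`, which is the essence of back pressure: the upstream side has to wait until the queue drains.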
Below is a step-by-step process for installing Apache NiFi on an Amazon EC2 instance:
Step 1) Go to the link, and click "Continue to Subscribe".
Step 2) On the next page, click "Accept Terms".
Step 3) You will see a page with the message "Thank you for subscribing to this product! We are processing your request."
Step 4) Refresh the page after 5 minutes, then click "Continue to Configuration".
Step 5) On the next page, keep the default settings and click "Continue to Launch".
Step 6) On the next page, click "Launch". You may need to create a key pair.
Step 7) You will see the success message "Congratulations! An instance of this software is successfully deployed on EC2!"
Step 8) Note the instance ID and the public DNS of the EC2 instance.
Step 9) In the security group, add an "All traffic" rule to both the inbound and outbound rules.
Step 10) To access NiFi, simply use the URL
http://publicdns:8080/nifi
In our case, it becomes
http://ec2-100-26-156-57.compute-1.amazonaws.com:8080/nifi/
User: admin
Password: the instance ID noted in Step 8
Step 11) You will see the NiFi home screen.
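
The URL from Step 10 can also be assembled and checked from a script. The following Python sketch assumes the port (8080) and the `/nifi/` path used in this tutorial; the function names `nifi_ui_url` and `is_reachable` are hypothetical helpers, not part of NiFi or AWS.

```python
from urllib.error import HTTPError
from urllib.parse import urlunsplit
from urllib.request import urlopen


def nifi_ui_url(public_dns, port=8080):
    """Build the NiFi UI URL in the form http://<public DNS>:<port>/nifi/."""
    return urlunsplit(("http", f"{public_dns}:{port}", "/nifi/", "", ""))


def is_reachable(url, timeout=5.0):
    """Return True if a server answers at the URL, False if the connection fails."""
    try:
        with urlopen(url, timeout=timeout):
            return True
    except HTTPError:
        # The server responded, even though the status was an error code.
        return True
    except OSError:
        # Covers refused connections, DNS failures, and timeouts.
        return False


if __name__ == "__main__":
    # Public DNS taken from the example in this tutorial; replace it with your own.
    url = nifi_ui_url("ec2-100-26-156-57.compute-1.amazonaws.com")
    print(url, "reachable" if is_reachable(url) else "not reachable")
```

If the check reports that the URL is not reachable, the security group rules from Step 9 are the first thing to review.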
Below is a list of Apache NiFi use cases:
| Industry | Usage |
|---|---|
| Insurance | |
| HealthCare | |
| Telecommunication | |
| Manufacturing | |
| Oil & Gas Industry | |
| Financial Services | |