Introduction
Data mining is a computational process of extracting valuable data from large data sets through sorting to identify the correlations among them and visualizing data that communicates critical insights. There are various tools to carry out the process of data mining. Some of them are Orange, Knime, Kaggle, Rapid miner, etc.
In this article, we will extensively discuss the orange tool for data mining.
Orange for Data Mining
Orange is a powerful platform for data analysis and visualization, data flow, and productivity. It is an open-source data mining tool and includes the end of visual programs for fast quality data analysis for testing and interactive data display.
You need to download and install orange from this site to work with orange. Follow the following steps to set up a workflow.
Step 1: Open Orange software
When you open orange, the first thing you will see is a welcome screen. The welcome screen provides options to start a new data analysis, open a recent one or explore tutorials.
Step 2: Exit the welcome screen
Let's exit the welcome screen. Orange starts with a blank canvas, and the left side of the window contains various widgets. Widgets are computational units of orange; they read the data, process it, visualize it, do clustering, build predictive models and otherwise help you explore the data.
Step 3: Load file widget
The workflow of most of the data analysis begins with the file widget. The file widget helps to load our data. To load it on the canvas, click on the icon, and it will appear on the canvas.
Step 4: Load the file in the file widget
Double click the icon to open the file widget and browse documentation sets and load one of the existing data files that come with orange, or you can load your data.
Step 5: Load the data table widget
Now, the data table widget needs to be loaded on the canvas to see what data has been selected. To load this widget, click on the widget, and it will appear on the screen.
Step 6: Connect file widget to data table widget
Widgets can communicate with one another with the help of their input channel and output channel.
Drag the line from the file widget right output side to the data table widget left input side, as shown in the image. Click the data table widget, and you will see the data you have loaded in the file widget.
Step 7: Load scatter plot widget
You can also plot the data by simply using a scatter plot widget. It will display all your data scatter plots.
Congratulations, you have just set up your first workflow with all the above steps.