Abstract-Presently educational institutions compile and store huge volumes of data such as student enrolment and attendance records, as well as their examination results. Mining such data yields stimulating information that serves its handlers well. Rapid growth in educational data points to the fact that distilling massive amounts of data requires a more sophisticated set of algorithms. This issue led to the emergence of the field of Educational Data Mining (EDM). Traditional data mining algorithms cannot be directly applied to educational problems, as they may have a specific objective and function.
This implies that a preprocessing algorithm has to be enforced first and only then some specific data mining methods can be applied to the problems. One such preprocessing algorithm in EDM is Clustering. Many studies on EDM have focused on the application of various data mining algorithms to educational attributes. Therefore, our project provides over three decades long (1983-2016) systematic literature review on clustering algorithm and its applicability and usability in the context of EDM.
mining.Ourproject classifies the students record based on achievement. Their result is predicted through data mining with the help of the internal marks scored. Future insights are outlined based on the literature reviewed, and avenues for further research are identified.
Data mining is a process used by companies to turn raw data into useful information. By using software to look for patterns in large batches of data, businesses can learn more about their customers to develop more effective marketing strategies, increase sales and decrease costs.
The biggest challenge is to analyse the data to extract meaningful information that can be used to solve a problem or for the growth of the business. There are powerful tools and techniques available to mine data and find insights from it. The data mining process breaks down into five steps. First, organizations collect data and load it into their data warehouses. Next, they store and manage the data, either on in-house servers or the cloud. Business analysts, management teams and information technology professionals access the data and determine how they want to organize it. Then, application software sorts the data based on the user’s results, and finally, the end user presents the data in an easy-to-share format, such as a graph or table. Data mining serves the primary purpose of discovering patterns among large volumes of data and transforming data into more refined/ actionable information
II. BACKGROUNDIn, Pauziah Moud Arshad, Usamah Binmat, Norlida Buniyamin Educational Data Mining for prediction and classification of students achievement
This paper highlights the importance of using student data to drive improvement in education planning. It then presents techniques of how to obtain knowledge from databases such as large arrays of student data from academic Institution databases. Further, it describes the development of a tool that will enable faculty members to identify, predict and classify students based on academic performance measured using Cumulative Grade point average (CGPA) grades.
In , Maizatul Akmar Ismail, Tutut Herawan and Ashish Dutt A Systematic Review on Educational Data Mining
This paper has also outlined several future insights on educational data clustering based on the existing literatures reviewed, and further avenues for further research are identified. The key advantage of the application of clustering algorithm to data analysis is that it provides relatively an unambiguous schema of learning style of students given a number of variables like time spent on completing learning tasks, learning in groups, learner behavior in class, classroom decoration and student motivation towards learning. Clustering can provide pertinent insights to variables that are relevant in separating the clusters.
In, Vandhana Dahiyas work survey on educational data mining, the proposed solution explains the objective of educational data mining into 3 categories educational, administrative, commercial.
In, Christobal Romero and Sebastian Venturas data mining in education, the proposed system introduces and reviews key milestones and the current state of affairs in the field of EDM, together with specific applications, tools, and future insights.
In, Rajni Jindal and Malaya Dutta Borahs survey on educational data mining and research trends, this survey work focuses on components, research trends (1998-2012) of Educational Data Mining highlighting its related tools, techniques and outcomes.
In Existing system, the Clustering and Classification Algorithms in Education Data Mining Process are used. In Clustering process c-means , K-means and classification process SVM classification or naive bayes Algorithms are used for clustering and classification process. After the Clustering process is completed the clustered input is passed to the Classifier process. It is an unsupervised approach for analyzing data in statistics, machine learning, pattern recognition, DM, and bio informatics. It refers to collecting similar objects together to form a group or cluster. Each cluster contains objects that are similar to each other but dissimilar to the objects of other groups. This approach when applied to analyze the dataset derived from educational system is termed as Educational Data Clustering (EDC). An educational institution environment broadly involves three types of actors namely Teacher, student and the environment. Interaction between these three actors generates voluminous data that can systematically be clustered to mine invaluable information. Data clustering enables academicians to predict student performance, associate learning styles of different learner types and their behaviors and collectively improve upon Institutional performance. Researchers, in the past have conducted studies on educational datasets and have been able to cluster students based on academic performance in examion IV. PROPOSED SYSTEM
In proposed system, the Fuzzy Classification and Clustering Algorithm for Educational Data Mining Process is used. Fuzzy clustering (also referred to as soft clustering) is a form of clustering in which each data point can belong to more than one cluster.
In Fuzzy Classification process it based on Student all semester marks details, total counting and College overall result. It gives the Classification result. This Classification result based Fuzzy Graph generate process it based on Education Department Based
Educational datas can be easily accessed
This process fully concentrates on educational purpose
Input design is the process of connecting the user-originated inputs into a computer to used formats. The goal of the input design is to make data entry Logical and free from errors. Errors in the input database controlled by input design this application is being developed in a user-friendly manner. The forms are being designed in such a way that during the processing the cursor is placed in the position where the data must be entered. An option of selecting an appropriate input from the values of validation is made for each of the data entered. Concerning clients comfort the project is designed with perfect validation on each field and to display error messages with appropriate suggestions. Help managers are also provided whenever user entry to a new field he/she can understand what is to be entered. Whenever user enter a error data error manager displayed user can move to next field only after entering a correct data
Computer output is the most important and direct source of information to the user. Efficient intelligible output design should improve the system’s relationship with the user and admin in decision-making. Output design generally refers to the results generated by the system. For many end users on the basis of the output the evaluate the usefulness of the application. Efficient software must be able to produce and efficient effective reports.
6.1 MODULE DESCRIPTION
6.1.1 Admin Module
In this admin login page provide the Process security the admin is handle in overall process so we implement the admin login security it avoid the un authentication person.
If the correct user was the login it send the successful message else it show the alert message.
6.1.2 Load Data/ Preprocessing
In this module, the admin may select the any Education Dataset (2013,2014,2015) in this dataset many type of attributes are there for Ex(Student department, Semester result ,Register/Roll
Data pre-processing is an important step in the data mining process. The phrase “garbage in, garbage out” is particularly applicable to data mining and machine learning projects.
-298451385570Data pre-processing IData pre-processing II
00Data pre-processing IData pre-processing II
Data-gathering methods are often loosely controlled, resulting in out-of-range values (e.g., Income: ?100), impossible data combinations (e.g., Sex: Male, Pregnant: Yes), missing values, etc. Analyzing data that has not been carefully screened for such problems can produce misleading results. number etc..,)
Fuzzy clustering (also referred to as soft clustering) is a form of clustering in which each data point can belong to more than one cluster.Clustering or cluster analysis involves assigning data points to clusters such that items in the same cluster are as similar as possible, while items belonging to different clusters are as dissimilar as possible. Clusters are identified via similarity measures these similarity measures include distance, connectivity, and intensity Different similarity measures may be chosen based on the thatFuzzy classification is the process of grouping elements into a fuzzy set whose membership function is defined by the truth value of a fuzzy propositional function. After the Classification process is completed the data was stored in department, class ,total ,average wise.This datas are stored in databases we calculate the over all class count for each department and total year college student education level report.a or the application.
Finally the classification report is completed it generate the Graph for based on Student department, class, total average, semester result