Course Details
Topic 1: Overview of Text Mining
- What is Text Mining
- Upload Corpus
- Upload Document
Topic 2: Preprocessing Text
- Pre-process Text
- Stopwords
- Bag of Words (BoW)
- Similarity Hashing
Topic 3: Online API Search
- Wikipedia
Topic 4: Text Clustering
- Cosine Distance
- Hierarchical Clustering & Dendrogram
- Text Clustering Workflow
- Similarity Clustering with Hashing
Topic 5: Text Classification
- What is Classification
- Classification Algorithms
- Text Classification and Prediction
- K-Fold Cross Validation
- Similarity Hashing
Topic 6: Topic Modeling
- Latent Semantic Indexing (LSI)
- Latent Dirichlet Allocation (LDA)
Course Info
Prerequisite:
This course is for beginners. No programming or coding knowledge is required.
Software Requirement
Please download and install the following software prior to the class.
HRDF Funding
Please refer to this video: https://youtu.be/Kzpd-V1F9Xs
1- HRD Corp Grant Helper
How to submit grant applications for HRD Corp Claimable Courses
2- Employers are required to apply for the grant at least one week before training commences.
Employers must submit their applications with supporting documents, including invoices/quotations, trainer profiles, training schedule and course content.
3- First, log in to the employer’s e-TRiS account: https://etris.hrdcorp.gov.my
Second, click Application
4- Click Grant on the left side under Applications
5- Click Apply Grant on the left side under Applications
6- Click Apply
7- Choose a Scheme Code and select HRD Corp Claimable Courses: Skim Bantuan Latihan Khas. Then, click Apply
8- A Scheme Code represents all types of training that suit the requirements provided by HRD Corp. Below is the list of schemes offered by HRD Corp:
9- Select your Immediate Officer and click Next
10- Select a Training Provider, then click Next
11- Please select a training programme from the list, then key in all the required details and click Next
- Select your desired training programme.
- Explain why the participant is required to attend the training, e.g., because it relates to their tasks or career development.
- Explain the background and objective of this training.
- Select a relevant focus area. For Employer-Specific Courses, select ‘Not Applicable’.
12- If the training programme is a micro-credential programme, you are required to complete these 3 fields. Save and click Next
- Insert the MiCAS Application number
13- HRD Corp Focus Area Courses are based on the nine (9) pillars listed below and support government initiatives towards nation building. As such, courses offered through the HRD Corp Focus Areas are designed to provide the workforce with the skills required for current and future demands. Details of the focus areas are as follows:
14- Please select a Course Title and Type of Training
15- Select the correct type of training according to the actual type of training, or as mentioned in the training brochure:
16- Please key in the Training Location and click Next
17- Please select the Level of Certification and click Next
18- Please follow the instructions and key in trainee details
19- Click Add Batch, then click Save
20- Click Add Trainee Details
21- Please key in all the required details, then click Add
22- Click Add if there are more participants. Once done, click Save
23- Click Next
24- Please key in the course fees and allowance details, then click Save
25- Estimated cost includes the course fees/external trainer fees, allowances, and consumable training materials. Please comply with the HRD Corp Allowable Cost Matrix.
26- Select Upfront Payment to Training Provider and key in the percentage from 0% to 30%. Then, click Save and Next
27- Complete the declaration form and select a desired officer
28- Add all the required documents, then click Add Attachment. Then, click Save and Submit Application
29- Once the New Grant Application is successfully submitted, the Grant Officer will evaluate the application accordingly. The application may be queried if additional information is required.
The application status will be updated via the employer’s dashboard, email, and the e-TRiS inbox.
Job Roles
- Data Scientist
- Natural Language Processing Engineer
- Text Mining Specialist
- Machine Learning Engineer (focused on text data)
- Content Analyst
- Big Data Analyst
- Sentiment Analysis Specialist
- Information Retrieval Engineer
- Computational Linguist
- Data Journalist (with coding skills)
- SEO Specialist (using text analytics for insights)
- Digital Marketing Analyst
- Knowledge Engineer
- Chatbot Developer
- Content Recommendation System Developer
Trainers
Dr. Aanand: Dr. Aanand is a Full Stack Data Scientist who once had a torrid love affair with Physics. He has consulted and published in the areas of Public Health, Electricity Markets, Telecom, BFSI, Advertising & Communication Strategies, and Digital & Social Media Technologies. He has worked on assignments with international agencies such as the International Monetary Fund, the World Bank and the Royal Netherlands Embassy, with MNCs like Tata Consultancy Services and Kie Square Consulting, and with several government organizations of national importance. He regularly conducts general training programs in Python (Pandas, NumPy, SciPy, Matplotlib, Bokeh), R (dplyr, rstanarm, knitr, ggplot2), Data Visualization (Tableau, D3.js) and Machine Learning (Reinforcement Learning, scikit-learn), as well as specialized training programs on Structural Equation Modeling and SAP HANA. He holds a doctorate in Operations Research from the Indian Institute of Management Ahmedabad and a postgraduate degree in Physics from the University of Mumbai. He has advanced training in mathematical programming, including optimization, advanced multivariate data analysis, and simulation techniques. When he is not teaching or consulting, he can be found meditating or heading for an adventurous trek.