WORKING FILES
Govt Stuff
Youtube - Lawrence Wong on AI NDP Rally 2025
Straits Times - Jobs become National Security Issue
Straits Times - Millions Lose Jobs by 2027
Straits Times - Top 15 Jobs Singaporeans are Looking For
Straits Times - PM Lee outlines 3 geopolitical storms Singapore faces
Lee Hsien Loong meets Tim Cook
Lee Hsien Loong meets Mark Zuckerberg
Sunrise vs Sunset Industries Singapore 2024
TI Stuff
DS AI Stuff
SSG Data Scientist Skills Map
SG AI Verify
Data Stuff
Cloud Stuff
JOBS Stuff
Singapore Stuffs
Alvin's Stuffs
Scammers
Straits Times - Sentosa Money Laundering Fujian Gang
Straits Times - Chia Teck Leng Cheats $117 million
CNA - Malone Lam Crypto USD 230 million
CNA - Casino Excel MBS Baccarat
Mothership - 3 Arrows Zhu Su Crypto Jail
CNA - Vizzio CEO Jon Lee Fake Phd
Straits Times - Forex Trader Scammer
Straits Times - AI Investment Scammer
Apache Spark Training
Step 1: Local Laptop Cluster Setup
Teach the local laptop cluster setup, then ask learners to repeat it on another laptop.
Anaconda3-2022.10-Windows-x86_64
https://www.apache.org/dyn/closer.lua/spark/spark-3.3.1/spark-3.3.1-bin-hadoop2.tgz
https://github.com/steveloughran/winutils/archive/refs/heads/master.zip
Environment Variables (windows)
HADOOP_HOME = C:\SPARK\hadoop
JAVA_HOME = C:\Program Files\Java\jdk1.8.0_202
SCALA_HOME = C:\SPARK\scala
SPARK_HOME = C:\SPARK\spark
PYSPARK_PYTHON = C:\Users\user\anaconda3\python.exe
PATH Variables:
%SPARK_HOME%\bin
%HADOOP_HOME%\bin
%SCALA_HOME%\bin
%JAVA_HOME%\bin
Jupyter Environment Variables
PYSPARK_DRIVER_PYTHON = C:\Users\User\anaconda3\Scripts\jupyter.exe
PYSPARK_DRIVER_PYTHON_OPTS = notebook
Step 2: Local Spark Shell
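Before launching the Spark shell, it can help to confirm that the Step 1 variables are actually visible to Python. The following is a minimal sketch using only the standard library; the helper name `check_spark_env` is hypothetical, and the variable list simply mirrors the one given in Step 1.

```python
import os
from pathlib import Path

# Variables listed in Step 1; each should point at an existing file or folder.
REQUIRED_VARS = ["HADOOP_HOME", "JAVA_HOME", "SCALA_HOME", "SPARK_HOME", "PYSPARK_PYTHON"]


def check_spark_env(env=None):
    """Return {variable: problem} for every variable that is unset or points nowhere."""
    env = os.environ if env is None else env
    problems = {}
    for name in REQUIRED_VARS:
        value = env.get(name)
        if not value:
            problems[name] = "not set"
        elif not Path(value).exists():
            problems[name] = f"path does not exist: {value}"
    return problems


if __name__ == "__main__":
    issues = check_spark_env()
    if issues:
        for name, problem in issues.items():
            print(f"{name}: {problem}")
    else:
        print("All Spark-related variables are set and point to existing paths.")
```

An empty result suggests the variables are set and resolvable; it does not prove that Spark itself starts correctly.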
Step 3: Spark cluster in Google Cloud
Step 4: Spark cluster in AWS Databricks
Step 5: Spark on colab
How To Start A Spark Session & Read in CSV from Website.ipynb
Datacamp Pyspark Cheatsheets
Step 6: Pyspark DataFrames
Dataframes with PySpark Example 1 by Dr Alvin.ipynb
Dataframes with PySpark Example 2 by Dr Alvin.ipynb
purchases.csv (this dataset has 4 million rows, so it cannot be opened in Excel)
DataFrames with PySpark Example 2 (EXERCISE) by Dr Alvin Ang.ipynb
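A file the size of purchases.csv will not open fully in Excel because a worksheet is limited to 1,048,576 rows. The sketch below, using only the standard library, streams a CSV to count its rows without loading it into memory. The function names are illustrative assumptions, and the file is assumed to be UTF-8 encoded.

```python
import csv

EXCEL_MAX_ROWS = 1_048_576  # worksheet row limit in modern Excel (.xlsx)


def count_csv_rows(path, has_header=True):
    """Count data rows by streaming the file one record at a time."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        if has_header:
            next(reader, None)  # skip the header row if present
        return sum(1 for _ in reader)


def fits_in_excel(path, has_header=True):
    """True when the header (if any) plus all data rows fit on one worksheet."""
    total = count_csv_rows(path, has_header) + (1 if has_header else 0)
    return total <= EXCEL_MAX_ROWS
```

For example, `fits_in_excel("purchases.csv")` could be called with the file in the working directory; for a file that does not fit, a tool such as PySpark is the more practical choice.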
Step 7: Showcase VAEX
VAEX for Crunching Big Data by Dr Alvin Ang.ipynb
Automated_Traffic_Volume_Counts.csv (this dataset is 3 GB)
Others
hierarchical-clustering-with-python-and-scikit-learn-shopping-data.csv
Structured_Streaming_with_PySpark.ipynb
KMEANS_PySpark_by_Dr_Alvin.ipynb
Hierarchical_Clustering_In_Spark_With_Bisecting_K_Means.ipynb
Normalizer,_Scaler,_Bucketizer_and_Binarizer_with_PySpark_by_Dr_Alvin_Ang.ipynb
https://www.linkedin.com/in/nandeshreddy/
Soya Kim - Big Data Healthcare
Spark Commands by Raj Bharat.txt
https://www.youtube.com/watch?v=lisIQ9ohU8g
https://www.datanami.com/2024/03/05/duckdb-walks-to-the-beat-of-its-own-analytics-drum/
Tensorflow Training
https://starttechacademy.com/cnn-for-computer-vision-in-python/
Keras Datasets: https://keras.io/api/datasets/
CHEQUE - Digital Recognition by Deep Learning Techniques.pdf
halal food prediction paper using ANN.pdf
https://thehighestofthemountains.com/brainmaps.html
deep learning interview questions.pdf
https://stanford.edu/~shervine/teaching/cs-229/cheatsheet-deep-learning
https://raw.githubusercontent.com/tertiarycourses/datasets/master/video_game_sales_training.csv
https://generated.photos/faces/asian-race
https://quickdraw.withgoogle.com/
https://neuralnetworksanddeeplearning.com/index.html
ANN
Dr. Alvin’s IBF Day 3 - ANN Regression.ipynb
Dr. Alvin’s IBF Day 3 - ANN Classification.ipynb
CNN
Understanding CNN with Dr. Alvin.ipynb
Dr. Alvin's IBF Day 4 CNN on MNIST Digits Dataset.ipynb
(Classifying Handwritten Digits with CNN)
Trained Classification Model for Fashion MNIST Dataset.h5
Predicting Images of Clothes using CNN by Dr Alvin Ang.ipynb
Predicting Images of Common Objects using CNN by Dr Alvin Ang.ipynb
https://setosa.io/ev/image-kernels/
RNN
Understanding LSTM Input and Output (RNN) by Dr. Alvin Ang.ipynb
IBF Day 4: Predicting DBS Stock Price using RNN by Dr Alvin Ang.ipynb
Datacamp DL Cheatsheets
Python Training
Beginner
Data Types: List / Tuples / Dictionary / Sets
Logical Indexing and Operators
Control Structures: If - Else / While / For
Functions: Def / Lambda
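As a rough illustration of the beginner topics, here is a minimal sketch in plain Python. All variable names and values are made up for the example, and "logical indexing" is shown as a filtering comprehension, which is an assumption about how the course treats it.

```python
# Data types: list, tuple, dictionary, set
scores = [72, 45, 88, 45]        # list: ordered, mutable, allows duplicates
point = (3, 4)                   # tuple: ordered, immutable
ages = {"amy": 31, "ben": 27}    # dictionary: maps keys to values
unique_scores = set(scores)      # set: duplicates removed

# Logical indexing and operators: keep items where the test is True
passed = [s for s in scores if s >= 50 and s != 100]


# Control structures: if / else
def grade(score):
    if score >= 50:
        return "pass"
    else:
        return "fail"


# Control structures: while
countdown = []
n = 3
while n > 0:
    countdown.append(n)
    n -= 1

# Control structures: for
grades = []
for s in scores:
    grades.append(grade(s))


# Functions: def versus lambda
def square(x):
    return x * x


cube = lambda x: x ** 3
```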
Intermediate
[Comprehension.i.for.i.in.range] + [Mounting.Google.Drive.File.IO] + [Object.Oriented.Programming.Class.Inheritance.Parents] + [SQLite.Databasing] + [Error.Handling.IndexError.IOError.TypeError.ValueError]
Mounting and Creating Files in Google Drive with Colab.ipynb
Updating HR Records using Object Oriented Programming by Dr. Alvin Ang
https://pandas.pydata.org/docs/user_guide/visualization.html#visualization-barplot
https://python-tricks.com/matplotlib-introduction/
https://python-tricks.com/plotting-in-pandas/
https://www.tutorialspoint.com/how-to-change-the-text-color-of-font-in-the-legend-using-matplotlib
https://tonysyu.github.io/raw_content/matplotlib-style-gallery/gallery.html
https://www.digitalocean.com/community/tutorials?q=python&hits_per_page=12
Python PDFS
30 Python Libraries to Boost Your Data Science Productivity.pdf
Python Data Science Tips Full Archive by Avi Chawla (substack.com)
Alvin's Answer for Plano's Assessment.pdf
How to learn Python like a Pro.pdf
https://www.hospitalmanagementasia.com/tech-innovation/harnessing-the-power-of-data-in-healthcare/ (Sean Singhealth)
https://developers.googleblog.com/en/data-science-agent-in-colab-with-gemini/
Python Cheatsheets
Datacamp - Importing_Data_Cheat_Sheet_ Python.pdf
Datacamp - Working_With_Text_Data_in_Python.pdf
Datacamp - Working_with_Dates_and_Times_Cheat_Sheet_ Python.pdf
Datacamp - Seaborn_Cheat_Sheet.pdf
Datacamp - Reshaping_data_with_Python.pdf
Datacamp - Python_Cheat_Sheet.pdf
Datacamp - Python_Basics_Cheat_Sheet.pdf
Datacamp - Pandas_Cheat_Sheet.pdf
Datacamp - Numpy_Cheat_Sheet.pdf
Datacamp - Matplotlib_Cheat_Sheet.pdf
Datacamp - Data Wrangling Cheat Sheet PYthon.pdf
Datacamp - Regular_Expressions_Cheat_Sheet.pdf
why python index start at 0.pdf
Data_Science_With_Python_Workflow by Business Science.io.pdf
Plotting
PANDAS
[Import.Export.CSV.Dataframe.Slicing.Filtering.DropNA] + [Join.Append.Merge.Pivot.Groupby] + [Box.Scatter.Pie.Area.Histogram.Plots] + [Correlation.DateTime.Time.Series.Plots] + [Pipe.Apply]
How to Join Append Concat Two Tables with Python by Dr Alvin Ang.ipynb
Restructuring CSV for Data Science.pdf
Statistics
[Descriptive.Stats] + [Seaborn.Visualization] + [Hypothesis.Testing.ANOVA] + [LR.MR.R2]
Day 4 with Dr Alvin.ipynb (Statistics)
Python Visualization
Data Visualization with Python by Dr Alvin Ang.ipynb
Top 7 Python Libraries for Data Visualization.pdf
Practical Guide to Matplotlib.pdf
Python Specials
DataPrep + MissingNo by Dr Alvin Ang.ipynb
Ways to Display Json Formats Neatly in Python by Dr Alvin Ang.ipynb
How to Create Random Data with Python by Dr Alvin Ang.ipynb
https://github.com/xiaohk/stickyland
https://www.kdnuggets.com/2022/12/top-5-nlp-cheat-sheets-beginners-professional.html
https://docs.python.org/3/library/functions.html
https://docs.python.org/3/library/stdtypes.html
https://docs.python.org/3/index.html
https://docs.python.org/3/library/index.html
https://fortune.com/education/articles/using-python-for-data-science/
Data Cleansing and Wrangling
Steps in Data Wrangling and Cleansing
Converting JSON to CSV from Scikit Learn Datasets by Dr. Alvin Ang.ipynb
How to Do Train Test Splits with Python by Dr Alvin Ang.ipynb
Data Wrangling a Population of Countries’ Dataset
Population of Countries in 2000.csv
Data Wrangling a Population of Countries Dataset by Dr Alvin Ang.ipynb
Data Wrangling & Visualizing Healthcare Datasets
Hospital Admissions
hospital-admissions-by-sector-annual.csv
Data Cleansing a Hospital Admissions Dataset by Dr Alvin Ang.ipynb
Health Expenditure
government-health-expenditure.csv
Data Cleansing a Government Health Expenditure Dataset.ipynb
Long Term Care Facilities
number-of-residential-long-term-care-facilities-sector-breakdown.csv
Data Wrangling a Long Term Care Facilities Dataset by Dr Alvin Ang.ipynb
Data Cleansing a Rock Song Dataset
Data Cleansing a Rock Song Dataset with Python by Dr. Alvin Ang.ipynb
Data Wrangling Air Quality Datasets
Data Wrangling Air Quality Datasets with Python by Dr Alvin Ang.ipynb
Searching and Slicing a Video Games Dataset
Searching and Slicing a Video Games Dataset with Python by Dr Alvin Ang.ipynb
Wrangling Automobile Datasets
Slicing & Dicing a Motorcars Dataset (European + Japanese Cars)
Slicing & Dicing a Motorcars Dataset with Python by Dr Alvin Ang.ipynb
Factors Affecting Price of European and Japanese Cars
Feature Selection on Automobile Dataset with Python by Dr. Alvin Ang.ipynb
Dealing with Missing Data in European Cars Dataset
Data Cleansing a European Automobile Dataset with Python by Dr Alvin Ang.ipynb
Cross Validating a European and Japanese Cars Dataset
Cross Validating a European and Japanese Car Dataset by Dr Alvin Ang.ipynb
Python for Finance
LendinClubLoan(22k rows - very dirty).csv
Lending Club Loan Data Dictionary.xls
https://www.kaggle.com/datasets/wordsforthewise/lending-club
https://drive.google.com/file/d/1VCaoIFxzpYgzCaIerj24EWx-wW87C440/view?usp=drive_link (alvin’s google drive FULL LendingClubLoan Dataset)
Feature Selection on Lending Club Loan Dataset by Dr Alvin Ang.ipynb
Data Cleansing the Lending Club Loan Dataset by Dr Alvin Ang.ipynb
Train Test Splitting the Lending Club Loan Dataset by Dr Alvin Ang.ipynb
Imploring YFinance by Dr Alvin Ang.ipynb
https://www.quantifisolutions.com/
Hypothesis Testing and ANOVA with Python
Hypothesis Testing and ANOVA with Python by Dr Alvin Ang.ipynb
Machine Learning Training
Confusion Matrix
Linear and Logistic Regression
https://mlu-explain.github.io/
ML guide with Code by Shivam Modi.pdf
ML Life Cycle by Shivam Modi.pdf
Quick Machine Learning in Python.pdf
ML DL AI Cheat sheet by NIKHIL YADAV.pdf
ML Cheatsheet.pdf
Machine Learning Infographics Cheatsheet.pdf
ML Cheat sheet by Business Science.io.pdf
the little book of deep learning
AI for Everyone notes by Andrew Ng
How to Load the Iris Dataset into Python by Dr. Alvin Ang.ipynb
Various Places to Get Datasets for Machine Learning by Dr Alvin Ang.ipynb
Various Ways of Train Test Splits with Python by Dr Alvin Ang.ipynb
https://machinelearningprojects.net/
https://thecleverprogrammer.com/2020/11/15/machine-learning-projects/
30 Python Libraries to Boost Your Data Science Productivity.pdf
https://terencelucasyap.com/predicting-singapore-pools-4d-lottery-winning-numbers-machine-learning/
Scalable Efficient Big Data Pipeline Architecture – Machine Learning for Developers
MLOps
MLOps for Dummies Databricks.pdf
https://www.dailydoseofds.com/mlops-crash-course-part-1/
Unsupervised Learning
Clustering
Overview of Clustering Methods.pdf
KMeans_using_Python_by_Dr_Alvin.ipynb
Hierarchical_Clustering_using_Python.ipynb
Clustering Cheatsheet by Business Science.io.pdf
PCA
Train Test Split
Scaling
Supervised Learning
Linear / Multiple / Polynomial Regression
Simple Linear Regression with Statsmodel by Dr Alvin Ang.ipynb
Simple Linear Regression using SKLearn by Dr Alvin Ang.ipynb
Multiple Regression using Scikit Learn with Python by Dr Alvin Ang.ipynb (Advertising.csv)
Multiple Regression using Scikit Learn with Python (Part II) by Dr Alvin Ang.ipynb (AutomobileEDA.csv)
Polynomial Regression with Python by Dr Alvin Ang.ipynb
Support Vector Machine (SVM)
Understanding SVM using Python by Dr Alvin Ang.ipynb
Simple SVM Applied to Iris Dataset with Python by Dr Alvin Ang.ipynb
Grid, Random and Bayes Search - Hyperparameter Tuning on SVM with Python by Dr Alvin Ang.ipynb
Decision Tree / Random Forest
Decision Tree (Classification) on the Iris Flower Dataset using Python by Dr Alvin Ang.ipynb
Random Forest (Classification) on the Iris Flower Dataset using Python by Dr Alvin Ang.ipynb
Metrics, Normalization and Regularizations
Classification Metrics for ML Models by Dr Alvin Ang.ipynb
Bias / Variance
Understanding Bias vs Variance in Python by Dr. Alvin Ang.ipynb
L1 and L2 Regularization
L1 Lasso and L2 Ridge and Elastic Net Regression using Python by Dr Alvin Ang.ipynb
MinMax and Standard Scaler
Decision Tree (Classification) on the Iris Flower Dataset using Python by Dr Alvin Ang.ipynb
Datacamp ML Cheatsheets
ML for Trading
Steps to Teach ML for Trading.txt
Steps for ML Trading by Dr. Alvin Ang
Facilitator Guide for Machine Learning 101 for Financial Trading.ipynb
Learning TA-Lib in Python by Dr. Alvin Ang.ipynb
How to Plot Candlestick Chart using Plotly by Dr. Alvin Ang.ipynb
pandas_ta full list of technical indicators as of 2024
MQL5 Programming for Traders.pdf
CFTE
https://algoventure.first-4.com/ai-algo-3267-2099-9273
https://www.ntuclearninghub.com/en-gb/-/course/algorithmic-trading-essentials
https://eoddata.com/stocklist/SGX.htm (ticker symbol)
https://algorithmictrading.substack.com/
https://www.priceactionlab.com/Blog/price-action-lab-software/
https://wire.insiderfinance.io/
https://www.gurufocus.com/guru/warren%2Bbuffett/summary
https://www.benzinga.com/apis/
https://www.youtube.com/@Algovibes
https://greyhoundanalytics.com/
https://singaporeanstocksinvestor.blogspot.com/ (AK)
https://www.dymonasia.com/career/
https://www.tower-research.com/
https://eodhd.medium.com/trading-predictions-using-ai-and-python-cdaad4de3447
https://github.com/suparjotamin/stockie
https://medium.com/trading-data-analysis/metatrader5-python-trading-bot-230bd19285e9
R Training
Basic R Course
Topic 1 - Introduction to R Data Types.R
Topic 2 - R Datasets and Data IO by Dr Alvin.R
Topic 3 - R Data Visualization by Dr Alvin.R
Topic 4 - R Programming by Dr. Alvin.R
Topic 5 - R Statistics by Dr. Alvin.R
Tidyverse Package Course
Topic 1 - Tidyverse Data Cleansing.R
Topic 2 - Tidyverse Data Summary.R
Topic 3 - Tidyverse Statistics.R
Topic 4 - Qualitative Data Analysis (Text Mining).R
Topic 5 - GGPlot Data Visualization.R
Files
https://r4ds.had.co.nz/index.html
https://www.tidytextmining.com/
Datacamp R Cheatsheets
Datacamp - Working_With_Text_Data_in_R.pdf
Datacamp - Working_with_Dates_and_Time_in_R.pdf
Datacamp - Reshaping_data_with_tidyR_in_R.pdf
Datacamp - ggplot2_cheat_sheet.pdf
Datacamp - data table cheat sheet_R.pdf
Datacamp - Manipulating_Data_in_dplyr_Cheat_Sheet.pdf
Text Mining with R
Text Mining with R by Dr. Alvin Ang.R
Data Wrangling with R
Data Wrangling with Tidyverse by Dr Alvin Ang.R
Data Wrangling with Core R by Dr Alvin Ang.R
Data Visualization with R
Data Visualisation with BASIC R by Dr. Alvin Ang.R
Data Visualisation with GGPLOT R by Dr. Alvin Ang.R
Regression with R
Simple Linear Regression using R by Dr. Alvin Ang.R
Multiple Regression using R by Dr Alvin Ang.R
Statistics with R
Statistics with Tidyverse by Dr Alvin Ang.R
R Sites
https://togaware.com/projects/rattle/index.html
https://universeofdatascience.com/
https://www.r-bloggers.com/2022/06/the-most-overlooked-r-package-that-can-get-you-through-a-data-science-job-interview/
https://online.stat.psu.edu/statprogram/tutorials/statistical-software/r
https://biostat.app.vumc.org/wiki/Main/RS
https://tuos-bio-data-skills.github.io/intro-stats-book/
https://cran.r-project.org/web/packages/available_packages_by_name.html
https://posit.co/resources/cheatsheets/
https://yihui.shinyapps.io/formatR/
https://www.kaggle.com/code/rtatman/data-cleaning-challenge-json-txt-and-xls
https://education.rstudio.com/learn/
https://www.business-science.io/finance/2020/02/26/r-for-excel-users.html
https://www.rdocumentation.org/
ML with R
https://lgatto.github.io/IntroMachineLearningWithR/index.html
https://matthewrenze.com/workshops/practical-machine-learning-with-r/
Tableau Training
1st Project
Extras
2nd Project
Tableau Desktop Specialist
Profile Links
https://public.tableau.com/app/profile/dr.alvin.ang
https://public.tableau.com/app/profile/hisan.shafaque/viz/1996-97ChicagoBullsBuckets/BullsBuckets
https://public.tableau.com/app/profile/pawan.sachdeva/viz/HowHappyAreWe_15891520173890/HowHappyAreWe
https://public.tableau.com/app/profile/janapati.balaji
Storytelling
Change Over Time:
https://public.tableau.com/app/profile/ben.jones/viz/WorldPopulationDay/1_ChangeOverTime
https://public.tableau.com/app/profile/andy.kriebel/viz/EPLInjuries/InjuryCrisis
Drill Down:
https://public.tableau.com/views/EarthquakesOnTheRise-Full/Earthquakestory
https://public.tableau.com/app/profile/mac.bryla/viz/TellmeaboutWill/TellmeaboutWill
Zoom Out:
https://public.tableau.com/app/profile/halftimeheroes/viz/OlympicGamesStories-ZoomOut/ZoomOut
Contrast:
https://public.tableau.com/app/profile/robertrouse/viz/Pyramids_1/EgyptianPyramids
Outliers
Tableau Data Analyst
Maps
https://public.tableau.com/s/sites/default/files/media/co2_emissions_by_london_borough.zip
https://community.tableau.com/s/question/0D54T00000C6TDLSA3/singapore-map-not-displayed
https://data.gov.sg/search?res_format=SHP
https://www.tableau.com/solutions/customer/jal-expanding-the-use-of-tableau-throughout-the-company
https://www.tableau.com/visualization/what-is-geospatial-visualization
Others
Tableau Introduction by Princeton University.pdf
https://tableaupublic.princeton.edu/#/signin
https://www.thetableaustudentguide.com/
Tableau Official Help
https://public.tableau.com/app/profile/tableau.docs.team/vizzes
https://www.tableau.com/about/blog/LOD-expressions
https://public.tableau.com/en-us/s/resources
https://tableaupracticetest.com/#get-our-most-popular-downloads
Tableau whitepaper - visual analysis guidebook.pdf
Which Chart is Right for You by Tableau.pdf
3 blind men and an elephant by Tableau.pdf
3 Steps to make your Data Clearer by Tableau.pdf
4 Traits of Data Driven Financial Services Organization by Tableau.pdf
5 Best Practices for Creating Effective Dashboards by Tableau.pdf
5 Steps to Scalable Self-Service Analytics by Tableau.pdf
5 Things Your Spreadsheet Can't Do by Tableau.pdf
5 ways to maximise your salesforce data by Tableau.pdf
7 Best Practices for Mobile Business Intelligence A Whitepaper by Tableau.pdf
7 Tips for Success with Big Data in 2013 by Tableau.pdf
8 ways Universities are Making Impact with Data by Tableau.pdf
10 Essential Dashboards Every Retailer Should Use.pdf
Advanced Analytics with Tableau.pdf
Best Practices for Tableau Online.pdf
Big Data Trends Insights and Strategies from Tableau.pdf
Building Effective Dashboards by Tableau.pdf
Define Analytics by Tableau.pdf
Developing a Governed Self Serive BI Strategy by Tableau.pdf
Embedded Analytics by Tableau.pdf
Evaluations Guide How to Choose the Right MOdern BI platform by Tableau.pdf
Good Enough to Great by Tableau.pdf
how Wayfair and Pepsi use Visual Analysis to Dirve Business Results by Tableau.pdf
Making Flow Happen by Tableau.pdf
Modern Manufacturing 4 ways data is transforming by Tableau.pdf
Must Dos of Marketing Dashboards by Tableau.pdf
Redefining the Role of IT in a Modern BI World.pdf
Sales Force Attitudes towards Forecasting by Tableau.pdf
Sales Performance Report by Tableau.pdf
Solving the Internet of Things Last Mile Problem by Tableau.pdf
The Evolution of Data Storytelling by Tableau.pdf
The Marketing Analytics Evolution by Tableau.pdf
Top 5 Retail Trends for 2018 by Tableau.pdf
Top 7 Business Intelligence Trends for Government in 2017.pdf
Understanding LOD by Tableau.pdf
Visual Analysis for Everyone by Tableau.pdf
Which Chart or Graph is right for you by Tableau.pdf
SQL Training
https://www.dbta.com/Columns/SQL-Server-Drill-Down/
https://blog.devops.dev/sql-analysis-of-netflix-dataset-808e870e5bd6
https://github.com/pawelsalawa/sqlitestudio/releases
For those with problems installing:
https://alpha.sqliteviewer.app/
https://www.draxlr.com/tools/sql-formatter/
Oracle SQL
https://www.oracle.com/database/technologies/xe-downloads.html
https://www.oracle.com/database/sqldeveloper/
How to Speed Up SQL Queries.pdf
How to use SQL to Track User Retention.pdf
Power BI Training
https://www.bernama.com/tv/news.php?id=2408564 (Donald Trump Tariffs)
https://workout-wednesday.com/power-bi-challenges/
https://training.foresightbi.com.ng/courses/power-bi-developer-internship
The 10 Commandments of Good Graphics by Steve Figard.pdf
Gartner MAGIC QUADRANT 2022 FOR BI TOOLS
Datacamp Power BI Cheatsheets
Datacamp - Formulas_in_DAX_Cheat_Sheet.pdf
Datacamp - Power+BI_Cheat+Sheet.pdf
how to tell data storytelling.pdf
how to build CEO dashboard.pdf
Resource Management Optimisation Training
Statistics Training
Degree of Freedom
Statistics Cheatsheets
15 Data Fallacies
Excel Training
Power Query / Power Pivot
Data Analytics with Excel Course Activities
Data Analytics with Excel Course
Excel Dashboard Template
Others
Design of Experiments (DOE)
Flexsim
WEKA
Google Workspace
Looker Studio
Tech with Tim's Facial Recognition Project (on local laptop)
Data Quality Training
Bodies
https://www.dama.org/cpages/home
https://booksite.elsevier.com/9780123743695/10steps_DataCategories.pdf
Open Refine
https://librarycarpentry.github.io/lc-open-refine/aio.html
https://datacarpentry.github.io/OpenRefine-ecology-lesson/aio.html
https://web.archive.org/web/20190105063215/http://enipedia.tudelft.nl/wiki/OpenRefine_Tutorial
Tools
https://bestofbi.com/products/sql-power-dqguru-data-quality/
https://www.dataqualitypro.com/
https://datacleaner.github.io/
https://datascienceatthecommandline.com/
https://www.datafix.com.au/cookbook/index.html
Datasets
Checklist for data quality.xls
Blogs
https://atlan.com/data-governance-framework/?ref=/open-source-data-governance-tools/
https://atlan.com/data-governance-framework/
https://www.edq.com/blog/data-quality-vs-data-governance/
https://atlan.com/data-mesh-principles/
https://atlan.com/master-data-management-vs-metadata-management/
https://atlan.com/improve-data-quality/
https://atlan.com/data-quality-metrics/
https://www.gartner.com/smarterwithgartner/how-to-improve-your-data-quality
https://tdan.com/category/data-topics/data-quality-articles-blogs-education
Singapore Public Service
Statistical Best Practices by Singstat (2020).pdf
Improve Data Quality by Singstat (2020).pdf
STB Data Governance Playbook.pdf
Slides and Whitepapers
EY - Becoming an Analytics Organization.pdf
Creating an Enterprise Data Strategy - Beye Network.pdf
A Definitive Guide to Data Governance - Trillium Software.pdf
TADA Data Quality Concepts.pdf
Introduction to Data Governance and Stewardship - Salesforce.pdf
5 Levels of Master Data Management Maturity - Baseline Consulting.pdf
Dataprep - Acclerate Data for AI.pdf
ISO Data Quality 8000-1 (partial).pdf
ISO-8000-61-2016 (partial).pdf
A Product Perspective on Total Data Quality Management - Richard Wang.pdf
Datacamp Data Quality Cheatsheets
Datacamp - Cheat Sheet DS For Business Leaders.pdf
Datacamp - Data Quality Dimensions.pdf
Splunk Tutorial
UCMHP
Sample OIP Projects
Intelligent Real-time Medicine Inventory Management Solution
How might we develop a centralized inventory solution that provides real-time intelligence to optimize the demand and supply for medicinal drugs and reduce the wastage and cost inefficiencies for dispensaries?
Award: SGD 30,000
Understanding Customer Order Behaviour for More Efficient Supply Chain
How can we better understand our customers’ orders to fulfil their requests in a timely manner to improve the customer experience, while ensuring efficiency in supply chain planning and inventory management?
Award: SGD 25,000
Tracking Solution for Liquid-based Stock
How can we help our customers track and manage their liquid-based stock on a real-time basis?
Award: SGD 10,000
Dynamic Digital Floor Plans, Time Schedules and Route Optimization for Exhibition Spaces
How might we better plan and execute the movement of exhibits into and out of diverse and dynamic event sites with greater efficiency?
Award: SGD 35,000
Data Analytics To Identify Opportunities For Developing Better Financial And Non-Financial Offerings
How might we better understand our customers’ needs, so that we can develop financial and non-financial offerings (e.g. products, insights, education materials, etc.) that better meet their current and emerging needs and better engage investors in the customer buying journey?
Award: SGD 50,000
Real-Time Data Visualization and Predictive Analytics
How might we create a platform, drawing on our database of real-time receipts data, that not only visualises past shopper behaviour but also provides predictive, actionable shopper insights, enabling brand owners to reach and engage targeted shoppers in a timely and effective way?
Award: SGD 20,000
Elevated Customer Experience Through Integrated Insights From Digital Solutions
How can we elevate our customers’ digital experience in a seamless and personalised manner by providing valuable integrated insights across Carrier’s suite of digital solutions?
Award: SGD 25,000
Collaborative Platform For Mall Tenants To Provide Complementary Services That Delight Mall Goers
How might we create a digital collaboration tool, powered by collective data insights, for shopping mall tenants to collaborate among themselves, so as to provide complementary products and services for the mall-goers?
Award: SGD 30,000
Predictive Recommendation Engine To Match Business Event Attendees With The Right People And The Right Content
How might we create a predictive engine that more accurately matches professional event attendees to networks and content during and after the event?
Award: SGD 35,000
Visualization Tool that Provides Data Driven Insights For Better Event Evaluation
How might we create a visualisation tool to process visitor and other event-related data so as to empower event organisers and owners to better evaluate event success and ROI?
Award: SGD 35,000
Smart Customer Relationship Management System
How might we automate the ordering process to be more efficient and effective, and achieve a better customer experience?
Award: SGD 35,000
Understanding Customer Order Behaviour For Better Customer Experience
How might we gain insights into customers’ ordering behaviour to facilitate improved order management and a better customer experience?
Award: SGD 25,000
A Data Analytic Collaborative Platform for Insurance Ecosystem Partners and Clients
How might we create a shared data platform with our ecosystem of partners and clients that can support innovative health and life insurance services?
Award: SGD 20,000
Understanding Users' Behaviour Through Data Analytics And Transaction Tracking Methods Within Mobile App
How might we provide an integrated mobile platform that can enable, track and manage communications and transactions between app users (home residents) and ResidentServices™ Providers (RSPs) and effectively connecting and engaging them, so that we can turn these data into valuable insights, predictions and decisions?
Award: SGD 20,000
Healthcare Oncology Patient Experience Portal
How might we create a digital platform to empower cancer patients and their families along their treatment and recovery journey with external parties and treatment information, so as to make better and sound decisions and stay connected with key communities?
Award: SGD 25,000
Analytics and Insights to Improve Efficiency of Internal Business & Operational Processes
How might we improve our internal processes to derive relevant insights, particularly through utilising our data, in order to operate more effectively and efficiently?
Award: SGD 20,000
Enhance Maintainability of Pre-Fabrication Facility
How might we intelligently manage and perform preventive maintenance of equipment and machinery in our construction and pre-fabrication facility?
Award: SGD 30,000
Automation Of Data Extraction And Verification For Contact List Building
How might we automate the collection and verification of potential customer contact lists, to increase accuracy and save time for higher value work?
Award: SGD 20,000
Asset Management & Investment Recommendation Solution for Portfolio Managers and Analysts
How might we create a solution that provides robust investment models that take into account the huge amounts of data available for analysis, in order to come up with well-timed, actionable investment recommendations that result in market outperformance after all possible slippage costs are considered?
Award: SGD 20,000
Intelligent Dynamic Scheduling & Placement System to Facilitate Grave Exhumation & Reinterment
How might we create an automated grave exhumation scheduling and reinterment placement process that is dynamic yet able to meet time-specific needs?
Award: SGD 35,000
Integrated Procurement Chain System for Optimisation of F&B Ecosystem
How might we provide an integrated procurement chain system that can seamlessly connect F&B operators’ front-of-house to their suppliers’ warehousing and logistics needs, to enable smarter purchasing decisions?
Award: SGD 25,000
Interesting Stuff
Useful Sites
Learning
Whether you're prepping for interviews or just want to level up, these are perfect for you.
1. API Fundamentals: https://lnkd.in/e8eMet_k
2. API Simplified: https://lnkd.in/er9JiGxw
3. API Methods: https://lnkd.in/ey9v7-hU
4. API Terminologies: https://lnkd.in/eRsPMzpd
5. API Authentication: https://lnkd.in/eNPfpAdE
6. API Status Codes: https://lnkd.in/egXizUrS
7. REST API vs GraphQL: https://lnkd.in/eZHREdgC
8. API Integration: https://lnkd.in/eDASPP5m
9. API Integration in Detail: https://lnkd.in/eZwFVrH7
10. API Testing: https://lnkd.in/emgmWJqH
11. API with Python: https://lnkd.in/eM23ah2y
12. API Scaling: https://lnkd.in/e3mZSvmn
13. Developing Robust APIs: https://lnkd.in/eBXzbFyg
14. APIs with Postman: https://lnkd.in/ezue3d4B
15. Testing APIs with Postman: https://lnkd.in/eCPnGTGi
16. API Security: https://lnkd.in/e79ZYfPa
17. APIs for Everyone: https://lnkd.in/e4WGDffA
https://datacarpentry.org/lessons/
Real Life Analysts / Scientists
Others
https://thenewstack.io/why-are-so-many-developers-hating-on-object-oriented-programming/
https://medium.com/machine-words/the-rise-and-fall-of-object-oriented-programming-d67078f970e2
https://iianalytics.com/community/blog/the-age-of-agile-must-end
https://statguyuser.github.io/feature-engg-selection-for-explainable-models.github.io/
https://medium.com/mlearning-ai/this-study-shows-how-ai-can-change-your-job-f55d88256e1
https://waitbutwhy.com/2015/01/artificial-intelligence-revolution-1.html
https://unicsoft.com/blog/building-an-ai-proof-of-concept-benefits-and-steps/
https://oa.mg/journals/open-access-computer-science-journals
AI for MUSIC
Warehousing & Inventory Management
“Crazy Alvin with so many Famous People!”
Aquaman
Arnold
Bill Gates
Dr. Alvina 1
donald trump
elon 2
harrison ford
iron man
jensen huang
Jesus 3
joe biden
larry ellison
mahathir
michael jackson
nadella
piyush gupta
sean connery
superman
tiger woods
warren buffett
xi jinping
Andy Lau
Batman
Brad Pitt
Dr. Alvina 2
arnaldvin
george washington
hinton
jack ma
jensen 2
Jesus 4
john kennedy
larry page
mao ze dong
modi
najib
pope
steve jobs
sylvester stallone
tim cook
wonder woman
Andrew Ng
Aaron Kwok
Bruce Willis
datafrens 1
datafrens 3
einstein
gandhi
hulk
jackie chan
Jesus 1
Jesus 5
keanu reeves
lawrence wong
mark zuckerberg
monalvin
obama
steven spielberg
the rock
tom cruise
yann lecun
Anwar
Bill Clinton
captain america
datafrens 2
deng xiaoping
elon 1
godzilla
i show speed
jeff bezos
Jesus 2
Jesus
kim jong un
lee kuan yew
michael jordan
morgan freeman
nicole kidman
putin
sundar pichai
thor
ultraman
zelensky