- Tips & Tricks of Deploying Deep Learning Webapp on Heroku Cloud - Dec 24, 2021.
Learn model deployment issues and solutions on deploying a TensorFlow-based image classifier Streamlit app on a Heroku server.
Applications, Docker, DVC, GitHub, Heroku, Streamlit, TensorFlow
- 25 Github Repositories Every Python Developer Should Know - Nov 12, 2021.
Check out these repositories to help you improve your data science skills.
GitHub, Programming, Python
- 11 Most Practical Data Science Skills for 2022 - Oct 19, 2021.
While the field of data science continues to evolve with exciting new progress in analytical approaches and machine learning, there remains a core set of skills that are foundational for all general practitioners and specialists, especially those who want to be employable with full-stack capabilities.
Career Advice, Data Science Skills, Explainable AI, Feature Engineering, GitHub, NLP, Regression, SQL
- How to Ace Data Science Interview by Working on Portfolio Projects - Oct 13, 2021.
Recruiters of Data Science professionals around the world focus on portfolio projects rather than resumes and LinkedIn profiles. So, learning early how to contribute and share your work on GitHub, Deepnote, and Kaggle can help you perform your best during data science interviews.
Data Science, GitHub, Interview Questions, Kaggle, Portfolio, Project
- 8 Must-Have Git Commands for Data Scientists - Oct 8, 2021.
Git is a must-have skill for data scientists. Maintaining your development work within a version control system is absolutely necessary to have a collaborative and productive working environment with your colleagues. This guide will quickly start you off in the right direction for contributing to an existing project at your organization.
Advice, Data Scientist, GitHub
- GitHub Desktop for Data Scientists - Sep 29, 2021.
Less scary than version control in the command line.
Data Science, Data Scientist, GitHub, Version Control
- GitHub Copilot and the Rise of AI Language Models in Programming Automation - Sep 22, 2021.
Read on to learn more about what makes Copilot different from previous autocomplete tools (including TabNine), and why this particular tool has been generating so much controversy.
AI, Automation, GitHub, NLP, Programming
- Adventures in MLOps with Github Actions, Iterative.ai, Label Studio and NBDEV - Sep 16, 2021.
This article documents the authors' experience building their custom MLOps approach.
GitHub, Machine Learning, MLOps, Pipeline, Python, Workflow
- The Machine & Deep Learning Compendium Open Book - Sep 16, 2021.
After years in the making, this extensive and comprehensive ebook resource is now available and open for data scientists and ML engineers. Learn from and contribute to this tome of valuable information to support all your work in data science from engineering to strategy to management.
Deep Learning, ebook, GitHub, Machine Learning, Open Source
- 3 Data Acquisition, Annotation, and Augmentation Tools - Aug 27, 2021.
Check out these 3 projects found around GitHub that can help with your data acquisition, annotation, and augmentation tasks.
Computer Vision, Data Annotation, Data Labeling, Datasets, GitHub, NLP, Synthetic Data
- GitHub Copilot: Your AI pair programmer – what is all the fuss about? - Jul 5, 2021.
GitHub just released Copilot, a code completion tool on steroids dubbed your "AI pair programmer." Read more about it, and see what all the fuss is about.
AI, Generative Models, GitHub, NLP, Programming
- 5 Data Science Open-source Projects You Should Consider Contributing to - Jun 7, 2021.
As you prepare to interview for a position in data science or are looking to jump to the next level, now is the time to enhance your skills and your resume by working on real, open-source projects. Here, we suggest a great selection of projects you can contribute to and help build something awesome, so all you need to do is choose one and tackle it head-on.
Caffe, Data Science, Data Science Skills, GitHub, Google, Machine Learning, Open Source
- How to organize your data science project in 2021 - Apr 19, 2021.
Maintaining proper organization of all your data science projects will increase your productivity, minimize errors, and increase your development efficiency. This tutorial will guide you through a framework on how to keep everything in order on your local machine and in the cloud.
Advice, Data Science, GitHub, Project
- Going Beyond the Repo: GitHub for Career Growth in AI & Machine Learning - Jan 21, 2021.
Many online tools and platforms exist to help you establish a clear and persuasive online profile for potential employers to review. Have you considered how your go-to online code repository could also help you land your next job?
AI, Career Advice, GitHub, Machine Learning
- Build a Data Science Portfolio that Stands Out Using These Platforms - Jan 19, 2021.
Making your big break into the data science profession means standing out to potential employers in a crowd of tough competition. An important way to showcase your skills and experience is through the presentation of a portfolio. Following these recommendations for developing your portfolio will help you network effectively and stay on top of an ever-changing field.
Career Advice, Data Science, GitHub, Kaggle, LinkedIn, Portfolio
- 5 Most Useful Machine Learning Tools every lazy full-stack data scientist should use - Nov 18, 2020.
If you consider yourself a Data Scientist who can take any project from data curation to solution deployment, then you know there are many tools available today to help you get the job done. The trouble is that there are too many choices. Here is a review of five sets of tools that should turn you into the most efficient full-stack data scientist possible.
Data Science Tools, Data Scientist, GitHub, Heroku, Machine Learning, Postgres, PyCharm, PyTorch, scikit-learn, Streamlit
- Learn to build an end to end data science project - Nov 11, 2020.
Appreciating the process you must work through for any Data Science project is valuable before you land your first job in this field. With a well-honed strategy, such as the one outlined in this example project, you will remain productive and consistently deliver valuable machine learning models.
Data Preparation, Data Science, GitHub, Portfolio, Python, Regression, Salary
- 6 Lessons Learned in 6 Months as a Data Scientist - Oct 8, 2020.
When transitioning into a Data Science career, a new mindset toward collaboration, data, and reporting is required. Learn from these recommendations on approaches you should consider to successfully develop into your dream job.
arXiv, Business, Career Advice, Data Scientist, Deployment, GitHub, Podcast
- 4 Tools to Speed Up Your Data Science Writing - Sep 9, 2020.
This article covers how you can achieve your writing goals with these 4 tools.
Advice, Data Science, GitHub, Jupyter
- Modern Data Science Skills: 8 Categories, Core Skills, and Hot Skills - Sep 8, 2020.
We analyze the results of the Data Science Skills poll, including 8 categories of skills, 13 core skills that over 50% of respondents have, the emerging/hot skills that data scientists want to learn, and the top skill that data scientists want to learn.
Communication, Data Preparation, Data Science Skills, Data Visualization, Excel, GitHub, Mathematics, Poll, Python, Reinforcement Learning, scikit-learn, SQL, Statistics
- GitHub is the Best AutoML You Will Ever Need - Aug 12, 2020.
This article uses PyCaret 2.0, an open source, low-code machine learning library in Python to develop a simple AutoML solution and deploy it as a Docker container using GitHub actions.
Automated Machine Learning, AutoML, GitHub, PyCaret, Python
- A Complete guide to Google Colab for Deep Learning - Jun 16, 2020.
Google Colab is a widely popular cloud service for machine learning that features free access to GPU and TPU computing. Follow this detailed guide to help you get up and running fast to develop your next deep learning algorithms with Colab.
Deep Learning, GitHub, Google Colab, GPU, Jupyter
- Interactive Machine Learning Experiments - May 26, 2020.
Dive into experimenting with machine learning techniques using this open-source collection of interactive demos built on multilayer perceptrons, convolutional neural networks, and recurrent neural networks. Each package consists of ready-to-try web browser interfaces and fully-developed notebooks for you to fine tune the training for better performance.
Convolutional Neural Networks, GitHub, Image Recognition, Jupyter, Machine Learning, Recurrent Neural Networks, Tutorials
- Made With ML: Discover, build, and showcase machine learning projects - Mar 23, 2020.
This is a short introduction to Made With ML, a useful resource for machine learning engineers looking to get ideas for projects to build, and for those looking to share innovative portfolio projects once built.
GitHub, Kaggle, Machine Learning, Research
- The Most Useful Machine Learning Tools of 2020 - Mar 13, 2020.
This article outlines 5 sets of tools every lazy full-stack data scientist should use.
Applications, GitHub, Machine Learning, Postgres, PyCharm, Tools
- Top 5 must-have Data Science skills for 2020 - Jan 8, 2020.
The standard job description for a Data Scientist has long highlighted skills in R, Python, SQL, and Machine Learning. With the field evolving, these core competencies are no longer enough to stay competitive in the job market.
2020 Predictions, Agile, Cloud Computing, Data Science Skills, Deep Learning, Deployment, GitHub, NLP
- GitHub Repo Raider and the Automation of Machine Learning - Nov 18, 2019.
Since X never, ever marks the spot, this article raids the GitHub repos in search of quality automated machine learning resources. Read on for projects and papers to help understand and implement AutoML.
Automated Machine Learning, GitHub, Machine Learning, Movies, Python
- Automatic Version Control for Data Scientists - Sep 24, 2019.
How can you keep your machine learning models and data organized so you can collaborate effectively? Discover this new tool set available for better version control designed for the data scientist workflow.
Data Scientist, GitHub, Jupyter, Version Control
- Top 10 Statistics Mistakes Made by Data Scientists - Jun 7, 2019.
The following are some of the most common statistics mistakes made by data scientists. Check this list often to make sure you are not making any of these while applying statistics to data science.
Data Science, Data Scientist, GitHub, Mistakes, Statistics
- PyViz: Simplifying the Data Visualisation Process in Python - Jun 6, 2019.
There are Python libraries suitable for basic data visualizations but not for complicated ones, and there are libraries suitable only for complex visualizations. Is there a single library that handles both these tasks efficiently? The answer is yes. It's PyViz.
Data Visualization, GitHub, Matplotlib, Python
- How to Automate Tasks on GitHub With Machine Learning for Fun and Profit - May 3, 2019.
Check out this tutorial on how to build a GitHub App that predicts and applies issue labels using TensorFlow and public datasets.
Datasets, GitHub, Python, TensorFlow
- Trending Deep Learning Github Repositories - Feb 1, 2019.
Check out this pair of resources for trending and top GitHub deep learning repositories for some new ideas on what to look out for.
Deep Learning, GitHub, Trends
- Papers with Code: A Fantastic GitHub Resource for Machine Learning - Dec 31, 2018.
Looking for papers with code? If so, this GitHub repository, a clearinghouse for research papers and their corresponding implementation code, is definitely worth checking out.
GitHub, Machine Learning, Research
- Top 13 Python Deep Learning Libraries - Nov 2, 2018.
Part 2 of a new series investigating the top Python Libraries across Machine Learning, AI, Deep Learning and Data Science.
Caffe, Deep Learning, GitHub, MXNet, Python, PyTorch, TensorFlow, Theano
- Top 8 Python Machine Learning Libraries - Oct 9, 2018.
Part 1 of a new series investigating the top Python Libraries across Machine Learning, AI, Deep Learning and Data Science.
GitHub, Keras, Machine Learning, Python
- Visualising Geospatial data with Python using Folium - Sep 27, 2018.
Folium is a powerful data visualization library in Python that was built primarily to help people visualize geospatial data. With Folium, one can create a map of any location in the world if its latitude and longitude values are known. This guide will help you get started.
Data Visualization, Geospatial, GitHub, Python
- Journey to Machine Learning – 100 Days of ML Code - Sep 7, 2018.
A personal account from Machine Learning enthusiast Avik Jain on his experiences of #100DaysOfMLCode, a challenge that encourages beginners to code and study machine learning for at least an hour, every day for 100 days.
GitHub, K-nearest neighbors, Machine Learning, Python, SVM
- From Data to Viz: how to select the right chart for your data - Aug 1, 2018.
We offer an interactive, decision tree-style tool, which examines the data you have and proposes a set of potentially appropriate visualizations to represent your dataset.
Data, Data Visualization, ggplot2, GitHub, R, Tidyverse
- How To Create Natural Language Semantic Search For Arbitrary Objects With Deep Learning - Jun 13, 2018.
An end-to-end example of how to build a system that can search objects semantically.
Deep Learning, GitHub, Neural Networks, NLP, Semantic Analysis
- GANs in TensorFlow from the Command Line: Creating Your First GitHub Project - May 16, 2018.
In this article, I will present the steps to create your first GitHub project. I will use Generative Adversarial Networks as an example.
GANs, Generative Adversarial Network, GitHub, Neural Networks, Python, Rubens Zimbres, TensorFlow
- Jupyter Notebook for Beginners: A Tutorial - May 1, 2018.
The Jupyter Notebook is an incredibly powerful tool for interactively developing and presenting data science projects. Although it is possible to use many different programming languages within Jupyter Notebooks, this article will focus on Python as it is the most common use case.
Data Analysis, GitHub, Jupyter, Matplotlib, Python
- Top 16 Open Source Deep Learning Libraries and Platforms - Apr 24, 2018.
We bring to you the top 16 open source deep learning libraries and platforms. TensorFlow is out in front as the undisputed number one, with Keras and Caffe completing the top three.
Caffe, GitHub, Keras, Machine Learning, Open Source, TensorFlow
- How Do I Get My First Data Science Job? - Apr 2, 2018.
Here are the steps you need to obtain your first job in data science, including details on how to create a good portfolio, key networking tips, getting the right education and managing expectations.
Advice, Career, Data Science Education, Data Scientist, GitHub, Jobs, Kaggle
- Ranking Popular Distributed Computing Packages for Data Science - Mar 20, 2018.
We examined 140 frameworks and distributed programming packages and came up with a list of the top 20 distributed computing packages useful for Data Science, based on a combination of GitHub, Stack Overflow, and Google results.
Apache Spark, Data Science, Distributed Systems, GitHub, Hadoop
- Top 20 Python AI and Machine Learning Open Source Projects - Feb 20, 2018.
We update the top AI and Machine Learning projects in Python. TensorFlow has moved into first place with triple-digit growth in contributors. Scikit-learn dropped to 2nd place, but still has a very large base of contributors.
GitHub, Machine Learning, Open Source, Python, scikit-learn, TensorFlow
- Natural Language Processing Library for Apache Spark – free to use - Nov 28, 2017.
Introducing the Natural Language Processing Library for Apache Spark - and yes, you can actually use it for free! This post will give you a great overview of John Snow Labs NLP Library for Apache Spark.
Apache Spark, API, GitHub, John Snow Labs, Machine Learning, NLP
- Search Millions of Documents for Thousands of Keywords in a Flash - Sep 1, 2017.
We present a Python library called FlashText that can search or replace keywords / synonyms in documents in O(n) – linear time.
Algorithms, Data Science, GitHub, NLP, Python, Search, Search Engine, Text Mining
- Deep Learning Zero to One: 5 Awe-Inspiring Demos with Code for Beginners, part 2 - Jul 1, 2017.
Here are deep learning examples and demos you can just download and run, including Spotify Artist Search using Speech APIs, Symbolic AI Speech Recognition, and Algorithmia API Photo Colorizer.
AI, Algorithmia, Beginners, Clarifai, Deep Learning, GitHub, iOS, Speech Recognition, Spotify
- K-means Clustering with Tableau – Call Detail Records Example - Jun 16, 2017.
We show how to use the Tableau 10 clustering feature to create statistically-based segments that provide insights about similarities in different groups and performance of the groups when compared to each other.
Clustering, Data Analysis, GitHub, K-means, Tableau, Telecom
- How A Data Scientist Can Improve Productivity - May 25, 2017.
Data Science projects involve iterative processes and may need changes in data at every iteration. But data versioning, data pipelines, and data workflows make a data scientist’s life easier; let’s see how.
CRISP-DM, Data Scientist, Data Workflow, DVC, GitHub, Version Control
- DataScience.com Releases Python Package for Interpreting the Decision-Making Processes of Predictive Models - May 24, 2017.
DataScience.com's new Python library, Skater, uses a combination of model interpretation algorithms to identify how models leverage data to make predictions.
Datascience.com, GitHub, Interpretability, Python
- Data Version Control: iterative machine learning - May 11, 2017.
ML modeling is an iterative process, and it is extremely important to keep track of all the steps and dependencies between code and data. A new open-source tool helps you do that.
CRISP-DM, DVC, GitHub, Machine Learning, Open Source, Reproducibility, Version Control
- Machine Learning-driven Firewall - Feb 23, 2017.
Cyber security is always a hot topic in the IT industry, and machine learning is making security systems stronger. Here, a particular use case of machine learning in cyber security is explained in detail.
Firewall, Fsecurify, GitHub, Machine Learning, Security
- Top 20 Python Machine Learning Open Source Projects, updated - Nov 21, 2016.
Open source is at the heart of innovation and the rapid evolution of technologies these days. This article presents the Top 20 Python Machine Learning Open Source Projects of 2016, along with very interesting insights and trends found during the analysis.
GitHub, Machine Learning, Open Source, Python, scikit-learn
- Top 10 Open Dataset Resources on Github - May 31, 2016.
The top open dataset repositories on GitHub include a variety of data, freely available for use by researchers, practitioners, and students alike.
Datasets, GitHub, Machine Learning, Open Data
- A Data Science Approach to Writing a Good GitHub README - May 4, 2016.
The README is the first file every user will look for whenever they check out a code repository. Learn what you should write inside your README files, and analyze the effectiveness of your existing files.
Algorithmia, GitHub, Text Mining
- Top 10 IPython Notebook Tutorials for Data Science and Machine Learning - Apr 22, 2016.
A list of 10 useful GitHub repositories made up of IPython (Jupyter) notebooks, focused on teaching data science and machine learning. Python is the clear target here, but general principles are transferable.
Data Science, Deep Learning, GitHub, IPython, Machine Learning, Python, Sebastian Raschka, TensorFlow
- Top 10 Data Science Resources on Github - Mar 24, 2016.
The top 10 data science projects on GitHub are chiefly composed of a number of tutorials and educational resources for learning and doing data science. Have a look at the resources others are using and learning from.
Coursera, GitHub, IPython, Johns Hopkins, Open Source, Top 10
- Top 10 Data Visualization Projects on Github - Feb 22, 2016.
GitHub provides a number of open source data visualization options for data scientists and application developers integrating quality visuals. This is a list and description of the top project offerings available, based on the number of stars.
D3.js, Data Visualization, GitHub, Matthew Mayo, Open Source, Top 10
- Top 10 Deep Learning Projects on Github - Jan 13, 2016.
The top 10 deep learning projects on GitHub include a number of libraries, frameworks, and education resources. Have a look at the tools others are using, and the resources they are learning from.
Caffe, Deep Learning, GitHub, Open Source, Top 10, Tutorials
- Top 10 Machine Learning Projects on Github - Dec 14, 2015.
The top 10 machine learning projects on GitHub include a number of libraries, frameworks, and education resources. Have a look at the tools others are using, and the resources they are learning from.
GitHub, Machine Learning, Matthew Mayo, Open Source, scikit-learn, Top 10
- Top 20 Python Machine Learning Open Source Projects - Jun 1, 2015.
We examine the top Python machine learning open source projects on GitHub, both in terms of contributors and commits, and identify the most popular and most active ones.
GitHub, Machine Learning, Open Source, Python, scikit-learn
- Awesome Public Datasets on GitHub - Apr 6, 2015.
A long, categorized list of large datasets (available for public use) to try your analytics skills on. Which one would you pick?
Datasets, Finance, GitHub, Government, Machine Learning, NLP, Open Data, Time series data
- Employee Churn 201: Calculating Employee Value - Apr 4, 2014.
Much has been written about customer churn. This post examines employee churn, an equally important problem, and its unique dynamics.
Employee Churn, Employee Value, GitHub, Pasha Roberts, R, Talent Analytics