Techno Blender
Digitally Yours.
Browsing Tag: Edwin

Netflix First Trailer for DC Comics Show

Earlier in the year, it was reported that Warner Bros. Discovery had successfully sold its upcoming comic book show Dead Boy Detectives to Netflix. During this weekend’s Geeked Week event, the streamer showed off the first trailer for the show, and the young sleuth duo looks like they’ll be getting up to some pretty fun adventures. Based on characters created by Matt Wagner and Neil Gaiman, and now technically a spinoff of…

How big is the universe? Henrietta Leavitt led Edwin Hubble to a better, bigger answer.

In the early 1900s, the universe seemed to be a much, much smaller place. Back then, astronomers believed the Milky Way galaxy was all there was. They didn’t know there were billions of other galaxies; they didn’t know how small we really are. They didn’t know this because they couldn’t measure distances to far-flung stars. Why? There was a pretty simple problem in astronomy: A bright, faraway star looks almost the same as a dim star that’s close by. It’s the same here on Earth. Imagine you’re on the beach at night…

Productionize Machine Learning Models with Serverless Container Services | by Edwin Tan | Jan, 2023

How to create a serverless containerized inference endpoint for your machine learning models with Azure Container Apps. Serverless container architecture is an approach to building and running containerized applications and services without having to manage the underlying infrastructure. In this architecture, containers are used to package and deploy applications, and these containers are run in a fully managed environment provided by a cloud provider. The cloud provider is responsible for the…

How to Test PySpark ETL Data Pipeline | by Edwin Tan | Dec, 2022

Validate big data pipelines with Great Expectations. “Garbage in, garbage out” is a common expression used to emphasize the importance of data quality for tasks such as machine learning, data analytics and business intelligence. With an increasing amount of data being created and stored, building high-quality data pipelines has never been more challenging. PySpark is a commonly used tool for building ETL pipelines for large datasets. A common question that arises while building a data pipeline is “How…

Get your Melina on with this Elden Ring denim cloak from Edwin – Destructoid

You’ll never be Maidenless. The prestige Japanese clothing company Edwin revealed a new line of apparel based upon Bandai Namco’s smash-hit RPG Elden Ring. As spotted by Siliconera, the Elden Ring x Edwin line will be made available to pre-order from the denim enthusiast’s official website this month, and is perfect for battling those winter winds (and giant hands). First and foremost is this unique hooded cloak, styled after that worn by Elden Ring’s top Kindling Maiden, Melina. The design features a wide hood and an…

How to Prepare Scikit-Learn Models for Production | by Edwin Tan | Sep, 2022

Serve scikit-learn models with FastAPI and Docker. Data scientists spend large amounts of effort gathering business requirements, performing exploratory data analysis, data pre-processing, feature engineering, hyperparameter tuning and model evaluation, only to have their models stuck in local notebook environments. To unlock the full value of the trained models, they have to be made available to downstream applications. In this article, we walk through the steps to serve scikit-learn machine…

10 VSCode Productivity Hacks for Data Scientists | by Edwin Tan | Sep, 2022

10x your productivity with these VSCode extensions. Visual Studio Code (VSCode) is one of the most popular Integrated Development Environments among data science professionals. It supports a large number of programming languages and comes with terminal integration, a built-in debugger, intuitive keyboard shortcuts, Jupyter Notebook support and a customizable layout. If you can’t find built-in functionality that meets your needs, head over to the VSCode extension marketplace. It has an extensive…

How to Generate Docstrings for Data Science Projects | by Edwin Tan | Aug, 2022

Generate clear and well-formatted Python docstrings in seconds. “I wrote this function 6 months ago and now I can’t remember what it does!” Does this sound familiar? In the midst of rushing to meet deadlines, we often overlook the importance of good documentation (aka docstrings) for the classes, methods and functions that we create. So what are docstrings? Docstrings, also known as documentation strings, are string literals that describe a Python class, function or method. Here’s an example of a…

Midimax Compression for Large Time-Series Data | by Edwin Sutrisno

Lightweight and fast compression algorithm in Python for time-series plots. Motivation: Visualization is a powerful and critical step for reasoning with our data. Plotting large time-series data, however, generates heavy file sizes which slow down user interaction and strain computing resources such as RAM, disk, network and more. In my line of work, time-series data from machinery monitoring sensors may be recorded at rates from 1 Hz (1 point per second) to thousands of Hz, easily…

How to Deploy Scikit-Learn Models to Azure Container Instances | by Edwin Tan | Jun, 2022

Productionize your Scikit-Learn models with Azure Container Instances. Now that you have trained your scikit-learn models, what’s next? How can they be made available to downstream applications as an API? In this article we will examine how to train and deploy scikit-learn models as an API using MLFlow, Azure Machine Learning and Azure Container Instances. Here is a brief description of the services that we will be using. What is MLFlow? MLFlow is an open source platform to manage the ML lifecycle,…