Getting Started

A simple explanation and implementation of gradient descent

Image created by me

Let’s say we have a fictional dataset of pairs of variables, a mother and her daughter’s heights:


Reason behind least squares

Image by author

A little intro to linear regression first:

Linear regression is about finding a linear model that best fit a given dataset.

For example, in a simple linear regression with one input variable (i.e. one feature), the linear model is a line with formula y = mx + b , where m is the slope and b the y-intercept.

The linear model of best fit is one that minimizes the sum of squared errors.

As shown in the image below, error is the difference between observed and predicted value.


Machine Learning, Programming

A case for machine learning in predicting speed and responsiveness

Image by author

As an engineer on the front-end infrastructure team of a large tech company, I deal with web application performance on a daily basis.

Performance is vital, yet throughout my career, I see teams punt on it until they can no longer get away with it.

I know performance means many things. In this article, I’m using it to refer to responsiveness, or speed of an application when users interact with it.

I think performance gets punted on because of two main reasons. One, performance work is hard. Two, it has little effect on customer value until it becomes poor. …


Hands-on Tutorials, Getting Started

The What, Why, and How of Hyperparameter Tuning

Image source

Hyperparameter tuning is an important part of developing a machine learning model.

In this article, I illustrate the importance of hyperparameter tuning by comparing the predictive power of logistic regression models with various hyperparameter values.

First thing’s first.

What are hyperparameters? — The what

Parameter vs. hyperparameter

  • Parameters are estimated from the dataset. They are part of the model equation. The equation below is a logistic regression model. Theta is the vector containing the parameters of the model.


How machine learning answers the question

A walkthrough of Logistic Regression and Naive Bayes.

Image source

The year was 1912, and the mighty Titanic set sail on her maiden voyage. Jack, a “20 year old” “third class” “male” passenger, won a hand of poker and his ticket to the land of the free. In the last hour of April 14th, Titanic struck an iceberg, and its fate was sealed. Will Jack survive this wreckage?

(Yes, I know he died in the movie, but if he were a real person, would he have survived?)

This is a binary classification problem because we’re trying to predict one of two outcomes…


Getting Started

A beginner’s guide to multiclass classification

Image by author

In my previous article, I talked about binary classification with logistic regression.

We had a list of students’ exam scores and GPAs, along with whether they were admitted to their town’s magnet school.


A beginner’s guide to spam classification

Image by author

Below is an email caught by Gmail’s spam filter. How did the spam filter decide this was a spam?


GETTING STARTED

How to solve binary classification with gradient ascent

Binary classification (Image created by me)

Let’s say you have a dataset where each data point is comprised of a middle school GPA, an entrance exam score, and whether that student is admitted to her town’s magnet high school.

Given a new pair of (GPA, exam score) from Sarah, how can you predict whether Sarah will be admitted? Sarah’s GPA is 4.3 and her exam score is 79.


Image from here

Supposed your manager asks you to “improve the front-end performance of our React app.” Where do you start?

Although most of the items on this list now feel like second nature, the me from three or four years ago would have found them extremely helpful.

From a first principle perspective, you can think of performance as boiling down to two factors: memory and CPU. The lower the usage, the better.

Latency is also important to consider. For example, network requests are much higher in latency than storing and reading cached data or assets. Improving latency is a discussion worth of…


Image

As an engineer, I’m at least 2 steps removed from the end-users of my work. I rarely interact directly with customers or listen to their problems and what they want.

As an engineer, I also rarely stop to think about what makes a good design and why good design is important. As long as the app works and does its job, why should design matter?

After all, in our line of work, if a piece of code doesn’t work as expected, it’s the engineer’s problem to figure out why. …

Lily Chen

Machine learning enthusiast

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store