Techno Blender
Digitally Yours.
Browsing Tag

EMR

Building a Semantic Book Search: Scale an Embedding Pipeline with Apache Spark and AWS EMR…

Image from UnsplashBuilding a Semantic Book Search: Scale an Embedding Pipeline with Apache Spark and AWS EMR ServerlessUsing OpenAI’s Clip model to support natural language search on a collection of 70k book coversIn a previous post I did a little PoC to see if I could use OpenAI’s Clip model to build a semantic book search. It worked surprisingly well, in my opinion, but I couldn’t help wondering if it would be better with more data. The previous version used only about 3.5k books, but there are millions in the…

The Regulators of Facebook, Google and Amazon Also Invest in the Companies’ Stocks

The top watchdog of American business is also home to Washington’s most active Wall Street investors.The Federal Trade Commission in recent years has opened investigations into nearly every major industry. It has launched antitrust probes into technology companies, examined credit card firms and moved to restrict drug, energy and defense-company mergers. At the same time, senior officials at the FTC disclosed more trades of stocks, bonds and funds, on average, than officials at any other major agency in a…

Test Driving Delta Lake 2.0 on AWS EMR — 7 Key Learnings | by Irfan Elahi | Oct, 2022

What I learned after using Delta Lake 2.0 on AWS EMR along with installation steps and performance benchmarksPhoto by Luís Sousa on UnsplashIf you have read my previous article about getting started with Delta Lake in AWS, you would have got the fundamental context and rationale that why offerings like Delta Lake are gaining traction and what type of use-cases they address. The article presented simple and easy steps to get started with delta lake and even though you can use the approach there to address certain some…