Thursday, July 18, 2024

vector databases tutorial

 What is a Vector Database & How Does it Work? Use Cases + Examples | Pinecone

A vector database indexes and stores vector embeddings for fast retrieval and similarity search, with capabilities like CRUD operations, metadata filtering, horizontal scaling, and serverless
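
As a rough illustration of what "similarity search" with "metadata filtering" means, here is a minimal in-memory sketch in plain Python. It is not Pinecone's API: the `TinyVectorStore` class, its method names, and the sample vectors are hypothetical, and a real vector database would typically use an approximate nearest-neighbor index rather than the linear scan shown here.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class TinyVectorStore:
    """Hypothetical toy store: brute-force search, no persistence."""

    def __init__(self):
        self._items = {}  # id -> (vector, metadata)

    def upsert(self, item_id, vector, metadata=None):
        # Create or overwrite an entry (the "C" and "U" of CRUD).
        self._items[item_id] = (list(vector), dict(metadata or {}))

    def delete(self, item_id):
        self._items.pop(item_id, None)

    def query(self, vector, top_k=3, where=None):
        """Return up to top_k (id, score) pairs, best match first.

        `where` is an optional dict of metadata fields that must match exactly.
        """
        where = where or {}
        scored = []
        for item_id, (vec, meta) in self._items.items():
            if all(meta.get(k) == v for k, v in where.items()):
                scored.append((item_id, cosine_similarity(vector, vec)))
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


store = TinyVectorStore()
store.upsert("doc-1", [1.0, 0.0, 0.0], {"lang": "en"})
store.upsert("doc-2", [0.9, 0.1, 0.0], {"lang": "en"})
store.upsert("doc-3", [0.0, 1.0, 0.0], {"lang": "fr"})

# Nearest English-language neighbors of the query vector.
results = store.query([1.0, 0.0, 0.0], top_k=2, where={"lang": "en"})
print(results)
```

The metadata filter is applied before scoring, so excluded items never compete for the top-k slots.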

Wednesday, July 17, 2024

AI embeddings without OpenAI, $0; tokenization @ Hugging Face API & libs

Hugging Face, Inc. is a French-American company that develops computation tools for building applications using machine learning. It is most notable for its transformers library built for natural language processing applications and its platform that allows users to share machine learning models and datasets and showcase their work.

data format: OData Web Services

used with Dynamics 365; data is exchanged as JSON or XML (Atom)

OData Web Services - Business Central | Microsoft Learn

The Open Data Protocol (OData) is a web protocol that is designed for querying tabular data and provides you with an alternative to SOAP-based web services. OData builds on web technologies such as HTTP and JavaScript Object Notation (JSON) to provide access to information from different applications, services, and stores. OData uses URIs for resource identification and commits to an HTTP-based, uniform interface for interacting with resources. This commitment to core Web principles allows for OData to enable a new level of data integration and interoperability across a broad range of clients, servers, services, and tools.

Open Data Protocol (OData) is an open protocol that allows the creation and consumption of queryable and interoperable Web service APIs in a standard way. Microsoft initiated OData in 2007.

The protocol enables the creation and consumption of HTTP-based Web APIs, which allow Web clients to publish and edit resources, identified using URLs and defined in a data model, using simple HTTP messages. OData shares some similarities with JDBC and with ODBC; like ODBC, OData is not limited to relational databases.
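
To make the "URIs for resource identification" idea concrete, the sketch below composes an OData query URL using the standard system query options `$filter`, `$select`, `$orderby` and `$top`. The service root, the `Customers` entity set, the field names, and the `build_odata_url` helper are all hypothetical; a real Business Central or Dynamics 365 endpoint will have its own URL layout and authentication.

```python
from urllib.parse import quote, urlencode


def build_odata_url(service_root, entity_set, filter_expr=None,
                    select=None, order_by=None, top=None):
    """Compose an OData query URL from standard system query options."""
    options = {}
    if filter_expr:
        options["$filter"] = filter_expr
    if select:
        options["$select"] = ",".join(select)
    if order_by:
        options["$orderby"] = order_by
    if top is not None:
        options["$top"] = str(top)
    url = f"{service_root.rstrip('/')}/{quote(entity_set)}"
    if options:
        # Keep "$" and "," readable; spaces and quotes are percent-encoded.
        url += "?" + urlencode(options, safe="$,", quote_via=quote)
    return url


url = build_odata_url(
    "https://example.com/odata/v4",   # hypothetical service root
    "Customers",                      # hypothetical entity set
    filter_expr="City eq 'Seattle'",
    select=["No", "Name"],
    order_by="Name asc",
    top=5,
)
print(url)
```

A plain HTTP GET on the resulting URL is then all a client needs to read the filtered rows.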

AI TDD code generator: micro-agent

 BuilderIO/micro-agent: An AI agent that writes (actually useful) code for you @GitHub
TypeScript, MIT license

An AI agent that writes and fixes code for you

Just run micro-agent, give it a prompt, and it'll generate a test
and then iterate on code until all test cases pass.

Monday, July 15, 2024

Nomic Atlas: AI data clustering tool

nomic-ai/nomic: Interact, analyze and structure massive text, image, embedding, audio and video datasets @GitHub

Python bindings for working with Nomic Atlas, the world’s most powerful unstructured data interaction platform. Atlas supports datasets from hundreds to tens of millions of points, and supports data modalities ranging from text to image to audio to video.

With Nomic Atlas, you can:
  • Generate, store and retrieve embeddings for your unstructured data.
  • Find insights in your unstructured data and embeddings all from your web browser.
  • Share and present your datasets and data findings to anyone.

Nomic Atlas  //

Interact, discover insights and build
with unstructured text, image and audio data.

Course: The Complete OPENAI JS APIs Course - Build 15 Projects | Udemy

AI: llm.c vs GPT-2, $672, 24h, by Andrej Karpathy

 Let's reproduce GPT-2 (1.6B): one 8XH100 node, 24 hours, $672, in llm.c · karpathy/llm.c · Discussion #677

by karpathy (Andrej)

In this post we are reproducing GPT-2 in llm.c. This is "the GPT-2", the full, 1558M parameter version that was introduced in OpenAI's blog post Better Language Models and their Implications in February 14, 2019. llm.c does so directly in C/CUDA (total of ~5,000 lines of code), without the typical training stack that would involve the Python interpreter and a significantly more complex deep learning library like PyTorch/JAX, huggingface/transformers, or etc. In 2019, training GPT-2 was an involved project from an entire team and considered a big model run but, ~5 years later, due to improvements in compute (H100 GPUs), software (CUDA, cuBLAS, cuDNN, FlashAttention) and data (e.g. the FineWeb-Edu dataset), we can reproduce this model on a single 8XH100 node in 24 hours, and for $672, which is quite incredible
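
A quick back-of-the-envelope check of those numbers: dividing the quoted $672 by the GPU-hours of one 8-GPU node running for 24 hours gives the implied rental rate. The per-hour figures below are derived from the quoted totals, not taken from the excerpt itself.

```python
total_cost_usd = 672   # total quoted in the post
gpus = 8               # one 8XH100 node
hours = 24             # wall-clock training time

gpu_hours = gpus * hours
cost_per_gpu_hour = total_cost_usd / gpu_hours
cost_per_node_hour = total_cost_usd / hours

print(f"{gpu_hours} GPU-hours, ${cost_per_gpu_hour:.2f} per GPU-hour, "
      f"${cost_per_node_hour:.2f} per node-hour")
# prints: 192 GPU-hours, $3.50 per GPU-hour, $28.00 per node-hour
```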

Feed | LinkedIn by Aleksa Gordić | LinkedIn

Andrej Karpathy - Wikipedia

Andrej Karpathy (born 23 October 1986[2]) is a Slovak-Canadian computer scientist who served as the director of artificial intelligence and Autopilot Vision at Tesla. He co-founded and formerly worked at OpenAI,[3][4][5] where he specialized in deep learning and computer vision

Saturday, July 13, 2024

AI Data Center: costs & revenue?

Most of the recent stock market growth is based on investments in AI,
mostly in data centers powered by NVIDIA systems.

While that is good for the stock market, sustaining it would require some tangible revenue.
Will it come in time?

Biden chaos, Soft landing secured? AI sentiment turns bearish, French elections - YouTube
All-in podcast

data: kaggle: amazon book reviews

Amazon Books Reviews

Goodreads-books reviews and descriptions of each book

Books_rating.csv(2.86 GB)

Book Title: 212404 unique values


used in 

Friday, July 12, 2024

AI tool: 360 walkaround for cars

Automotive 360 Walkaround For Dealerships | Impel

.NET API code tool: Craftsman & Wrapt

Wrapt - Scaffold an entire .NET 6 Web API with a simple yaml or json file.

Scaffold an entire .NET 6 Web API with a simple yaml or json file so you can focus on the high value features in your web app.

pdevito3/craftsman: A .NET scaffolding tool to help you stop worrying about boilerplate and focus on your business logic 🚀 @GitHub

Craftsman is the workhorse behind the Wrapt framework and provides a suite of CLI commands for quickly scaffolding out new files and projects for your .NET Web APIs with simple CLI commands and configuration files.

Thursday, July 11, 2024

xAI supercomputer w/o Oracle cloud

Elon Musk on X: "@xDaily xAI contracted for 24k H100s from Oracle and Grok 2 trained on those...

"xAI contracted for 24k H100s from Oracle and Grok 2 trained on those. Grok 2 is going through finetuning and bug fixes. Probably ready to release next month. 

xAI is building the 100k H100 system itself for fastest time to completion. Aiming to begin training later this month. It will be the most powerful training cluster in the world by a large margin. 

The reason we decided to do the 100k H100 and next major system internally was that our fundamental competitiveness depends on being faster than any other AI company. This is the only way to catch up. 

Oracle is a great company and there is another company that shows promise also involved in that OpenAI GB200 cluster, but, when our fate depends on being the fastest by far, we must have our own hands on the steering wheel, rather than be a backseat driver."

Musk xAI Ditches Oracle Cloud to Build Massive GPU Cluster for Grok 3

xAI already rents around 16,000 Nvidia GPUs from Oracle, making it one of the largest customers of the cloud service.

xAI plans to build "the world’s most powerful supercomputer" in Memphis, Tennessee. Musk said that he expects the supercomputer to open by the fall of 2025.

AI tool: PDF-to-Podcast

 knowsuchagency/pdf-to-podcast: Convert any PDF into a podcast episode! @GitHub, 
Python, Apache2

a tool to convert any PDF document into a podcast episode! Using OpenAI's text-to-speech models and Google Gemini, this tool processes the content of a PDF, generates a natural dialogue suitable for an audio podcast, and outputs it as an MP3 file.

Wednesday, July 10, 2024

marked: MarkDown parser & formatter

GitHub - markedjs/marked: A markdown parser and compiler. Built for speed.

Marked Documentation

  1. built for speed.
  2. a low-level markdown compiler for parsing markdown without caching or blocking for long periods of time.
  3. light-weight while implementing all markdown features from the supported flavors & specifications.
  4. available as a command line interface (CLI) and running in client- or server-side JavaScript projects.

Microsoft Azure AZ-204

Microsoft learning

AZ-204: Implement secure Azure solutions - Training | Microsoft Learn
1 hr 5 min, Learning Path, 3 Modules

Udemy classes

AZ-204 Developing Solutions for Microsoft Azure - OCT 2023 | Udemy Business
The most complete course available on the Microsoft Azure developer exam AZ-204 with timed practice test.
by Scott Duffy 153K students, 11 hours, 4.5*

Three complete timed practice tests for AZ-204 Azure Developer exam, 120 questions, 100% original material
by Scott Duffy 18K students, 4.2*

by Refactored/Skylines, Joe Fecht, 4.5K students, 4.5*

Pluralsight classes

AZ-204: Developing Solutions for Microsoft Azure | Pluralsight
by Amy Coughlin 2.5*, 70 ratings
This course will teach you the skills and knowledge needed to approach the AZ-204 exam with confidence.

Configuring and Managing Microsoft Azure Key Vault | Pluralsight
by Ned Bellavance
Azure Key Vault provides life-cycle management for keys, secrets, and certificates. In this course you will learn the basics of using and managing Key Vault, including creating and securing Key Vault, storing sensitive data, and auditing access.

Tuesday, July 09, 2024

dukpy: JavaScript interpreter for Python

GitHub - amol-/dukpy: Simple JavaScript interpreter for Python @GitHub, MIT

DukPy is a simple javascript interpreter for Python built on top of duktape engine without any external dependency. It comes with a bunch of common transpilers built-in for convenience:
  • CoffeeScript
  • BabelJS
  • TypeScript
  • JSX
  • LESS

Amazon Robotics

Amazon Robotics - Wikipedia

amazon warehouse robot

Amazon Robotics

Amazon Robotics |

Amazon Grows To Over 750,000 Robots As World's Second-Largest Private Employer Replaces Over 100,000 Humans

Sunday, July 07, 2024

AWS SDK: Go: v2 vs v1 (eos)

aws/aws-sdk: Landing page for the AWS SDKs on GitHub

aws/aws-sdk-go-v2: AWS SDK for the Go programming language.

aws-sdk-go-v2 is the v2 AWS SDK for the Go programming language.

The v2 SDK requires a minimum version of Go 1.20.

Getting Started
Getting Help
More Resources

aws/aws-sdk-go: AWS SDK for the Go programming language.

aws-sdk-go is the v1 AWS SDK for the Go programming language.

AWS announced the upcoming end-of-support (July 30, 2024) for AWS SDK for Go (v1). We recommend that you migrate to AWS SDK for Go v2. For dates, additional details, and information on how to migrate, please refer to the linked announcement.

Architecture: Lindal: Frank Lloyd Wright inspired homes

 Crystal Springs | 2168 SF | Lindal Imagine Series | Frank Lloyd Wright Inspired Homes

Cedar Homes of Bucks County | Lindal Cedar Homes | Custom Homes

Saturday, July 06, 2024

AI Architect: Frank Lloyd Wright style for Ocean City NJ

by Microsoft / Bing Copilot

Frank Lloyd Wright Usonian Inspired Houses | Lindal Cedar Homes

AWS SDK: JS: v3 vs v2 (eos)

aws/aws-sdk: Landing page for the AWS SDKs on GitHub

aws/aws-sdk-js: AWS SDK for JavaScript in the browser and Node.js

AWS announced the upcoming end-of-support (September 7, 2024) for AWS SDK for JavaScript v2. We recommend that you migrate to AWS SDK for JavaScript v3. For dates, additional details, and information on how to migrate, please refer to the linked announcement.

The AWS SDK for JavaScript v3 is the latest and recommended version, which has been GA since December 2020. Here is why and how you should use AWS SDK for JavaScript v3. You can try our experimental migration scripts in aws-sdk-js-codemod to migrate your application from v2 to v3.

To get help with your migration, please follow our general guidelines to open an issue and choose guidance. To give feedback on and report issues in the v3 repo, please refer to Giving feedback and contributing.

aws/aws-sdk-js-v3: Modularized AWS SDK for JavaScript.

The AWS SDK for JavaScript v3 is a rewrite of v2 with some great new features. As with version 2, it enables you to easily work with Amazon Web Services, but has a modular architecture with a separate package for each service. It also includes many frequently requested features, such as a first-class TypeScript support and a new middleware stack. For more details, visit blog post on general availability of Modular AWS SDK for JavaScript.

To get started with JavaScript SDK version 3, visit our Developer Guide or API Reference.

Friday, July 05, 2024

OpenSearch Docker & classes

opensearchproject/opensearch - Docker Image | Docker Hub

OpenSearch is a scalable, flexible, and extensible open-source software suite for search, analytics, and observability applications derived from Elasticsearch 7.10.2 and Kibana 7.10.2 and licensed under Apache 2.0. It consists of a search engine daemon (OpenSearch) and a user interface for data administration and visualizations (OpenSearch Dashboards).

docker pull opensearchproject/opensearch

docker run -it -p 9200:9200 -p 9600:9600 -e OPENSEARCH_INITIAL_ADMIN_PASSWORD=<strong-password> -e "discovery.type=single-node"  --name opensearch-node opensearchproject/opensearch:latest

curl -X GET "https://localhost:9200" -ku admin:<password>
curl -X GET "https://localhost:9200/_cat/nodes?v" -ku admin:<password>
curl -X GET "https://localhost:9200/_cat/plugins?v" -ku admin:<password>

Udemy Classes

by Bo Andersen (Denmark)
Rating: 4.6 out of 5
22,323 ratings
132,523 Students
12.5 hours

by Frank Kane (x Amazon)
Rating: 4.6 out of 5
5,837 ratings
45,124 Students
15.5 hours

Rating 4.5
7,957 ratings
46,647 Students
6 hours

Organic Architecture by Frank Lloyd Wright

Frank Lloyd Wright A Film by Ken Burns and Lynn Novick : Free Download, Borrow, and Streaming : Internet Archive

Frank Lloyd Wright's Penfield Usonian House - YouTube

Organic Architecture:
"where the whole is to the part what the part is to the whole"

  • Sense of shelter - main purpose of architecture.
  • Love and respect for Nature, the ultimate context, street names.
  • Return to natural biorhythms (Donna Penfield).
  • No separation from the elements.
  • Supremely human - humane architecture.
  • Grace to its site rather than a disgrace.
  • The building belongs where you see it standing.
  • Alternate universe - possibility, beauty, privacy, peace, space, comfort.
  • Complete work of art - part to whole, whole to part.
  • Integrity. A part of the landscape, environment - connection.
  • Free, democratic, individual - unconstrained by styles.
  • Guest books FULL of praise and appreciation.
  • Spiritual, simplicity.
  • Truth Against The World award speech. 

Frank Lloyd Wright's Organic Plans - YouTube by Stewart Hicks

“Organic means, in the philosophic sense, entity.
Where the whole is [to] the part as the part is to the whole
and where
the nature of the materials,
the nature of the purpose,
the nature of the entire performance 
becomes a necessity”.

To Wright, “Form and function are one.”

Organic Architecture | The Guggenheim Museums and Foundation

I’d like to have a free architecture. I’d like to have architecture that belonged where you see it standing, and was a grace to the landscape instead of a disgrace.
—Frank Lloyd Wright

J.A. Sweeton Residence - Wikipedia Cherry Hill, NJ

Usonia - Wikipedia
Usonia (/juːˈsoʊniə/) is a word that was used by the American architect Frank Lloyd Wright to refer to the United States in general (in preference over America), and more specifically to his vision for the landscape of the country, including the planning of cities and the architecture of buildings.

EV Ford future; F-150 hybrid as power generator

excellent interview!

Ford CEO Jim Farley discusses the rapid evolution of electric vehicles - YouTube

interesting: using a hybrid Ford (F-150) as an electric power generator for a house

new Ford EV built from the ground up, for $30K, profitable, coming in two years!

to compete with Tesla Model 2 and BYD

Ford quietly created its own ‘skunkworks’ team to develop low-cost electric vehicles - The Verge

The division is reportedly being led by ex-Tesla engineer Alan Clarke and is working on a flexible battery platform for Ford’s third-generation EVs.

Thursday, July 04, 2024

AI: Spreadsheet Is All You Need

 dabochen/spreadsheet-is-all-you-need: A nanoGPT pipeline packed in a spreadsheet

A nanoGPT pipeline packed in a spreadsheet

This is a project to help ... understand how GPT works.
It is pretty fun to play with, especially when you are trying to figure out what exactly is going on inside a transformer.
This helped to visualize the entire structure and the data flow.
All the mechanisms, calculations, matrices inside are fully interactive and configurable.

It contains all the transformer components including:
  • embedding
  • layer norm
  • self attention
  • projection
  • MLP
  • softmax
  • logits

It is based on Andrej Karpathy's NanoGPT structure which includes roughly 85000 parameters.
It is clearly a very small size, but it is both complex enough to understand how it works, and also not too big to crash computer. In contrast to chatgpt, this project is a character based prediction system, meaning that each token is a character, and to reduce the complexity, only letter A/B/C are being tokenized.
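
To illustrate the last point, here is a small sketch of a character-level tokenizer over the letters A/B/C, followed by a softmax that turns logits into next-token probabilities. The token ids, the logit values, and the function names are made up for illustration; the spreadsheet's own numbering and weights may differ.

```python
import math

VOCAB = ["A", "B", "C"]                       # the only characters tokenized
STOI = {ch: i for i, ch in enumerate(VOCAB)}  # character -> token id (assumed order)


def encode(text):
    """Map each character to its token id (one token per character)."""
    return [STOI[ch] for ch in text]


def decode(ids):
    """Map token ids back to a string."""
    return "".join(VOCAB[i] for i in ids)


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)                           # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


tokens = encode("ABCA")
logits = [2.0, 1.0, 0.1]                      # made-up scores for the next token
probs = softmax(logits)
next_token = decode([probs.index(max(probs))])  # greedy pick of the likeliest token
print(tokens, [round(p, 3) for p in probs], next_token)
```

With a three-character vocabulary, every intermediate vector is small enough to inspect by eye, which is what makes a spreadsheet layout practical.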

Rimac Robotaxi? EV self driving:

 Rimac Just Exposed Tesla's Robotaxi Playbook. - YouTube

Rimac Automobili - Wikipedia

Rimac Automobili (Croatian pronunciation: [rǐːmats automobǐːli]; REE-mahts) is a Croatian automotive manufacturer headquartered in Sveta Nedelja, Croatia, that develops and produces electric sports cars. Its sister company, Rimac Technology (part of the Rimac Group) also produces drivetrains and battery systems for automotive businesses.

In July 2021, VW Group's Porsche and Rimac announced a joint venture that would incorporate Volkswagen's high-performance Bugatti brand as well as Rimac Automobili. The new venture is called Bugatti Rimac.[15] Parent company Rimac Group announced a 55% stake in the joint venture, with Porsche holding a 45% stake in the company.[16] In June of that year, Porsche stated it was one of the participants in a new €500 million round of fundraising.[17]

Wednesday, July 03, 2024

Node.js Worker threads

How To Use Multithreading in Node.js | DigitalOcean

Node.js runs JavaScript code in a single thread, which means that your code can only do one task at a time. However, Node.js itself is multithreaded and provides hidden threads through the libuv library, which handles I/O operations like reading files from a disk or network requests. Through the use of hidden threads, Node.js provides asynchronous methods that allow your code to make I/O requests without blocking the main thread.

Node.js introduced the worker_threads module, which allows you to create threads and execute multiple JavaScript tasks in parallel. Once a thread finishes a task, it sends a message to the main thread that contains the result of the operation so that it can be used with other parts of the code. The advantage of using worker threads is that CPU-bound tasks don’t block the main thread and you can divide and distribute a task to multiple workers to optimize it.

A complete guide to threads in Node.js - LogRocket Blog

An Introduction to Node.js Multithreading — SitePoint

Worker threads | Node.js v22.2.0 Documentation

Microsoft Dynamics 365 Power Platform Training


dev training

Microsoft Dynamics 365 (CRM) & Power Platform Training (2024) | Udemy Business

Microsoft Dynamics 365 & PowerApps Developer Course - Part 1 | Udemy Business

Microsoft Dynamics 365 & PowerApps Developer Course - Part 2 | Udemy Business

user training

MB-910: Microsoft Dynamics 365 Fundamentals (CRM) | Udemy Business

Microsoft Learn


Microsoft 365 Basic Concepts and Fundamentals Path | Pluralsight

Tuesday, July 02, 2024

AI in-security: Samsung & ChatGPT

 Samsung bans use of generative AI tools like ChatGPT after April internal data leak | TechCrunch

May 2, 2023

A month after internal, sensitive data from Samsung was accidentally leaked to ChatGPT, Samsung is cracking down on usage of the generative AI service. The electronics giant is temporarily restricting the use of generative AI tools on company-owned devices, covering computers, tablets and phones, as well as non-company-owned devices running on internal networks. The ban would cover not just ChatGPT, but services that use the technology like Microsoft’s Bing, as well as competing generative AI services like Bard from Google.

AI Architecture: Usonian philosophy by Frank Lloyd Wright

 Lindal Cedar offers designs based on Frank Lloyd Wright homes

Usonia, a derivate term for the "United States of America" coined by Wright, refers to a set of principles the architect used to create more than 100 houses around the US starting in the 1930s.
  • integration with nature, 
  • use of natural materials, 
  • open floor plans, 
  • affordability
  • energy efficiency.
Seattle-based manufacturer Lindal Cedar Homes has partnered with the Frank Lloyd Wright Foundation to create a series of purchasable house designs informed by the American architect's Usonian philosophy.

Founded by Frank Lloyd Wright as the Taliesin Fellowship in 1932

AI Copilot Designer:

prompt: Design an octagon house in style of Frank Lloyd Wright, with Usonian philosophy and architecture