Skip to main content
Photo of DeepakNess DeepakNess

Raw Notes

Raw notes include useful resources, incomplete thoughts, ideas, micro thoughts, and learnings as I go about my day. Below, you can also subscribe to the RSS feed to stay updated:

https://deepakness.com/feed/raw.xml

Total Notes: 199


AI collapses

I had an interesting exchange of thoughts about AI or rather the future of AI with Ralf Christian on X. He made some great points that I thought should collect here:

I think the main problem is the tech itself. It doesn't 'know' anything, it 'simply' spits out content based on probabilities in the training data.

What's good in the training data is the spoken language. That's why it 'speaks' so well. But the training data is full of wrong info, that's why there's wrong output even after reasoning.

If people publish less human written content and more AI generated content, and we don't have a way to identify that with 100% accuracy, this will definitely not make those models better in the future

You might be able to still improve it here and there, like that it better keeps context, but don't expect any leap here. That's why there are no big improvements since they released chatgpt 3

I think the future if this might be niche LLMs, where you train them on a specific topic with almost hand picked training data and fine tune it for your specific use. For example, if you're Microsoft you could train it with all your company's code. I guess this gives output more close to what you want than training it with all of githubs code randomly

ChatGPT is really impressive, but it's far from making a real difference in real business (unless you are into spam 😇)

Yesterday I tried to generate a video with Sora. It failed so hard. I think what you are seeing on social media is 1000 people trying to do a video, 999 generating crap and not posting it and 1 got lucky and posts it. That's not value, that's luck.

I loved the simple explanation he made. Also, I loved this paper on "AI models collapse when trained on recursively generated data" that Ralf shared earlier in the same thread.


Mozilla is shutting down Pocket

Mozilla recently announced that they are shutting down the Pocket app, which people used to save articles, videos, and other content formats to read later.

I, too, have used the app in the past but do not use anymore (I'm more of an RSS guy now, I do not save to read later). At a point, Mozilla integrated the Pocket app to the Firefox browser by default, in fact, they do this to this day.

But they would be shutting down everything except the Pocket newsletter, it will continue sending issues under a different name. And the main reason for closing the app they give is:

[...] the way people save and consume content on the web has evolved [...]

You had a good run, Pocket.


Collecting funny memes

I really really love memes, the funny ones. And funny memes are rare, so I have started collecting the ones that really made me laugh at some point. I'm saving them on a separate meme page here.

These memes would be related to tech, most of the time.


Kailash Nadh about MCP

Kailash Nadh, Zerodha's CTA, has written an interesting blog post about MCP where he presents different scenarios of how MCP can be used, and also talks about the rapid adoption.

The funny thing is, as a technical construct, there is nothing special about MCP. It is a trivial API spec which has in fact suffered from poor design and fundamental technical and security issues from the get go. It does not matter if its internals change, or it even outright gets replaced by some XYZ tomorrow. Questions about privacy, security, correctness, and failures will continue to loom for a good while, irrespective of whether it is technically MCP or XYZ.

He talks about how, traditionally, connecting different software systems required extensive manual coding but MCP allows connecting services instantly.

I liked that he also talked about the concerns, as he worries about:

  • AI systems making real-world decisions with minimal human oversight
  • Questions of accountability when things go wrong
  • Privacy and security implications

One might imaginatively call it … SkyNet.

He also playfully compares MCP to SkyNet while calling it a "global, interconnected, self-organising meta system".

Overall, it's a balanced post sharing his technical excitements with genuine concerns about such AI systems getting full access to real-world services and decision-making power.

By the way, I almost forgot to mention that Zerodha itself has launched Kite MCP a few days ago.


Remix is moving on from React

In an unreleased blog post, Remix.run has mentioned that they are moving on from React to a completely new thing, as a person pointed out on X. And I think, this will be a huge step.

In this .md file, they mention that:

That's why Remix is moving on from React[...]

Remix v3 is a completely new thing. It's our fresh take on simplified web development with its own rendering abstraction in place of React.

Inspired by all the great tech before it (LAMP, Rails, Sinatra, Express, React, and more), we want to build the absolute best thing we know how to build with today's capable web platform.

This requires a declaration of independence from anybody else's roadmap.

They mention that they are not ready for a preview release yet, but this is the route that they are taking forward. They have really bold claims in the blog post that you must go through.


Anthropic launches Claude Opus and Sonnet 4

Anthropic just launched their most awaited Claude 4 Opus and Claude 4 Sonnet models. They tweeted:

Introducing the next generation: Claude Opus 4 and Claude Sonnet 4.

Claude Opus 4 is our most powerful model yet, and the world’s best coding model.

Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.

Claude 4 Sonnet is for "near instant response" whereas Claude 4 Opus is for extended thinking and deeper reasoning. And they both are significantly better than Claude's previous models as well as OpenAI's and Google's latest models (OpenAI Codex-1, OpenAI o3, OpenAI GPT-4.1, and Google Gemini 2.5 Pro) in terms of software engineering.

Starting today, Claude 4 Opus is available for the paid users and Claude 4 Sonnet is available for free users as well on claude.ai.


AI SEO research papers

The SEO landscape is changing and it's going to be heavily influenced by AI in the coming years. And here is a list of some really useful research papers that one should study to stay relevant.


Technologies claiming to replace software engineers

I asked Grok 3 DeeperResearch tool to find a list of technologies throughout the history that have claimed to replace software engineers, and it did pull up some cool mentions:

1. Automatic Programming and Compilers (1950s)

  • Era: 1950s
  • Description: Early efforts to automate programming, such as the A-0 compiler by Grace Hopper, focused on translating high-level languages like Fortran into machine code.
  • Claims: Suggested that programming could be automated, reducing the need for low-level coding expertise.
  • Impact: Enabled higher-level programming but did not eliminate the need for programmers to design algorithms and logic.
  • Source: Wikipedia: Automatic Programming

2. Fourth-Generation Programming Languages (4GLs) (1970s-1990s)

  • Era: 1970s-1990s
  • Description: High-level languages like SQL and FoxPro designed to be closer to human language, enabling non-programmers to perform tasks like database queries.
  • Claims: Hyped as potentially eliminating the need for traditional programmers, with - Claims: that they were the "last generation" requiring code writing.
  • Impact: Simplified specific tasks but were limited for complex projects, requiring professional developers for broader applications.
  • Source: Wikipedia: Fourth-generation programming language

3. Computer-Aided Software Engineering (CASE) Tools (1980s-1990s)

  • Era: 1980s-1990s
  • Description: Tools to automate software development stages, including analysis, design, and code generation, introduced by companies like Nastec.
  • Claims: Promised significant productivity gains, potentially reducing manual coding needs, often oversold as "silver bullets."
  • Impact: Faced adoption challenges due to complexity and cost, augmenting rather than replacing developers.
  • Source: Wikipedia: Computer-aided software engineering

4. Visual Programming Languages (1990s)

  • Era: 1990s
  • Description: Languages like Visual Basic (introduced 1991) used graphical interfaces for rapid application development.
  • Claims: Made programming accessible to non-experts, suggesting a reduced need for specialized skills for certain applications.
  • Impact: Democratized app development but required skilled developers for complex projects.
  • Source: Wikipedia: Visual Basic

5. Low-Code/No-Code Platforms (2000s-Present)

  • Era: 2000s-Present
  • Description: Platforms like Mendix and OutSystems enable application development with minimal coding, coined as "low-code" in 2014.
  • Claims: Marketed as empowering citizen developers, potentially reducing reliance on professional programmers for routine tasks.
  • Impact: Expanded development scope but professional developers remain essential for complex and custom solutions.
  • Source: Wikipedia: Low-code development platform

6. AI-Assisted Coding Tools (2020s)

  • Era: 2020s
  • Description: Tools like GitHub Copilot (launched 2021) use AI to suggest and generate code, assisting developers in real-time.
  • Claims: Sparked fears of replacing programmers, with discussions questioning if AI could automate coding tasks.
  • Impact: Enhances productivity but requires human oversight for problem-solving and quality assurance.
  • Source: Wikipedia: GitHub Copilot

Please note that these are completely AI-generated, I haven't edited a single thing here.


Vibe coding won't replace software engineers

I came across a very interesting LinkedIn post by Judah Diament where he makes a point that vive coding won't be replacing software engineers. Below are some interesting fragments of the post:

Vibe coding enables people who aren't well trained computer scientists to create complete, working applications. Is this a breakthrough? Not even close - there have been such tools since the late 1980s. See, for example: Apple HyperCard, Sybase PowerBuilder, Borland Delphi, FileMaker, Crystal Reports, Macromedia (and then Adobe) Flash, Microsoft VisualBasic, Rational Rose and other "Model Driven Development" tools, IBM VisualAge, etc. etc. And, of course, they all broke down when anything sightly complicated or unusual needs to be done (as required by every real, financially viable software product or service), just as "vibe coding" does.

Then he goes on to explaining why vibe coding won't be replacing software engineers:

To claim that "vibe coding" will replace software engineers, one must: 1) be ignorant of the 40 year history of such tools or 2) have no understanding of how AI works or 3) have no real computer science education and experience or 4) all of the above, OR, most importantly, be someone trying to sell something and make money off of the "vibe coding" fad.

I like how the last paragraph is framed, it's definitely some food for thought.


Less known HTML tags

I collected a list of 28 less known but very useful HTML tags:

1. Abbreviation – abbr

Marks abbreviations and shows full form on hover.

Example:

HTML

HTML:

<abbr title="HyperText Markup Language">HTML</abbr>

2. Bi-directional Isolation – bdi

Isolates text that may have different writing directions.

Example:

User علي logged in.

HTML:

User <bdi>علي</bdi>

3. Calculation Result – output

Shows the result of a calculation or user input.

Example:

+ = 7

HTML:

<form oninput="result.value=parseInt(a.value)+parseInt(b.value)">
  <input name="a" type="number" value="3"> +
  <input name="b" type="number" value="4"> =
  <output name="result">7</output>
</form>

4. Citation – cite

Used to reference the title of a work like a book, movie, article, or website. Usually shown in italics by browsers.

Example:

The Great Gatsby is a novel by F. Scott Fitzgerald.

HTML:

<p><cite>The Great Gatsby</cite> is a novel by F. Scott Fitzgerald.</p>

5. Contact Information – address

Used to provide contact details for a person, group, or organization. Usually displayed in italics and often used in footers.

Example:

Contact us at support@example.com

HTML:

<p><address>Contact us at support@example.com</address></p>

6. Definition – dfn

Marks the term being defined, often used in technical writing.

Example:

Latency is the delay before a transfer of data begins.

HTML:

<dfn>Latency</dfn>

7. Deleted Text – del

Used to mark removed text. Often shown with a strike-through.

Example:

This was removed.

HTML:

<p>This was <del>removed</del>.</p>

8. Description List with Terms and Details – dl + dt + dd

Used to list terms and their descriptions. <dl> wraps the whole list, <dt> defines the term, and <dd> gives the description.

Example:

HTML
A markup language for web pages.
CSS
Used to style HTML content.

HTML:

<dl>
  <dt>HTML</dt>
  <dd>A markup language for web pages.</dd>
  <dt>CSS</dt>
  <dd>Used to style HTML content.</dd>
</dl>

9. Direction Override – bdo

Forces a section of text to display in a specified direction.

Example:

Hello World

HTML:

<bdo dir="rtl">Hello World</bdo>

10. Expandable Section – details + summary

Creates a collapsible content box that can be expanded by the user.

Example:

Click to expandThis text is hidden until clicked.

HTML:

<details>
  <summary>Click to expand</summary>
  This text is hidden until clicked.
</details>

11. Group Form Fields – fieldset + legend

Groups related form inputs and adds a caption using <legend>.

Example:

Login

HTML:

<fieldset>
  <legend>Login</legend>
  <input type="text" placeholder="Username">
  <input type="password" placeholder="Password">
</fieldset>

12. Heading Group – hgroup

Groups a set of headings (like h1 to h6) when a heading has a subtitle or multiple levels. Helps with document outline.

Example:

Main Title

Subtitle

HTML:

<hgroup>
  <h1>Main Title</h1>
  <h2>Subtitle</h2>
</hgroup>

13. Hidden Content – template

Stores HTML that is not rendered until used with JavaScript.

Example:

HTML:

<template>
  <p>This is hidden and not rendered.</p>
</template>

14. Highlight Text – mark

Used to highlight part of text, often shown with yellow background.

Example:

This is important text.

HTML:

<p>This is <mark>important text</mark>.</p>

15. Inline Quotation – q

Used for short quotations that are displayed inline. Browsers usually add quotation marks automatically.

Example:

She said, Always write clean code.

HTML:

<p>She said, <q>Always write clean code.</q></p>

16. Inserted Text – ins

Used to mark text that was added later. Often shows as underlined.

Example:

This is new text.

HTML:

<p>This is <ins>new</ins> text.</p>

17. Keyboard Input – kbd

Used to show keyboard input, like shortcuts or key presses.

Example:

Press Ctrl + V

HTML:

<kbd>Ctrl</kbd> + <kbd>V</kbd>

18. Option Group – optgroup

Used to group related options inside a <select> dropdown, making it easier for users to choose from categorized lists.

Example:

HTML:

<select>
  <optgroup label="Fruits">
    <option>Apple</option>
    <option>Banana</option>
  </optgroup>
</select>

19. Program Output – samp

Represents output from a program, like an error or log message.

Example:

Login failed: incorrect password

HTML:

<samp>Login failed: incorrect password</samp>

20. Progress Bar – progress

Shows progress of a task like loading or uploading.

Example:

HTML:

<progress value="40" max="100"></progress>

21. Ruby Annotations – ruby + rt + rp

Used in East Asian text to show pronunciation hints.

Example:

kanji

HTML:

<ruby>漢<rt>kan</rt>字<rt>ji</rt></ruby>

22. Shown If JS Disabled – noscript

Displays content only if JavaScript is disabled in the browser.

Example:

HTML:

<noscript>JavaScript is disabled in your browser.</noscript>

23. Subscript Text – sub

Displays text lower and smaller than the baseline, commonly used in chemical formulas or math expressions.

Example:

H2O

HTML:

<p>H<sub>2</sub>O</p>

24. Superscript Text – sup

Displays text higher and smaller than the baseline, often used for exponents or footnotes.

Example:

E = mc2

HTML:

<p>E = mc<sup>2</sup></p>

25. Time or Date – time

Represents a specific time or date, useful for events or timestamps.

Example:

HTML:

<time datetime="2025-05-18">May 18, 2025</time>

26. Value Within Range – meter

Displays a value inside a known range, like disk or battery levels.

Example:

60%

HTML:

<meter value="0.6">60%</meter>

27. Variable Name – var

Used to show variables in math or programming context.

Example:

x + y = 10

HTML:

<var>x</var> + <var>y</var> = 10

28. Word Break Opportunity – wbr

Suggests a possible break point in a long word or URL.

Example:

www.examplelongword.com

HTML:

www.example<wbr>long<wbr>word.com

Michael Jamin about comedy and drama

"Comedy hits you in the head, drama hits you in the heart. If you want people to remember your work, you need both: comedy to lower their guard, drama to make them feel."

– Michael Jamin, a Hollywood screenwriter


OpenAI launches Codex

OpenAI launches Codex, a cloud-based agent that writes code and works on multiple tasks at once. It's just launched, and can be accessed from inside ChatGPT at chatgpt.com/codex but visiting this URL just redirected me back to ChatGPT as it's only for ChatGPT Pro users, and not Plus users.

Currently, it's in a research preview but it's said to have features like:

  • writing code for you
  • implementing new features
  • answering questions about your codebase
  • fixing bugs, etc.

The implementation is very interesting as it runs in its own cloud sandbox environment, and can be directly connected to your GitHub repo. It performs better than o1-high, o4-mini-high, and o3-high.

The cool thing is, it can also be guided by an AGENTS.md file placed within the repository. Very cool.

Today, we’re also releasing a smaller version of codex-1, a version of o4-mini designed specifically for use in Codex CLI.

Yes, they're also releasing something for Codex CLI as well. And about the pricing and availability:

Starting today, we’re rolling out Codex to ChatGPT Pro, Enterprise, and Team users globally, with support for Plus and Edu coming soon. [...] We plan to expand access to Plus and Edu users soon.

For developers building with codex-mini-latest, the model is available on the Responses API and priced at $1.50 per 1M input tokens and $6 per 1M output tokens, with a 75% prompt caching discount.

I am excited to see how this compares to Claude 3.7 Sonnet and Gemini 2.5 Pro in terms of coding, fixing bugs, designing UI, etc. I also uploaded a quick video about the same that you can watch on YouTube.


List of cool MCP servers

I have been coming across a lot of cool MCP server while browsing the internet, so decided to create a dedicated page and keep collecting MCPs here. I have a JSON file where I can add the new MCP servers, and it will automatically show in the card format here.

BioMCP

Connects AI systems to authoritative biomedical data sources

Learn more

Blender MCP

Allows Claude AI to directly interact with and control Blender

Learn more

Context7 MCP

Pulls up-to-date documentation and code examples straight from the source

Learn more

Google Keep MCP

A MCP server to reads, creates, updates and deletes Google Keep notes

Learn more

Logseq MCP

Securely queries and interacts with your Logseq knowledge base

Learn more

MindDB MCP

Lets you query data from 200+ sources like Slack and Gmail in both SQL and natural language

Learn more

Playwright MCP

Enables LLMs to interact with web pages through structured accessibility snapshots

Learn more

Postgres MCP

Enables LLMs to inspect database schemas and execute read-only queries

Learn more

TaskMaster MCP

A powerful task-management system you can drop into Cursor and others

Learn more

I will keep updating this list as I discover more such MCPs.


How to Connect ChatGPT to Airtable

Connecting ChatGPT to Airtable gives you the superpower to get answers to 100s of questions in no time. Here's how to do that:

You need the following things to be able to connect ChatGPT to Airtable:

  1. A paid Airtable account (the lowest plan is $24/month)
  2. OpenAI API key (you'll have to set up a payment method on OpenAI, here)
  3. The Scripting extension from Airtable (no additional cost), and
  4. A script to call the OpenAI API inside Airtable

And below is the function that you can use to call the OpenAI from inside the Airtable and get the output.

async function getGPTResponse() {
    const userInput = "why is the sky blue?";
    const maxTokens = 500;
    const temperature = 0.7;
    const model = "gpt-4.1";
    const systemPrompt = "be precise";

    const messages = [
        { role: "system", content: systemPrompt },
        { role: "user", content: userInput },
    ];

    const res = await fetch('https://api.openai.com/v1/chat/completions', {
        method: 'POST',
        headers: {
            'Content-Type': 'application/json',
            'Authorization': `Bearer ${openaiApiKey}`,
        },
        body: JSON.stringify({
            model,
            messages,
            max_tokens: maxTokens,
            temperature,
        }),
    });

    const data = await res.json();
    return data.choices?.[0]?.message?.content || null;
}

Here, userInput is the prompt that you give AI, maxTokens is the max tokens for the model, temperature is model temperature, and systemPrompt is the system prompt. The prompt here is hardcoded, but you can modify the script to dynamically fetch prompts from each row and then get the outputs accordingly.

ChatGPT is very good at doing this implementation as per your base data, you can just give the above script and other details in the prompt, and it will give you the final code that you can put inside the Scripting extension.

Also, there's a generic version of this script at InvertedStone that you can also get and use. You can generate almost any kind of content using this script, not just from ChatGPT but also from other AI models like Claude, Gemini, Perplexity, and more.


Donald Knuth about understanding

The ultimate test of whether I understand something is if I can explain it to a computer. I can say something to you and you’ll nod your head, but I’m not sure that I explained it well. But the computer doesn’t nod its head. It repeats back exactly what I tell it. In most of life, you can bluff, but not with computers.

– Donald Knuth


Google Docs new Markdown copy-paste options

Came to know that Google Docs now has a "Copy as Markdown" and "Paste from Markdown" option under the Edit menu at the top. You can select some text to highlight the copy option and then any Markdown is also pasted in the document with proper formatting.

Very cool!

By the way, Google Docs already had the option to download the entire document as a .md file, but these copy and paste options are even more user friendly.


About action and information

“When action grows unprofitable, gather information; when information grows unprofitable, sleep.”

― Ursula K. LeGuin, The Left Hand of Darkness


React Router inside Next.js

I saw a person using the React Router inside Next.js and I have so many questions. Like the navigation is visibly very fast, but my questions are:

  1. Is it good for public pages? Because I think, it will have same SEO issues as SPAs.
  2. Does it make the codebase more complicated?

Upon looking I found a detailed blog post on building a SPA using Next.js and React Router. It mentions the reason for not using the Next.js router:

Next.js is not as flexible as React Router! React Router lets you nest routers hierarchically in a flexible way. It's easy for any "parent" router to share data with all of its "child" routes. This is true for both top-level routes (e.g. /about and /team) and nested routes (e.g. /settings/team and /settings/user).

I do understand why someone would want to use Next.js but I have yet to learn more about this React Router thing.

BRB.

Update:

Josh has written a new short blog post about how he did it, definitely worth reading and understanding the process.


Best way to create CSS cards

Just noting this for myself for future reference that whenever I have to create cards, I must use this simpler method each time. If the HTML is like this:

<div class="card-container">
    <div class="card">
        <p>Card 1 content</p>
    </div>
    <div class="card">
        <p>Card 2 content</p>
    </div>
</div>

The CSS should be like this:

.card-container {
  display: grid;
  grid-template-columns: repeat(auto-fit, minmax(300px, 1fr));
  gap: 20px;
  margin: 0 auto;
}

/* and then whatever CSS for .card here */

It's clean and quick.


Important technologies with boring websites

I’ve compiled a list of websites for important web technologies that are likely to have old but functional designs. These are fundamental tools for the internet, often open-source, and their websites prioritize functionality over aesthetics, reflecting their long-standing nature.

FFmpeg

A multimedia framework for transcoding, streaming, and playing various media formats.

SQLite

A self-contained, serverless, SQL database engine widely used in applications.

Apache HTTP Server

An open-source HTTP server that powers a significant portion of the web.

Nginx

A high-performance HTTP server and reverse proxy used for web serving and load balancing.

PostgreSQL

A powerful, open-source object-relational database system used for data storage and management.

MySQL

An open-source relational database management system widely used in web applications.

Python

An interpreted, high-level, general-purpose programming language used extensively in web development.

Ruby

A dynamic, open-source programming language known for its simplicity and productivity.

Git

A distributed version control system essential for managing source code in software development.

Linux Kernel

The core of the Linux operating system, providing essential services for computing systems.

GNU Project

A collection of free software, including the GNU operating system, which is Unix-like but free.

TeX

A typesetting system that is the standard for creating books and articles with complex mathematics.

Vim

A highly configurable text editor built for efficient text editing, especially for developers.

Emacs

An extensible, customizable text editor that also serves as a development environment.

Perl

A high-level, general-purpose, interpreted programming language used for text processing.

Tcl

A scripting language with a simple API for embedding into C/C++ applications.

OpenSSH

A suite of secure networking utilities based on the SSH protocol for secure remote access.

OpenSSL

A software library for applications that secure communications over computer networks.

BIND

The most widely used Domain Name System (DNS) software on the internet.

I will keep updating this list as I discover more such websites.


WhatsApp AI chatbot in Python

I came across a GitHub repo containing the complete Python code host and run a WhatsApp AI chatbot. I have forked the repo as I am thinking of making such a chatbot for myself. The requirements are mentioned as:

  • WaSenderAPI: Only $6/month for WhatsApp integration
  • Gemini AI: Free tier with 1500 requests/month
  • Hosting: Run locally or on low-cost cloud options
  • No WhatsApp Business API fees: Uses WaSenderAPI as an affordable alternative

I will learn more about the WhatsApp business API and how it can be used to create a WhatsApp chatbot for specific topics that people can interact with. And then how it can all be monetized.


Stripe's new transfer-based model

Stripe has developed a new approach to analyze transactions using a new transformer-based foundation model. Earlier, they relied on a traditional machine learning model but these models had limitations, but the new model is supposed to even increase the conversion even more and significantly decrease the fraudulent transactions.

Gautam Kedia, an AI/ML engineer at Stripe, explained this in a detailed X post. He mentions:

So we built a payments foundation model—a self-supervised network that learns dense, general-purpose vectors for every transaction, much like a language model embeds words. Trained on tens of billions of transactions, it distills each charge’s key signals into a single, versatile embedding.

This approach improved our detection rate for card-testing attacks on large users from 59% to 97% overnight.

While I did have a loose knowledge of what a transformer is, I looked up its definition again to understand it better in the context of payments:

A Transformer is a type of neural network architecture that has revolutionized natural language processing (NLP) and is now being applied to other domains, as seen in the Stripe example. Its key innovation is the attention mechanism.

The attention mechanism allows the model to weigh the importance of different parts of the input sequence when processing any single part.

Further, I asked Gemini to explain this entire thing to me in a simpler words and here's how it explained:

Think of it like reading a book. An older model might read word by word and only remember the last few words. A Transformer, with its attention mechanism, can look back at earlier parts of the book to understand the meaning of the current sentence in the broader context. In the payment world, this means understanding the significance of a transaction not just in isolation, but in the context of previous transactions.

Very cool.


MCP has a new problem

Someone added more than 81 MCP tools to their Cursor IDE and it started showing a warning saying "too many tools can degrade performance" and it suggested to use less than 40 tools.

Cursor CEO replied the following:

you'll be able to disable individual tools in 0.50 :)

But the problem still remains, if MCPs are the future, there has to be a way that they are automatically managed and I do not need manually enable or disable tools.