shlogg 路 Early preview
Franck Pachot @franckpachot

Developer Advocate 鈻濃枮 YugabyteDB - postgres compatible distributed SQL open-source database, 馃吘Oracle Certified Master, 馃崈MongoDB Certified Associate Data Modeler, 馃悩 PostgreSQL fan,馃敹AWS Data Hero

MongoDB Storage Size With TTL Indexes

MongoDB's TTL indexes don't cause fragmentation when used with deletion, as space is reused automatically, maintaining a constant free space of 25%. Manual compaction can temporarily reclaim more space but returns to normal volume.

Joining MongoDB: 30 Years Of SQL Expertise

Joining @MongoDB in 2025! 30 yrs exp with SQL dbs (Oracle, Amazon RDS, PostgreSQL, YugabyteDB). Passionate about data modeling & troubleshooting performance issues.

Preventing Row-by-Row Flush In YugabyteDB With UUIDv7 Function

When creating user-defined functions in YugabyteDB, use simple expressions to avoid row-by-row flushing (Write Buffering). Avoid `generate_series` or `with recursive`. Use `create or replace function` with a simple SQL expression.

Predictable Results With LIMIT Or FETCH FIRST Require ORDER BY

if you want predictable results with LIMIT or FETCH FIRST in PostgreSQL, use ORDER BY to ensure consistent output order. Without ORDER BY, concurrent sequential scans can lead to unpredictable results.

Understanding NULL In SQL: Unknown Values Vs Non-Existent Ones

NULL in SQL indicates an unknown value, not non-existence. Arithmetic operations with NULL yield unknown results. Databases may implement functions differently, e.g., PostgreSQL's greatest() & least() ignore null operands.

SQL NULLs Vs Non-Existing Rows In Outer Joins

In SQL, NULL represents an existing but unknown value, not non-existence. Outer joins reveal inexisting rows with all-null values, including keys. Proper data modeling distinguishes between these meanings.

Handling Nulls In NoSQL & RDBMS Databases Compared

Handling nulls in various databases: MongoDB treats null as absent, SQL uses 3-value logic, PostgreSQL & YugabyteDB follow SQL standard, Oracle & SQL Server have workarounds, DynamoDB considers null as empty value.

YugabyteDB Query Planning And Caching Behavior

YugabyteDB's query planner switches between custom & generic plans based on executions (5 by default) & estimated cost. `plan_cache_mode` can be set to "auto" for automatic switching.

YugabyteDB Error: Serialization Failure Due To Concurrent Update

YugabyteDB error 'could not serialize access due to concurrent update' occurs when reads & writes conflict. Understand isolation levels (read committed, repeatable read) & consistent states to prevent errors.

YugabyteDB Distributed Transactions

YugabyteDB uses 2 databases: RegularDB for long-term storage & IntentsDB for ongoing transactions. Reading is efficient due to LSM trees, while writing to IntentsDB triggers a flush of RegularDB to protect committed changes.

Optimizing YugabyteDB Queries With Immutable Functions And Indexing

Optimize YugabyteDB queries by creating an immutable `GMT_HH24MI` function & indexing extracted data. Push down predicates for Index-Only Scan & boost performance.

Aurora DSQL Vs YugabyteDB: Distributed SQL Database Comparison

Distributed SQL databases Aurora DSQL & YugabyteDB compared: Aurora prioritizes scalability over features, while Yugabyte supports more PostgreSQL features for high-performance applications.

Optimize Database Joins With YugabyteDB For Efficiency

Optimize database performance with YugabyteDB's Batched Nested Loop Join & Index Skip Scan! Denormalization should be a last resort, not a premature fix. Efficient joins scale effectively in large environments

OCC Errors In Aurora DSQL: A Scenario Analysis

OCC in Amazon Aurora Database assumes concurrent transactions won't interfere & allows them to proceed without sync until commit time. Developers must code around OCC to handle potential conflicts.

Avoiding Update Anomalies With Embedded Documents In YugabyteDB

Serializable isolation level prevents dirty reads by ensuring that transactions see a consistent view of the database until they commit or roll back. This allows multiple transactions to access shared resources without conflicts.

Aurora Limitless: Managed Sharding For PostgreSQL Applications

Aurora Limitless is a managed sharding option built on top of Aurora Serverless for PostgreSQL applications. It's not a different engine but makes operating easier with database sharding.

Passing MongoDB Associate Data Modeler Exam With Ease

Passed MongoDB Associate Data Modeler Exam with experience in SQL databases & data modeling. Preparation via MongoDB Data Modeling Path & MongoDB for SQL Professionals courses helped. Tips: go fast, eliminate worst options, flag uncertain questions.

Serializable Transactions In Databases: Workarounds And Best Practices

Serializable isolation level isn't always necessary, use explicit locking or partial indexes for reliable transactions in various databases.

Compiling PostgreSQL Extensions Easier With YugabyteDB 2.21+

Compiling PostgreSQL extensions easier in YugabyteDB 2.21+ due to native OS libraries used instead of Linuxbrew, simplifying dependency management and testing.

PostgreSQL Index Row Size Limitations And Alternatives

PostgreSQL's B-tree index limit: 2704 bytes. Migrating to YugabyteDB resolves this issue with its Log-Structured Merge-Tree, allowing larger index entries.

Debunking YugabyteDB Myths: 10 Essential Facts Revealed

YugabyteDB misconceptions abound due to weak comparisons & unverified claims by competitors. Here are 10 essential facts to debunk myths & provide clarity on its innovative architecture.

YugabyteDB Performance Benefits

YugabyteDB's storage method provides fast query performance even with high concurrency and large number of versions. It uses RocksDB, which stores data in immutable SST files, making snapshots and incremental backups efficient.

Advanced Indexing Strategies In YugabyteDB For Improved Performance

YugabyteDB offers two query APIs, YSQL & YCQL, both enforcing ACID properties, but with different performance & features for relational & NoSQL applications.

YugabyteDB Two-Layer Architecture Simplified

YugabyteDB's 2-layer design: SQL processing layer & distributed transactional storage. Both layers on all nodes for easy deployment & scaling. Supports multiple query APIs like YSQL & YCQL.

Maintaining Approximate Aggregates With Fast Query And Low Overhead

Maintain an approximate aggregate with fast query & low overhead on DML using indexes, ideal for big tables where exact count takes minutes

YugabyteDB Name Resolution Issues In No-Network Containers

Name resolution issues in YugabyteDB: add hostname to /etc/hosts or use --master_addresses=127.0.0.2:7100 to resolve errors like "Timed out" and "Network error

Start YugabyteDB In 2 Seconds With Yb-master And Yb-tserver

For a faster YugabyteDB lab start, run yb-master and yb-tserver directly with --replication_factor=1 and --fs_data_dirs for logs and data. This skips HA setup and downloads the latest preview version.

YugabyteDB Distributed UDF Execution Plans Revealed

YugabyteDB optimizes UDFs by distributing work between query layer & storage servers, reducing remote calls & time spent on filtering & indexing.

YugabyteDB Supports User Defined Functions (UDFs) Like PostgreSQL

YugabyteDB supports User Defined Functions (UDFs) like PostgreSQL, enabling complex processing logic directly within SQL queries.

FerretDB With YugabyteDB: Embedded And Scalable

FerretDB now supports Embedded FerretDB, a stateless proxy. Easily embed it in your app with YugabyteDB Managed Cloud, scaling out without changes to your code.

Streamlining CI/CD With Docker Images For YugabyteDB

Streamline your CI/CD pipeline with a Docker image containing pre-installed database schema and data. Reduce image size by removing unnecessary index files.

Building YugabyteDB With Python Extension Included

Building a release of YugabyteDB with Python included: Dockerfile steps & testing PL/Python extension for HTTP calls using requests module.

Testing PostgreSQL Extensions Quickly With Docker And YugabyteDB

Test PostgreSQL extensions quickly on YugabyteDB by extracting files from a Docker image, saving time & effort in compilation.

Scaling Timeseries Data With YugabyteDB: Efficient Rollup And Purge

Rollup past data: keep summary, aggregate old events by day with YugabyteDB YSQL, reduce database size & improve visualization

YugabyteDB Vs Traditional Databases: A Counter-Intuitive Test

YugabyteDB stores tables & indexes in LSM-Trees, reducing bloat & improving performance. Automatic sharding keeps counters in different MemTables, while background compaction manages disk space & read amplification.

Using UUIDs Vs Sequences In PostgreSQL: Developer Perspective

UUIDs vs Sequences: Listen to Developers & Choose Wisely. PostgreSQL provides both, but decision depends on app design, usage & performance considerations. UUIDs offer flexibility, sequences guarantee uniqueness.

Detecting Hotspots With Active Session History In YugabyteDB

Detecting hotspots in YugabyteDB using Active Session History, a comprehensive dashboard to identify and analyze performance bottlenecks in distributed databases.

Effective SQL Indexing Strategies For Optimized Query Performance

Improve SQL indexing with column order. Don't need best index, just effective one. Follow 'How to' guide for optimal results.