<rss xmlns:source="http://source.scripting.com/" version="2.0">
  <channel>
    <title>JP Camara</title>
    <link>https://jpcamara.com/</link>
    <description></description>
    
    <language>en</language>
    
    <lastBuildDate>Tue, 30 Dec 2025 11:20:16 -0500</lastBuildDate>
    <item>
      <title>When good threads go bad</title>
      <link>https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html</link>
      <pubDate>Tue, 30 Dec 2025 11:20:16 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2025/12/30/when-good-threads-go-bad.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/whengoodthreadsgobad.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is part of a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my!&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;When good threads go bad&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “When good threads go bad”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#threads-curfew&#34;&gt;Threads out after curfew&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#stuck-threads&#34;&gt;Stuck threads&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#deadlocks&#34;&gt;Deadlocks&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#livelocks&#34;&gt;Livelocks&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#long-running-cpu&#34;&gt;Long-running CPU&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#long-running-io&#34;&gt;Long-running IO&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#native-extensions&#34;&gt;Native extensions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#what-happened-to-puma&#34;&gt;What happened to our Puma server, anyways?&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-shutdown&#34;&gt;Thread shutdown&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-raise-kill&#34;&gt;&lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#dont-kill-threads&#34;&gt;Don’t &lt;code&gt;kill&lt;/code&gt; threads&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#interrupt-safely&#34;&gt;Interrupt your threads safely&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#ensure-cleanup&#34;&gt;&lt;code&gt;ensure&lt;/code&gt; cleanup&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#dont-use-timeout&#34;&gt;Don’t use &lt;code&gt;timeout&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#use-term-on-timeout&#34;&gt;Use &lt;code&gt;term_on_timeout&lt;/code&gt; with &lt;code&gt;rack-timeout&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#monitor-your-thread-costs&#34;&gt;Monitor your thread costs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#get-a-handle&#34;&gt;Get a &lt;code&gt;handle_interrupt&lt;/code&gt; on things&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-handle-interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#ensure-success&#34;&gt;The way you &lt;code&gt;ensure&lt;/code&gt; success&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;threads-curfew&#34;&gt;Threads out after curfew 🧵 &lt;/h2&gt;
&lt;p&gt;It’s late, and you start getting alerts that requests to your web server are failing. You try to load a page and it hangs endlessly. The server isn’t responding to anything, and requests are continuing to queue up.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-do-you-know-where-your-threads-are.drawio-2.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It’s 10PM. Do you know where your &lt;s&gt;children&lt;/s&gt; &lt;em&gt;threads&lt;/em&gt; are?&lt;/p&gt;
&lt;p&gt;📺 &lt;a href=&#34;https://en.wikipedia.org/wiki/Do_you_know_where_your_children_are%3F&#34;&gt;iykyk&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Not knowing what else to do, you trigger a server restart. Even doing that, things remain unresponsive for another 30 seconds. Finally you see the server stop, and start up again. As if by magic, everything is running fine again.&lt;/p&gt;
&lt;p&gt;You’re running Puma with threads. It seemed like every thread was unresponsive. What happened to those threads?!&lt;/p&gt;
&lt;h3 id=&#34;stuck-threads&#34;&gt;Stuck threads&lt;/h3&gt;
&lt;p&gt;The reality of the situation is probably mundane.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Are you looking up data in a query? Did you remember to index your columns?&lt;/li&gt;
&lt;li&gt;Do you have &lt;a href=&#34;https://www.namitjain.com/blog/n-plus-1-query-problem&#34;&gt;N+1 queries?&lt;/a&gt;&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;Did you remember to &lt;a href=&#34;https://github.com/ankane/strong_migrations?tab=readme-ov-file#migration-timeouts&#34;&gt;set statement and lock timeouts&lt;/a&gt; before running a schema migration?&lt;/li&gt;
&lt;li&gt;Are you &lt;em&gt;sure&lt;/em&gt; it isn’t 1, 2 or 3?&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;If it &lt;em&gt;isn’t&lt;/em&gt; those things, there are other ways your threads can go rogue. Let’s look at some ways a thread can get stuck:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Deadlocks&lt;/li&gt;
&lt;li&gt;Livelocks&lt;/li&gt;
&lt;li&gt;Long-running CPU&lt;/li&gt;
&lt;li&gt;Long-running IO&lt;/li&gt;
&lt;li&gt;Native extensions&lt;/li&gt;
&lt;/ul&gt;
&lt;h4 id=&#34;deadlocks&#34;&gt;&lt;a href=&#34;https://en.m.wikipedia.org/wiki/Deadlock_(computer_science)&#34;&gt;Deadlocks&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;The conventional example of a deadlock is two &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#mutex&#34;&gt;threads attempting to acquire a mutex&lt;/a&gt; held by the other thread.&lt;/p&gt;
&lt;p&gt;They can never make progress, so they’re dead in the water:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;mutex_1 = Mutex.new
mutex_2 = Mutex.new
thread_1 = Thread.new do
  mutex_1.synchronize do
    sleep 1
    mutex_2.lock
  end
end
thread_2 = Thread.new do
  mutex_2.synchronize do
    sleep 1
    mutex_1.lock
  end
end
thread_1.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In our example, &lt;code&gt;thread_1&lt;/code&gt; acquires &lt;code&gt;mutex_1&lt;/code&gt;. Then &lt;code&gt;thread_2&lt;/code&gt; acquires &lt;code&gt;mutex_2&lt;/code&gt;. Next, &lt;code&gt;thread_1&lt;/code&gt; attempts to acquire &lt;code&gt;mutex_2&lt;/code&gt; and blocks. Then &lt;code&gt;thread_2&lt;/code&gt; attempts to acquire &lt;code&gt;mutex_1&lt;/code&gt; and blocks. Neither can make progress, and both are stuck in place.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/whengoodthreadsgobad.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;It’s a traditional example, but Ruby is very good at detecting it! It checks whether any thread is still capable of making progress, and raises an error when none are:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;&amp;#39;Thread#join&amp;#39;: No live threads left. Deadlock? (fatal)
  3 threads, 3 sleeps current:0x0000000a35e90c00 main thread:0x0000000101262de0
  * #&amp;lt;Thread:0x00000001001fafa8 sleep_forever&amp;gt;
    rb_thread_t:0x0000000101262de0 native:0x0000000202cfc800 int:0
	   
  * #&amp;lt;Thread:0x0000000121727c08 (irb):101 sleep_forever&amp;gt;
  rb_thread_t:0x0000000a35e90e00 native:0x00000001700f7000 int:0 mutex:0x0000000a3638c000 cond:1
    depended by: tb_thread_id:0x0000000101262de0
	   
  * #&amp;lt;Thread:0x00000001217279d8 (irb):107 sleep_forever&amp;gt;
  rb_thread_t:0x0000000a35e90c00 native:0x0000000170203000 int:0
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Ruby detects that &lt;code&gt;thread_1&lt;/code&gt; and &lt;code&gt;thread_2&lt;/code&gt; are sleeping, and the third thread is &lt;code&gt;Thread.main&lt;/code&gt;, which sleeps waiting for &lt;code&gt;thread_1&lt;/code&gt; to finish.&lt;/p&gt;
&lt;p&gt;Most long-running programs are likely to have &lt;em&gt;some&lt;/em&gt; other thread running, though, and Ruby only detects a deadlock if &lt;em&gt;all&lt;/em&gt; threads are stuck. We can sneak our deadlock example past detection by adding an extra thread in a work loop:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;mutex_1 = Mutex.new
mutex_2 = Mutex.new
thread_1 = Thread.new do
  mutex_1.synchronize do
    sleep 1
    mutex_2.lock
  end
end
thread_2 = Thread.new do
  mutex_2.synchronize do
    sleep 1
    mutex_1.lock
  end
end

thread_3 = Thread.new do
  loop do
    # process some work...
  end
end

thread_1.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now threads 1 and 2 will never progress, and Ruby lets the program continue running because &lt;code&gt;thread_3&lt;/code&gt; is still active.&lt;/p&gt;
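&lt;p&gt;When you suspect stuck threads in a real process, you don’t have to guess where they are. A quick diagnostic (a minimal sketch using only the standard &lt;code&gt;Thread&lt;/code&gt; API) is to dump the status and backtrace of every live thread:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Print what every live thread is currently doing
Thread.list.each do |t|
  puts &amp;quot;#{t.inspect} status=#{t.status.inspect}&amp;quot;
  # backtrace can be nil (e.g. for dead threads), so guard it
  puts (t.backtrace || []).first(5)
end
&lt;/code&gt;&lt;/pre&gt;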
&lt;h4 id=&#34;livelocks&#34;&gt;Livelocks&lt;/h4&gt;
&lt;p&gt;While a deadlock means a thread has stopped processing, a livelock happens when a thread keeps running, but never makes any progress. Here’s another example using an alternative approach for acquiring a mutex:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;mutex_1 = Mutex.new
mutex_2 = Mutex.new
thread_1 = Thread.new do
  mutex_1.synchronize do
    sleep 0.1
    while !mutex_2.try_lock; end
  end
end
thread_2 = Thread.new do
  mutex_2.synchronize do
    sleep 0.1
    while !mutex_1.try_lock; end
  end
end
thread_1.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is similar to our deadlock example, but this time we use &lt;code&gt;#try_lock&lt;/code&gt; instead of &lt;code&gt;#lock&lt;/code&gt;. Unlike &lt;code&gt;#lock&lt;/code&gt;, which blocks until the mutex is available, &lt;code&gt;#try_lock&lt;/code&gt; returns &lt;code&gt;false&lt;/code&gt; if attempting the lock fails. We do a short &lt;code&gt;sleep&lt;/code&gt; in each thread to give them time to acquire the initial locks, then iterate infinitely attempting &lt;code&gt;#try_lock&lt;/code&gt;. The locks will never be acquired, and the loops will run forever. Burn, CPU, burn 🔥.&lt;/p&gt;
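&lt;p&gt;Before we get to the more general fix below, a spinning &lt;code&gt;#try_lock&lt;/code&gt; loop like this can also be tamed locally: release what you already hold and back off for a random interval before retrying. Here’s a sketch of that idea (not code from any particular library):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def lock_both(first, second)
  loop do
    first.lock
    # Got both locks - the caller is responsible for unlocking them
    return if second.try_lock
    # Release the first lock so the other thread can make progress,
    # then back off for a random interval to break the lockstep
    first.unlock
    sleep(rand / 100)
  end
end
&lt;/code&gt;&lt;/pre&gt;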
&lt;p&gt;Personally I’ve rarely encountered deadlocks and livelocks in threaded code. But I’ve definitely encountered them in databases!&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;thread_1 = Thread.new do
  User.find(1).with_lock do
    sleep 0.1
    User.find(2).lock!
  end
end

thread_2 = Thread.new do
  User.find(2).with_lock do
    sleep 0.1
    User.find(1).lock!
  end
end

thread_1.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;PostgreSQL will detect this deadlock and raise an error:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;PG::TRDeadlockDetected: ERROR:  deadlock detected (ActiveRecord::Deadlocked)
DETAIL:  Process 581 waits for ShareLock on transaction 4003; blocked by process 582.
Process 582 waits for ShareLock on transaction 4002; blocked by process 581.
CONTEXT:  while locking tuple (0,1) in relation &amp;#34;users&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;How do we solve deadlocks and livelocks? The answer is a consistent order for locking.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;mutex_1 = Mutex.new
mutex_2 = Mutex.new
thread_1 = Thread.new do
  mutex_1.synchronize do
    sleep 1
    mutex_2.lock
  end
end
thread_2 = Thread.new do
  mutex_1.synchronize do
    sleep 1
    mutex_2.lock
  end
end

thread_3 = Thread.new do
  loop do
    # process some work...
  end
end

thread_1.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is the same example as before, but this time both threads attempt to acquire the mutexes in an &lt;em&gt;identical&lt;/em&gt; order. As long as you acquire locks in a consistent order, you should never hit deadlocks or livelocks.&lt;/p&gt;
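&lt;p&gt;When the locks involved aren’t known ahead of time, the same idea generalizes: always acquire them in some stable, agreed-upon order - for example, sorted by &lt;code&gt;object_id&lt;/code&gt;. A minimal sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Acquire any set of mutexes in a globally consistent order
def synchronize_all(*mutexes)
  ordered = mutexes.sort_by(&amp;amp;:object_id)
  ordered.each(&amp;amp;:lock)
  yield
ensure
  # Unlock in reverse order, skipping any we never acquired
  ordered.reverse_each { |m| m.unlock if m.owned? }
end

synchronize_all(mutex_1, mutex_2) do
  # both locks held, acquired in the same order everywhere
end
&lt;/code&gt;&lt;/pre&gt;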
&lt;h4 id=&#34;long-running-cpu&#34;&gt;Long-running CPU&lt;/h4&gt;
&lt;p&gt;As far as I know, this isn’t &lt;em&gt;actually&lt;/em&gt; possible in pure Ruby. What I mean by “pure” Ruby is a program that only runs Ruby code, and no C/Rust/Zig extensions. The CRuby runtime controls how pure Ruby code runs, and makes sure a single thread can’t hog the CPU. &lt;em&gt;Mostly&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my!&lt;/a&gt;, we dug into the &lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html#timer-interrupt-mask&#34;&gt;&lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt; and how it utilizes priority. It allows a thread to influence how large a time slice it gets:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def calculate_priority(priority, limit)
  priority &amp;gt; 0 ? limit &amp;lt;&amp;lt; priority : limit &amp;gt;&amp;gt; -priority
end
	
calculate_priority(0, 100)        # =&amp;gt; 100ms
calculate_priority(2, 100)        # =&amp;gt; 400ms
calculate_priority(-2, 100)       # =&amp;gt; 25ms
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;At a default priority of 0, we get Ruby’s default time slice of 100ms. At -2, we get 25ms. At 2, we get 400ms. Does this mean that, in theory, we can starve out other threads by increasing our priority?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;calculate_priority(5, 100)       # =&amp;gt; 3276800ms
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;3,276,800 milliseconds is 54 minutes. Can we really block things for 54 minutes!?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;counts = Array.new(10, 0)
done = false

threads = 10.times.map do |i|
  Thread.new do
    j = 0
    loop do
      break if done
      j += 1
      counts[i] += 1 if j % 1_000_000 == 0
    end
  end.tap do |t|
    t.priority = 15 if i == 5
  end
end

sleep 10
done = true
counts.each_with_index { |c, i| puts &amp;quot;#{i}: #{c}&amp;quot; }
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;There’s a bunch going on here - but there are only a few details to focus on:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;For the 6th thread (&lt;code&gt;i == 5&lt;/code&gt;), we set the priority to 15. That’s the priority that in theory gives us a time slice of 54 minutes.&lt;/li&gt;
&lt;li&gt;Each thread increments its count every million iterations.&lt;/li&gt;
&lt;li&gt;We sleep for 10 seconds, then flip the killswitch for all threads by setting &lt;code&gt;done&lt;/code&gt; to &lt;code&gt;true&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;We print how many times each thread was able to increment their slice of the array.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Here’s the output:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;0: 26
1: 27
2: 26
3: 27
4: 28
5: 189
6: 24
7: 24
8: 24
9: 24
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;As we can see - the priority made a difference! The thread at index &lt;code&gt;5&lt;/code&gt; runs around 7 times as often as every other thread. &lt;em&gt;But&lt;/em&gt;, it does not fully hog the runtime like we might have thought. This run is only 10 seconds, well under 54 minutes, and the other threads still get time from the scheduler.&lt;/p&gt;
&lt;p&gt;This shows that we can &lt;em&gt;influence&lt;/em&gt; the scheduler, but we can’t completely hog the runtime from pure Ruby code.&lt;/p&gt;
&lt;p&gt;In more practical Ruby code, the &lt;a href=&#34;https://github.com/sidekiq/sidekiq&#34;&gt;Sidekiq&lt;/a&gt; gem gives you the ability to set priority on the threads it creates:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Sidekiq.configure_server do |cfg|
  cfg.thread_priority = 15
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Interestingly, by default, Sidekiq sets its threads to a priority of -1, which is less than the &lt;code&gt;0&lt;/code&gt; that Ruby uses by default. It describes the rationale:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Ruby’s default thread priority is 0, which uses 100ms time slices. This can lead to some surprising thread starvation; if using a lot of CPU-heavy concurrency, it may take several seconds before a Thread gets on the CPU.&lt;/p&gt;
&lt;p&gt;Negative priorities lower the timeslice by half, so -1 = 50ms, -2 = 25ms, etc. With more frequent timeslices, we reduce the risk of unintentional timeouts and starvation.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Fascinating to see a real-world use-case of priority like this! Sidekiq has run trillions of jobs across hundreds of thousands of apps, and they made the decision to switch the priority.&lt;/p&gt;
&lt;p&gt;Since Ruby 3.4, you can achieve the same thing globally by using &lt;a href=&#34;https://bugs.ruby-lang.org/issues/20861&#34;&gt;&lt;code&gt;RUBY_THREAD_TIMESLICE&lt;/code&gt;&lt;/a&gt;. You can set &lt;code&gt;RUBY_THREAD_TIMESLICE=50&lt;/code&gt; and keep the priority the same, but now the time slice is 50ms.&lt;/p&gt;
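&lt;p&gt;For example (assuming defaults otherwise), every thread in this Puma process would now be rescheduled twice as often:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Halve the default 100ms time slice for every thread in the process
RUBY_THREAD_TIMESLICE=50 bundle exec puma -t 5:5
&lt;/code&gt;&lt;/pre&gt;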
&lt;h4 id=&#34;long-running-io&#34;&gt;Long-running IO&lt;/h4&gt;
&lt;p&gt;This is the likeliest scenario that will saturate your threads: long-running IO. In &lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;, we discussed how threads are great at handing off work when blocked on IO:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;As soon as you do any IO operation, it just parks that thread/fiber and resumes any other one that isn’t blocked on IO.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;However, this only works as long as:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You have other threads available to pick up work&lt;/li&gt;
&lt;li&gt;You’ve tuned your thread counts based on your workload. &lt;a href=&#34;https://youtu.be/6HaXuQJMcvs?si=kC9e7GWZF-PIqGde&amp;amp;t=161&#34;&gt;Amdahl’s Law&lt;/a&gt; helps you decide how many threads you should run, based on how much IO is running in your code&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Earlier we mentioned slow queries as a possible IO blocker. But let’s say you have a web server running with 5 threads, and you allow users to download files&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class DownloadsController &amp;lt; ApplicationController
  def index
    send_file Rails.root.join(&amp;quot;public/large.txt&amp;quot;),
      filename: &amp;quot;large.txt&amp;quot;,
      type: &amp;quot;application/octet-stream&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We have a simple Rails controller action, which sends a file to the client making the request. It’s pretty straightforward! The &lt;code&gt;large.txt&lt;/code&gt; file I tested with locally is about 130MB. I’ll run it in a basic Puma setup, using 5 threads:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;bundle exec puma -t 5:5
	
Puma starting in single mode...
 * Puma version: 7.1.0 (&amp;#34;Neon Witch&amp;#34;)
 * Ruby version: ruby 4.0.0 (2025-12-25 revision 553f1675f3) +PRISM [arm64-darwin23]
 *  Min threads: 5
 *  Max threads: 5
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I’m running a Puma server with 5 threads. Let’s try some benchmarks against it. We’ll use &lt;a href=&#34;https://httpd.apache.org/docs/2.4/programs/ab.html&#34;&gt;Apache Bench&lt;/a&gt; (&lt;code&gt;ab&lt;/code&gt;) to simulate traffic to Puma. &lt;code&gt;-n&lt;/code&gt; means the total number of requests, and &lt;code&gt;-c&lt;/code&gt; is how many concurrent requests to make. Let’s start with 1 request:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;ab -n 1 -c 1 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   0.070 seconds
Complete requests:      1
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Cool - 0.07 seconds. How about 3?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;ab -n 3 -c 3 http://0.0.0.0:9292/downloads
	
Concurrency Level:      3
Time taken for tests:   0.074 seconds
Complete requests:      3
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Not much change from a single request! How about 5? This would match the maximum number of simultaneous requests our server can currently support, using 5 threads:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;ab -n 5 -c 5 http://0.0.0.0:9292/downloads
	
Concurrency Level:      5
Time taken for tests:   0.134 seconds
Complete requests:      5
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Things slow down a &lt;em&gt;little&lt;/em&gt; bit once we max out our threads. But still reasonable. &lt;em&gt;How about 50&lt;/em&gt;?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;ab -n 50 -c 50 http://0.0.0.0:9292/downloads
	
Concurrency Level:      50
Time taken for tests:   0.820 seconds
Complete requests:      50
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;So far, so good. But we’re benefiting from a lot here: the file isn’t &lt;em&gt;particularly&lt;/em&gt; large, and there is &lt;em&gt;no&lt;/em&gt; latency. The server is responding quickly, and the client is consuming the response quickly. Let’s switch things up a bit - how well does it handle a client downloading slowly?&lt;/p&gt;
&lt;p&gt;We’ll use &lt;code&gt;curl&lt;/code&gt; to simulate a slow client. &lt;code&gt;curl ... --limit-rate 1000k &amp;amp;&lt;/code&gt; downloads the response at a rate of 1000k per second, running in the background (&lt;code&gt;&amp;amp;&lt;/code&gt;). At that rate, it will take our &lt;code&gt;curl&lt;/code&gt; call a bit over 2 minutes to download a 130MB file. At the same time, we’ll run another Apache Bench to see how things perform:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;curl http://0.0.0.0:9292/downloads -o /tmp/large.txt \
  --limit-rate 1000k &amp;amp;
	
ab -n 1 -c 1 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   0.056 seconds
Complete requests:      1
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Puma responds quickly. In this scenario, &lt;code&gt;curl&lt;/code&gt; is occupying one thread, and &lt;code&gt;ab&lt;/code&gt; occupies another. Let’s try running 4 curl commands:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-bash&#34; data-lang=&#34;bash&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; i in &lt;span style=&#34;color:#f92672&#34;&gt;{&lt;/span&gt;1..4&lt;span style=&#34;color:#f92672&#34;&gt;}&lt;/span&gt;; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt &lt;span style=&#34;color:#ae81ff&#34;&gt;\
&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;&lt;/span&gt;    --limit-rate 1000k &amp;amp;
&lt;span style=&#34;color:#66d9ef&#34;&gt;done&lt;/span&gt;
	
ab -n &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; -c &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; http://0.0.0.0:9292/downloads
	
Concurrency Level:      &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
Time taken &lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; tests:   0.059 seconds
Complete requests:      &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Puma still responds fine. &lt;code&gt;curl&lt;/code&gt; is now occupying 4 threads, and &lt;code&gt;ab&lt;/code&gt; uses the remaining 1 thread. Let’s add &lt;em&gt;one&lt;/em&gt; more request using &lt;code&gt;curl&lt;/code&gt;. We also increase the default timeout of &lt;code&gt;ab&lt;/code&gt; (which is 30) to &lt;em&gt;200&lt;/em&gt;, for no particular reason…&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-bash&#34; data-lang=&#34;bash&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; i in &lt;span style=&#34;color:#f92672&#34;&gt;{&lt;/span&gt;1..5&lt;span style=&#34;color:#f92672&#34;&gt;}&lt;/span&gt;; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt &lt;span style=&#34;color:#ae81ff&#34;&gt;\
&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;&lt;/span&gt;    --limit-rate 1000k &amp;amp;
&lt;span style=&#34;color:#66d9ef&#34;&gt;done&lt;/span&gt;
	
ab -n &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; -c &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; -s &lt;span style=&#34;color:#ae81ff&#34;&gt;200&lt;/span&gt; http://0.0.0.0:9292/downloads
	
Concurrency Level:      &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
Time taken &lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; tests:   124.477 seconds
Complete requests:      &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;strong&gt;Yikes&lt;/strong&gt;. That did &lt;strong&gt;not&lt;/strong&gt; go well. The moment we had 5 slow-running requests from our &lt;code&gt;curl&lt;/code&gt; calls, we saturated all available threads. Our 6th request using &lt;code&gt;ab&lt;/code&gt; sat around waiting, finally finishing 2 &lt;em&gt;minutes&lt;/em&gt; later!&lt;/p&gt;
&lt;p&gt;This is a &lt;strong&gt;critical&lt;/strong&gt; consideration - long-running web work is a throughput killer. Ideally, keep all work as fast as possible and offload long-running work to jobs/other services. Most commonly for a download you’d create a &lt;a href=&#34;https://docs.aws.amazon.com/AmazonS3/latest/userguide/ShareObjectPreSignedURL.html&#34;&gt;presigned URL&lt;/a&gt; for a service like S3 and redirect to that URL.&lt;/p&gt;
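&lt;p&gt;As a sketch of that approach (assuming the &lt;code&gt;aws-sdk-s3&lt;/code&gt; gem, configured credentials, and a hypothetical bucket name), the controller does a fast redirect and S3 handles the slow byte-shuffling:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;aws-sdk-s3&amp;quot;

class DownloadsController &amp;lt; ApplicationController
  def index
    object = Aws::S3::Object.new(&amp;quot;my-bucket&amp;quot;, &amp;quot;large.txt&amp;quot;)
    # The request finishes in milliseconds; S3 serves the actual download
    redirect_to object.presigned_url(:get, expires_in: 300),
      allow_other_host: true
  end
end
&lt;/code&gt;&lt;/pre&gt;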
&lt;p&gt;If you &lt;em&gt;need&lt;/em&gt; to run long-running IO, you need to allocate many more threads&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;h4 id=&#34;native-extensions&#34;&gt;Native extensions&lt;/h4&gt;
&lt;p&gt;In our &lt;a href=&#34;#long-running-cpu&#34;&gt;Long-running CPU&lt;/a&gt; example, we couldn’t get pure Ruby code to completely hog the runtime. We can prioritize a thread higher than other threads, but work still continues to be distributed.&lt;/p&gt;
&lt;p&gt;Once you start running native extensions, the Ruby runtime has more limited influence. It’s up to the extension to properly interface with Ruby and yield control back. Here’s a simple example that will block all other threads, using the standard &lt;code&gt;openssl&lt;/code&gt; gem and its C-implemented &lt;code&gt;pbkdf2_hmac&lt;/code&gt; function:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;openssl&amp;quot;

Thread.new do
  loop do
    puts &amp;quot;tick #{Time.now}&amp;quot;
  end
end

t = Thread.new do
  loop do
    OpenSSL::KDF.pbkdf2_hmac(&amp;quot;passphrase&amp;quot;, salt: &amp;quot;salt&amp;quot;, iterations: 12_800_000, length: 32, hash: &amp;quot;sha256&amp;quot;)
  end
end

t.join
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We have two threads running - one infinitely printing the time in a loop, and one infinitely calling &lt;code&gt;pbkdf2_hmac&lt;/code&gt; in a loop. Here I give &lt;code&gt;pbkdf2_hmac&lt;/code&gt; a ludicrous number of iterations to force the function to run, in C, for a long period of time:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;tick 2025-12-29 15:20:30 -0500
tick 2025-12-29 15:20:45 -0500
tick 2025-12-29 15:20:59 -0500
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The results show that despite each thread having the same priority, the thread running long-running C extension code hogs all of the runtime. The printing thread is only able to print a timestamp roughly every 15 seconds.&lt;/p&gt;
&lt;p&gt;There’s nothing the extension code is doing &lt;em&gt;wrong&lt;/em&gt; per se, but because it runs purely in C, without yielding back to Ruby until it’s done, Ruby can’t do anything to keep work distribution fair.&lt;/p&gt;
&lt;p&gt;In most cases, well-developed/mature native extensions won’t hit this issue. There are many popular gems that are, or include, native extensions. But if you &lt;em&gt;do&lt;/em&gt; hit an expensive path in a C extension, be aware the Ruby runtime will not be able to control it. If you &lt;em&gt;know&lt;/em&gt; you are interfacing with a slow piece of code in a native extension, keep it off the hot path, same as our long-running IO example.&lt;/p&gt;
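&lt;p&gt;If you truly can’t keep such a call off the hot path, one blunt option (a sketch, and &lt;code&gt;fork&lt;/code&gt; is POSIX-only) is to push the work into a child process. A thread blocked in &lt;code&gt;Process.wait&lt;/code&gt; yields like any other blocking IO, so sibling threads keep running:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;openssl&amp;quot;

pid = fork do
  # Hogs the CPU in C, but only inside this child process
  OpenSSL::KDF.pbkdf2_hmac(&amp;quot;passphrase&amp;quot;, salt: &amp;quot;salt&amp;quot;,
    iterations: 12_800_000, length: 32, hash: &amp;quot;sha256&amp;quot;)
end

# Only this waiting thread blocks; the rest of the process stays responsive
Process.wait(pid)
&lt;/code&gt;&lt;/pre&gt;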
&lt;h4 id=&#34;what-happened-to-puma&#34;&gt;What happened to our Puma server, anyways?&lt;/h4&gt;
&lt;p&gt;Remember our production panic scenario from earlier?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Not knowing what else to do, you trigger a server restart. Even doing that, things remain unresponsive for another 30 seconds. Finally you see the server stop, and start up again. As if by magic, everything is running fine again.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You triggered a server restart, and still had to wait 30 seconds? Why wasn’t Puma able to stop sooner? Let’s reuse our download example from earlier, and explain some default Puma behaviors!&lt;/p&gt;
&lt;p&gt;First, let’s start Puma, and see how quickly we can stop it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;bundle exec puma -t 5:5

Puma starting in single mode...
* Puma version: 7.1.0 (&amp;quot;Neon Witch&amp;quot;)
* Ruby version: ruby 4.0.0 (2025-12-25 revision 553f1675f3) +PRISM [arm64-darwin23]
*  Min threads: 5
*  Max threads: 5

### hit ctrl + c to issue a SIGINT signal

Gracefully stopping, waiting for requests to finish
=== puma shutdown: 2025-12-29 23:45:21 -0500 ===
- Goodbye!
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We instantly see two things:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;Gracefully stopping, waiting for requests to finish&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Goodbye!&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Puma tells us it is shutting down “gracefully”. With no activity, it is able to instantly stop.&lt;/p&gt;
&lt;p&gt;Now let’s use our &lt;code&gt;DownloadController&lt;/code&gt; again:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class DownloadsController &amp;lt; ApplicationController
  def index
    send_file Rails.root.join(&amp;quot;public/large.txt&amp;quot;),
      filename: &amp;quot;large.txt&amp;quot;,
      type: &amp;quot;application/octet-stream&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We start Puma again, then occupy each thread with a request:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;bundle exec puma -t 5:5

for i in {1..5}; do
  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt \
    --limit-rate 1000k &amp;amp;
done
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now let’s try issuing &lt;code&gt;INT&lt;/code&gt; using &lt;code&gt;ctrl+c&lt;/code&gt; again:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Puma starting in single mode...
...
Use Ctrl-C to stop

^C &amp;lt;-- ctrl + c

... ~120 seconds pass, all downloads finish ...

- Gracefully stopping, waiting for requests to finish
=== puma shutdown: 2025-12-30 00:04:23 -0500 ===
- Goodbye!
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Ok, so we issued our interruption. But… it waited for every request to &lt;em&gt;completely&lt;/em&gt; finish! We had to wait around 120 seconds before our server shut down. That’s even worse than earlier!&lt;/p&gt;
&lt;p&gt;This isn’t a blog post about Puma specifically, but I’ll discuss a few factors:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Running &lt;code&gt;puma -t 5:5&lt;/code&gt; puts you in Puma “single” mode. This is a lightweight way of running Puma, at the cost of less control. By default, single-mode Puma does not kill any requests. As it tells us, it simply is “waiting for requests to finish”&lt;/li&gt;
&lt;li&gt;Alternatively, you can run Puma in “cluster” mode. This is done by adding any worker count at all using the &lt;code&gt;-w&lt;/code&gt; flag. Even a &lt;code&gt;-w 1&lt;/code&gt; will move you into cluster mode. Cluster mode has a parent worker which monitors and controls child workers. This costs more memory, but it means the parent worker can kill child workers more reliably&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Let’s try this one more time, starting in cluster mode by setting &lt;code&gt;-w 1&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;bundle exec puma -t 5:5 -w 1

[39363] Puma starting in cluster mode...
[39363] *  Min threads: 5
[39363] *  Max threads: 5
[39363] *   Master PID: 39363
[39363] *      Workers: 1
[39363] Use Ctrl-C to stop
^C &amp;lt;-- ctrl + c
[45110] - Gracefully shutting down workers...

... ~30 seconds pass, downloads are cut early ...

[45110] === puma shutdown: 2025-12-30 00:24:21 -0500 ===
[45110] - Goodbye!
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This time we replicate our earlier behavior - 30 seconds pass, and the server has shut down. Now that we’re running in cluster mode, by default Puma uses a configuration called &lt;code&gt;worker_shutdown_timeout&lt;/code&gt;, which defaults to 30 seconds. If you have a configuration file, you can set it yourself to something longer or shorter:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;worker_shutdown_timeout 25 # instead of 30
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;As well, by default Puma &lt;em&gt;never&lt;/em&gt; kills threads. In a moment we’re going to be talking about ways to kill a thread - Puma plays it extremely safe, and offers &lt;em&gt;no&lt;/em&gt; ability to kill individual threads. Even when shutting down, it defaults to a thread shutdown policy of &lt;code&gt;:forever&lt;/code&gt;, which means threads are only ever killed when the server shuts down entirely, taking down the worker the threads live in.&lt;/p&gt;
&lt;p&gt;You &lt;em&gt;can&lt;/em&gt; change this. In the same configuration file you’d set &lt;code&gt;worker_shutdown_timeout&lt;/code&gt; you can set &lt;code&gt;force_shutdown_after&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;force_shutdown_after 1 # an integer, :forever, or :immediate
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Still - this doesn’t change much. It only takes effect during a full server shutdown. But with this setting, the internal Puma thread pool will &lt;code&gt;raise&lt;/code&gt; on all threads, and then eventually run &lt;code&gt;kill&lt;/code&gt; on them.&lt;/p&gt;
&lt;p&gt;What all this means is that by default in Puma:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The &lt;em&gt;primary&lt;/em&gt; way to fix misbehaving threads is to restart your server&lt;/li&gt;
&lt;li&gt;This will take up to 30 seconds running in cluster mode, unless you change the &lt;code&gt;worker_shutdown_timeout&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Puma can’t kill long-running or stuck threads without a full server shutdown. Technically there is &lt;a href=&#34;https://github.com/puma/puma#controlstatus-server&#34;&gt;a control server&lt;/a&gt; you can set up, which can manage individual workers. But it cannot kill individual threads/requests&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We’ll talk in-depth later about the available options for killing long-running threads in Puma.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 if you want to dig deeper into how Puma works, I highly recommend &lt;a href=&#34;https://dansvetlov.me/puma-internals/&#34;&gt;Dissecting Puma: Anatomy of a Ruby Web Server&lt;/a&gt;. The &lt;a href=&#34;https://github.com/puma/puma&#34;&gt;source of Puma&lt;/a&gt; is also pretty readable&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h4 id=&#34;dont-guess&#34;&gt;Don&#39;t guess, measure&lt;/h4&gt;
&lt;p&gt;All of these thread issues are &lt;em&gt;possibilities&lt;/em&gt;, but they mean nothing without empirical data. Measure, then decide your course of action.&lt;/p&gt;
&lt;h3 id=&#34;thread-shutdown&#34;&gt;Thread Shutdown&lt;/h3&gt;
&lt;p&gt;Ok, enough about &lt;em&gt;how&lt;/em&gt; threads get stuck. Once they’re stuck - is there a way to stop them?&lt;/p&gt;
&lt;h4 id=&#34;thread-raise-kill&#34;&gt;&lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-i-raise&#34;&gt;&lt;code&gt;raise&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-i-kill&#34;&gt;&lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/dd43f6992d.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;You want to kill… me? 🥺&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;⚠️ TL;DR You shouldn’t use these methods unless you &lt;em&gt;really&lt;/em&gt; know what you’re doing. Instead, &lt;a href=&#34;#interrupt-safely&#34;&gt;interrupt your thread safely&lt;/a&gt;. Incidentally, you should also &lt;a href=&#34;#dont-use-timeout&#34;&gt;avoid the timeout module&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;If you’re writing a generic threaded framework you may need them - for custom one-off threads you can probably manage without them.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Sometimes a thread is running and you need to shut it down. There are two primary methods for achieving that: &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;raise&lt;/code&gt; will raise an error inside of the target thread. If the thread hasn’t started yet, in most cases it is killed before running anything:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;never runs&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;knock it off&amp;#34;&lt;/span&gt;)
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: knock it off&amp;#34;, error=#&amp;lt;RuntimeError: knock it off&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 &lt;code&gt;thread_status&lt;/code&gt; is the helper we defined in the “Thread API” section on &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#thread-statuses&#34;&gt;&lt;code&gt;status&lt;/code&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The error isn’t raised instantly - only at the point the thread is scheduled next. We &lt;code&gt;sleep 0.01&lt;/code&gt; to give thread &lt;code&gt;t&lt;/code&gt; an opportunity to start. The thread scheduler starts it, and it immediately raises our “knock it off” error, effectively running right &lt;em&gt;before&lt;/em&gt; &lt;code&gt;puts &amp;quot;never runs&amp;quot;&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;If the thread gets a chance to start, the error will be raised on whatever line happened to be running last:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  sleep 5
end
t.join(1)
# We&#39;re only one second into the thread&#39;s sleep at this point, so knock it off is raised from the `sleep 5` line
t.raise(&amp;quot;knock it off, sleepyhead!&amp;quot;)
sleep 0.1
puts thread_status(t)
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: knock it off, sleepyhead!&amp;quot;, error=#&amp;lt;RuntimeError: knock it off, sleepyhead!&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Because it raises whatever error is provided (a &lt;code&gt;RuntimeError&lt;/code&gt; if just a string is provided), we can actually rescue the error, ignore it, and &lt;a href=&#34;https://docs.ruby-lang.org/en/master/syntax/exceptions_rdoc.html&#34;&gt;&lt;code&gt;retry&lt;/code&gt;&lt;/a&gt; 😱:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class KnockItOffError &amp;lt; StandardError; end

t = Thread.new do
  sleep 5
  puts &amp;quot;✌️&amp;quot;
rescue KnockItOffError =&amp;gt; e
  puts &amp;quot;Nice try #{e}&amp;quot;
  retry
end

t.join(1)
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
t.join

puts thread_status(t)
# Nice try 👊 
# ✌️
#&amp;lt;ThreadStatus status=&amp;quot;finished&amp;quot; error=nil&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;With &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;, issues start to creep in when errors are raised in the middle of an &lt;code&gt;ensure&lt;/code&gt;. Here we use a &lt;code&gt;ConditionVariable&lt;/code&gt; (we dug into those in &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#condition-variable&#34;&gt;The Thread API&lt;/a&gt;) to guarantee we raise from the &lt;code&gt;ensure&lt;/code&gt; block:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;✌️&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#75715e&#34;&gt;# Signal the condition.wait in the main thread&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
    &lt;span style=&#34;color:#75715e&#34;&gt;# ...perform some cleanup...&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;I&amp;#39;ll never fire 😔&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# This will wait until our Thread ensure runs and signals us&lt;/span&gt;
  condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊&amp;#34;&lt;/span&gt;))
	
mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# We&amp;#39;ve enqueued our error, now signal so condition#wait fires in the ensure block&lt;/span&gt;
  condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34;, error=#&amp;lt;KnockItOffError: 👊&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We don’t see “I’ll never fire 😔”. What happens to our cleanup? Shouldn’t &lt;code&gt;ensure&lt;/code&gt;, erm, umm, &lt;em&gt;ensure&lt;/em&gt; that things finish…&lt;/p&gt;
&lt;p&gt;Moving on from &lt;code&gt;raise&lt;/code&gt;, &lt;code&gt;kill&lt;/code&gt; stops the thread from running any more instructions, no matter what it’s doing. &lt;code&gt;raise&lt;/code&gt; can be &lt;code&gt;rescue&lt;/code&gt;’d, &lt;code&gt;kill&lt;/code&gt; can’t.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;✌️&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Exception is the root of the Error class hierarchy. If you can&amp;#39;t rescue it with this, you can&amp;#39;t rescue it &lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;rescue&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Exception&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;#kill cannot be stopped...&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;This will still run&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;# A #kill&amp;#39;d thread gives no indication it was terminated 😔&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;finished&amp;#34;, error=nil&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 &lt;em&gt;Technically&lt;/em&gt;, &lt;code&gt;kill&lt;/code&gt; can be ignored; we’ll explain that when discussing &lt;code&gt;handle_interrupt&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;⚠️ Don’t rescue &lt;code&gt;Exception&lt;/code&gt;, it&amp;rsquo;s a bad idea and you could accidentally rescue things like a &lt;code&gt;NoMemoryError&lt;/code&gt; 😬&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Because &lt;code&gt;kill&lt;/code&gt; doesn’t raise an error, you actually can’t even tell that the thread was &lt;code&gt;kill&lt;/code&gt;ed. We just get the normal &lt;code&gt;false&lt;/code&gt; &lt;code&gt;status&lt;/code&gt;, represented in our example by “finished”.&lt;/p&gt;
&lt;p&gt;Like &lt;code&gt;raise&lt;/code&gt;, &lt;code&gt;kill&lt;/code&gt; can also disrupt your &lt;code&gt;ensure&lt;/code&gt; blocks:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;✌️&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
    &lt;span style=&#34;color:#75715e&#34;&gt;# perform some cleanup&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;I&amp;#39;ll never fire 😔&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill
	
mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;# ✌️&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;finished&amp;#34; error=nil&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There are a few aliases for &lt;code&gt;kill&lt;/code&gt; to be aware of as well:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;exit &lt;span style=&#34;color:#75715e&#34;&gt;# internally gets the current thread, and calls `kill`&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill(t) &lt;span style=&#34;color:#75715e&#34;&gt;# class method `kill`&lt;/span&gt;
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;exit        &lt;span style=&#34;color:#75715e&#34;&gt;# alias for `kill`&lt;/span&gt;
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;terminate   &lt;span style=&#34;color:#75715e&#34;&gt;# alias for `kill`&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Case closed. Feel free to use &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt; on your threads. No harm no foul… oh what’s this here?&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/05bee6d9-e6ae-4398-bde8-8f0bc387e600.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html&#34;&gt;Ruby&amp;rsquo;s Thread#raise, Thread#kill, timeout.rb, and net/protocol.rb libraries are broken&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/952b1448-5504-4072-a1b6-ad7ef9d548de.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://jvns.ca/blog/2015/11/27/why-rubys-timeout-is-dangerous-and-thread-dot-raise-is-terrifying/&#34;&gt;Why Ruby’s Timeout is dangerous (and Thread.raise is terrifying)&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/ebeeaeb7-9cd9-4586-88cf-a17e305cc962.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://www.schneems.com/2017/02/21/the-oldest-bug-in-ruby-why-racktimeout-might-hose-your-server/&#34;&gt;The Oldest Bug In Ruby - Why Rack::Timeout Might Hose your Server&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/7b8d7333-60d7-4c2f-907c-d571ab76fa26.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/&#34;&gt;Timeout: Ruby’s Most Dangerous API&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Oh…&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://media1.tenor.com/m/MYZgsN2TDJAAAAAC/this-is.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Strangely, the thread docs say &lt;em&gt;nothing&lt;/em&gt; about the dangers of these methods&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;. These articles are from 2008, 2015 and 2017. Surely no one uses them anymore, considering all that?&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/16ceaf75-1492-4520-b80e-d88e626580e0.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Nah.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In fairness to threaded gems that use these methods, they are using the official way you shut down a thread. And they’re usually taking as many precautions as possible prior to calling them. For the most part, gems use them as a shutdown mechanism, and give plenty of room for the thread to finish normally first.&lt;/p&gt;
&lt;p&gt;The basic problem is this: &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt; force your code to die at any point, with no guarantee of properly cleaning up.&lt;/p&gt;
&lt;p&gt;You might ask: “Couldn’t &lt;code&gt;ctrl+c&lt;/code&gt; do the same thing?” Yes, an OS signal can kill your process before an &lt;code&gt;ensure&lt;/code&gt; runs, but then all related state dies with it - that can cause other issues, but at least your program cannot limp along in a corrupted state.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ensure
  # but, but Ruby - you _promised_ me this would run 😭
  @value = false
end
&lt;/code&gt;&lt;/pre&gt;
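&lt;p&gt;To make the window concrete, here’s a sketch of the classic race - &lt;code&gt;process&lt;/code&gt; is a hypothetical stand-in for your real work. An asynchronous &lt;code&gt;raise&lt;/code&gt; or &lt;code&gt;kill&lt;/code&gt; can land on almost any line:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def copy_data(path)
  file = File.open(path) # a Thread#raise can land right here -
                         # after open succeeds, before begin is entered
  begin
    process(file)        # - or anywhere in here, mid-operation
  ensure
    file.close           # Thread#kill can even interrupt the ensure itself
  end
end
&lt;/code&gt;&lt;/pre&gt;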
&lt;p&gt;So are they pure evil? An occasional necessity? Somewhere in between? I’ll leave that discussion to the code philosophers… in the practical realm, follow these rules:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 A small slice of this next section may look familiar. I included a bit of it in &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#thread-raise-kill&#34;&gt;The Thread API&lt;/a&gt;. This goes &lt;em&gt;much&lt;/em&gt; more in-depth&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ol&gt;
&lt;li&gt;Don’t use &lt;code&gt;Thread.kill&lt;/code&gt;, &lt;code&gt;kill&lt;/code&gt; or &lt;code&gt;raise&lt;/code&gt; unless you really, really know what you’re doing. Same applies for the &lt;code&gt;kill&lt;/code&gt; aliases &lt;code&gt;exit&lt;/code&gt; and &lt;code&gt;terminate&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;If you need to stop a thread, you want it to tear down safely. Make it safely interruptible by adding in a condition that can allow your thread to finish early&lt;/li&gt;
&lt;li&gt;Perform resource cleanups in an &lt;code&gt;ensure&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Don’t use the &lt;code&gt;timeout&lt;/code&gt; module&lt;/li&gt;
&lt;li&gt;If you use &lt;code&gt;rack-timeout&lt;/code&gt;, you really should use &lt;code&gt;term_on_timeout&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;When you use threads, even implicitly (e.g., via configuration in Puma, Sidekiq, Falcon and SolidQueue), you can end up in weird states from them shutting down. It should be rare, but misbehaving threads (very long transactions, runaway CPU) are the most likely to experience this. Pay attention to long-running queries or runaway CPU usage and treat them as important bugs&lt;/li&gt;
&lt;li&gt;If you have something very critical that must properly cleanup 100% of the time, you need &lt;a href=&#34;#&#34;&gt;&lt;code&gt;handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;h4 id=&#34;dont-kill-threads&#34;&gt;(1) Don&#39;t kill threads&lt;/h4&gt;
&lt;p&gt;I’m watching you. Step away from that method, slowly, and no threads have to get hurt.&lt;/p&gt;
&lt;h4 id=&#34;interrupt-safely&#34;&gt;(2) Interrupt your thread safely&lt;/h4&gt;
&lt;p&gt;Instead of killing your thread, set it up to be interruptible. Most mature, threaded frameworks operate this way.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;concurrent&amp;quot;

still_kickin = Concurrent::AtomicBoolean.new(true)
Thread.new do
  while still_kickin.true?
    # more work!
  end
end

still_kickin.make_false
&lt;/code&gt;&lt;/pre&gt;
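&lt;p&gt;If you’d rather not pull in &lt;code&gt;concurrent-ruby&lt;/code&gt;, here’s a minimal sketch of the same idea using only the standard library - it assumes Ruby 3.2+, where &lt;code&gt;Queue#pop&lt;/code&gt; accepts a &lt;code&gt;timeout:&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;stop_signal = Queue.new

worker = Thread.new do
  loop do
    # one unit of work...
    # pop returns nil on timeout, or :stop once we signal
    break if stop_signal.pop(timeout: 0.1)
  end
end

stop_signal &amp;lt;&amp;lt; :stop
worker.join
&lt;/code&gt;&lt;/pre&gt;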
&lt;h4 id=&#34;ensure-cleanup&#34;&gt;(3) Ensure cleanup&lt;/h4&gt;
&lt;p&gt;Whenever you &lt;em&gt;need&lt;/em&gt; something to run before a method finishes, you should always use an &lt;code&gt;ensure&lt;/code&gt; block. &lt;code&gt;ensure&lt;/code&gt; is kind of like a method lifeguard - even if something goes wrong, it’s there for you. It’s the place code goes to &lt;em&gt;ensure&lt;/em&gt; it’s run before the method finishes (even when an error is raised).&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def read_some_data
  read_it
  close_it # bad
end

def read_some_data
  read_it
ensure
  close_it # good
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We &lt;em&gt;know&lt;/em&gt; &lt;code&gt;ensure&lt;/code&gt; is not a silver bullet. &lt;code&gt;Thread#raise&lt;/code&gt; and &lt;code&gt;Thread#kill&lt;/code&gt; do not respect it. But an &lt;code&gt;ensure&lt;/code&gt; still gives you the best chance of cleaning things up.&lt;/p&gt;
&lt;h4 id=&#34;dont-use-timeout&#34;&gt;(4) Don’t use &lt;code&gt;timeout&lt;/code&gt;&lt;/h4&gt;
&lt;p&gt;If you see this in code, be concerned:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;timeout&amp;quot;

Timeout.timeout(1) do
  # 😱 
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;For some reason, the &lt;a href=&#34;https://github.com/ruby/timeout&#34;&gt;&lt;code&gt;timeout&lt;/code&gt;&lt;/a&gt; gem itself doesn’t warn about any issues. But &lt;a href=&#34;https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/&#34;&gt;Mike Perham summarizes it best&lt;/a&gt;:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/727697a3-b461-4705-a1f1-8d0ed257857e.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;There’s nothing that exactly matches what timeout offers: a blanket way of timing out &lt;em&gt;any&lt;/em&gt; operation after the specified time limit. But most gems and Ruby features offer a way to be interrupted - there is a repository called &lt;a href=&#34;https://github.com/ankane/the-ultimate-guide-to-ruby-timeouts&#34;&gt;The Ultimate Guide to Ruby Timeouts&lt;/a&gt; which details everything you need to know. It shows you how to set timeouts safely for basically &lt;em&gt;every&lt;/em&gt; blocking operation you could care about timing out. For instance, how to properly handle timeouts using the &lt;a href=&#34;https://github.com/redis/redis-rb&#34;&gt;&lt;code&gt;redis&lt;/code&gt;&lt;/a&gt; gem:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Redis&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(
  &lt;span style=&#34;color:#e6db74&#34;&gt;connect_timeout&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, 
  &lt;span style=&#34;color:#e6db74&#34;&gt;timeout&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The one piece mentioned in that repository you should leave alone: &lt;code&gt;Net::HTTP&lt;/code&gt;’s &lt;code&gt;open_timeout&lt;/code&gt;. Behind the scenes it uses the &lt;code&gt;timeout&lt;/code&gt; module 🙅‍♂️. Leave the 60-second default; it should almost never impact you, and you’re probably worse off lowering it.&lt;/p&gt;
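&lt;p&gt;The other &lt;code&gt;Net::HTTP&lt;/code&gt; timeouts are fine to tune, since they don’t go through the &lt;code&gt;timeout&lt;/code&gt; module. A quick sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;net/http&amp;quot;

http = Net::HTTP.new(&amp;quot;example.com&amp;quot;, 443)
http.use_ssl = true
http.read_timeout = 5  # raises Net::ReadTimeout, no timeout module involved
http.write_timeout = 5 # Ruby 2.6+
# http.open_timeout    # leave this one at its 60 second default
&lt;/code&gt;&lt;/pre&gt;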
&lt;p&gt;People primarily use the &lt;code&gt;timeout&lt;/code&gt; gem to manage IO timeouts. In the unlikely case you want to time out CPU-bound code, it’s up to you to implement the check in your own processing.&lt;/p&gt;
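&lt;p&gt;In practice that means checking a deadline between units of work - a sketch, with hypothetical &lt;code&gt;items&lt;/code&gt; and &lt;code&gt;crunch&lt;/code&gt; standing in for the real processing:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;deadline = Process.clock_gettime(Process::CLOCK_MONOTONIC) + 5

items.each do |item|
  if Process.clock_gettime(Process::CLOCK_MONOTONIC) &amp;gt; deadline
    raise &amp;quot;work exceeded its 5 second budget&amp;quot;
  end

  crunch(item) # CPU-bound work, checked between iterations
end
&lt;/code&gt;&lt;/pre&gt;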
&lt;h4 id=&#34;use-term-on-timeout&#34;&gt;(5) If you use &lt;a href=&#34;https://github.com/zombocom/rack-timeout&#34;&gt;&lt;code&gt;rack-timeout&lt;/code&gt;&lt;/a&gt;, you really should use &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/doc/settings.md#term-on-timeout&#34;&gt;&lt;code&gt;term_on_timeout&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;rack-timeout&lt;/code&gt; works similarly to the &lt;code&gt;timeout&lt;/code&gt; module. And I already told you not to use that. So what gives? It will call &lt;code&gt;raise&lt;/code&gt; on your threads - isn’t that bad?&lt;/p&gt;
&lt;p&gt;The short answer is yes, it’s still bad.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;But&lt;/em&gt;, &lt;code&gt;rack-timeout&lt;/code&gt; is the only real option you have for timing out a web request in Puma. It’s meant as a last resort. From their docs:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;code&gt;rack-timeout&lt;/code&gt; is not a solution to the problem of long-running requests, it&amp;rsquo;s a debug and remediation tool. App developers should track rack-timeout&amp;rsquo;s data and address recurring instances of particular timeouts, for example by refactoring code so it runs faster or offsetting lengthy work to happen asynchronously.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;On top of that, you should set your own lower-level timeouts so they fire before &lt;code&gt;rack-timeout&lt;/code&gt; does.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;You&amp;rsquo;ll want to set all relevant timeouts to something lower than &lt;code&gt;Rack::Timeout&lt;/code&gt;&amp;rsquo;s &lt;code&gt;service_timeout&lt;/code&gt;. Generally you want them to be at least 1s lower, so as to account for time spent elsewhere during the request&amp;rsquo;s lifetime while still giving libraries a chance to time out before Rack::Timeout.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The core issue of any thread &lt;code&gt;raise&lt;/code&gt;/&lt;code&gt;kill&lt;/code&gt; based solution is corrupted state. When using &lt;code&gt;rack-timeout&lt;/code&gt;, you should be using &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/doc/settings.md#term-on-timeout&#34;&gt;&lt;code&gt;term_on_timeout&lt;/code&gt;&lt;/a&gt;, ideally set to &lt;code&gt;1&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;term_on_timeout&lt;/code&gt; will send a &lt;code&gt;SIGTERM&lt;/code&gt; to the worker the thread is running in, which for most servers indicates a need for a graceful shutdown of that process - any potentially corrupted state is isolated to that process and will be cleaned up once the process is shut down.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;term_on_timeout&lt;/code&gt; only works properly if you’ve got multiple processes serving your requests. And if you get lots and lots of timeouts, it could potentially cause performance problems. See the docs for proper configuration!&lt;/p&gt;
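&lt;p&gt;As a sketch of what that might look like in a Rails initializer - check the &lt;code&gt;rack-timeout&lt;/code&gt; docs for your version, since settings can also come from environment variables like &lt;code&gt;RACK_TIMEOUT_TERM_ON_TIMEOUT&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# config/initializers/rack_timeout.rb
Rails.application.config.middleware.insert_before(
  Rack::Runtime, Rack::Timeout,
  service_timeout: 15, # seconds before the request is interrupted
  term_on_timeout: 1   # SIGTERM the worker on the first timeout
)
&lt;/code&gt;&lt;/pre&gt;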
&lt;p&gt;There &lt;em&gt;is&lt;/em&gt; an alternative idea floating around for achieving a “Safer Timeout”, at least in Rails apps: &lt;a href=&#34;https://web.meetcleo.com/blog/safer-timeouts&#34;&gt;https://web.meetcleo.com/blog/safer-timeouts&lt;/a&gt;. Maybe I’ll detail it more in the future, but in the meantime, if you’re in a Rails app I would give it a read.&lt;/p&gt;
&lt;h4 id=&#34;monitor-your-thread-costs&#34;&gt;(6) Monitor your thread cost&lt;/h4&gt;
&lt;p&gt;Having threads that do not stop easily is a bug. If you’re seeing rack-timeout errors, or jobs that can’t be shut down, track them down and prioritize fixing them like any other important bug - allocate real time to the fix.&lt;/p&gt;
&lt;h4 id=&#34;get-a-handle&#34;&gt;(7) Get a &lt;code&gt;handle_interrupt&lt;/code&gt; on things&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt; is One Weird Trick &lt;code&gt;Thread#kill&lt;/code&gt; Calls Don’t Want You To Know™. If we’re gonna discuss it, might as well go deep…&lt;/p&gt;
&lt;h4 id=&#34;thread-handle-interrupt&#34;&gt;&lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-c-handle_interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;A thread can be externally “interrupted” by a few things:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;Thread#kill&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Thread#raise&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Your program being exited&lt;/li&gt;
&lt;li&gt;A signal, like Ctrl+C&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;code&gt;handle_interrupt&lt;/code&gt; gives you the ability to control how your program reacts to 1-3. And it means you can define blocks of code which will &lt;em&gt;guarantee&lt;/em&gt; their &lt;code&gt;ensure&lt;/code&gt; blocks run.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;handle_interrupt&lt;/code&gt; is a low-level interface and it’s also the one you are least likely to ever need. You’ll see it used in things like threaded web and job servers where low-level control and better cleanup guarantees are helpful. You’ll find examples of it in Sidekiq, the Async fiber scheduler, Homebrew, the parallel gem and more.&lt;/p&gt;
&lt;p&gt;When you need the strongest guarantees possible about cleaning up your code in response to “interruption”, &lt;code&gt;handle_interrupt&lt;/code&gt; is what you need.&lt;/p&gt;
&lt;p&gt;Let’s look at a simple example:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class KnockItOffError &amp;lt; StandardError; end

t = Thread.new do
  sleep 2
  puts &amp;quot;done!&amp;quot;
end
t.join(1)
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
sleep 0.1

puts thread_status(t)
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: 👊&amp;quot; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Run that code 👆 and you’ll never see “done!” print. This is the same type of code we saw in the &lt;a href=&#34;#thread-raise-kill&#34;&gt;&lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;&lt;/a&gt; section. What can &lt;code&gt;handle_interrupt&lt;/code&gt; do for us?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(KnockItOffError =&amp;gt; :never) do
    sleep 2
    puts &amp;quot;done!&amp;quot;
  end
end

t.join(1)
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
sleep 0.1

puts thread_status(t)
# done!
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: 👊&amp;quot; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now we see “done!” printed! To be clear, the error will still be raised &lt;em&gt;eventually&lt;/em&gt;. &lt;code&gt;handle_interrupt&lt;/code&gt; only protects the section it encloses, so the error is raised as soon as the block exits.&lt;/p&gt;
&lt;p&gt;What’s with the interface - what does &lt;code&gt;KnockItOffError =&amp;gt; :never&lt;/code&gt; mean? Let’s break it down:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;handle_interrupt&lt;/code&gt; takes a hash. Each key is an exception class object, and each value is a symbol representing how to respond to the exception&lt;/li&gt;
&lt;li&gt;&lt;code&gt;KnockItOffError&lt;/code&gt; in our example represents the class that will be handled. Descendants are matched too, so the handler applies to &lt;code&gt;KnockItOffError&lt;/code&gt; and any of its subclasses.&lt;/li&gt;
&lt;li&gt;There are three different symbols allowed as values:
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;:never&lt;/code&gt; indicates that the exception will “never” interrupt any of the code inside of the block.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;:on_blocking&lt;/code&gt; indicates that the exception can only be raised during a “blocking” operation. This includes things like IO read/write, &lt;code&gt;sleep&lt;/code&gt;, and waiting on mutexes. Anything that releases the GVL.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;:immediate&lt;/code&gt; indicates that the exception should be handled “immediately”. This is effectively the default behavior, so you would generally use this to re-apply an exception ignored at a higher level.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Based on that knowledge, let’s demonstrate a more complex example:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:never&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#75715e&#34;&gt;# KnockItOffError can &amp;#34;never&amp;#34; run here&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:immediate&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# KnockItOffError runs &amp;#34;immediate&amp;#34;ly here&lt;/span&gt;
      sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;done!&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# KnockItOffError can &amp;#34;never&amp;#34; run here&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Can&amp;#39;t touch this!&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊&amp;#34;&lt;/span&gt;))
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;# Can&amp;#39;t touch this!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In this example, “done!” is never printed because it is in the &lt;code&gt;:immediate&lt;/code&gt; block. But we successfully print the “Can’t touch this!” message in our ensure, because we’re within the &lt;code&gt;:never&lt;/code&gt; block for &lt;code&gt;KnockItOffError&lt;/code&gt;. &lt;code&gt;ensure&lt;/code&gt; is now… &lt;em&gt;ensured&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;We’ve used &lt;code&gt;:never&lt;/code&gt; and &lt;code&gt;:immediate&lt;/code&gt;, what about &lt;code&gt;:on_blocking&lt;/code&gt;?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(
    &lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:on_blocking&lt;/span&gt;
  ) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#ae81ff&#34;&gt;1_000_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👋&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊&amp;#34;&lt;/span&gt;))
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;i: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;i&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;# i: 1000000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Our increments work fine, as indicated by &lt;code&gt;i&lt;/code&gt; being one million. But our &lt;code&gt;puts&lt;/code&gt; is a “blocking” call so it gets the boot.&lt;/p&gt;
&lt;p&gt;Should we have used a thread-safe counter? Let’s try it again using &lt;code&gt;Concurrent::AtomicFixnum&lt;/code&gt; from &lt;code&gt;concurrent-ruby&lt;/code&gt;, and two threads. We should see &lt;code&gt;i&lt;/code&gt; as two million afterwards:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;concurrent&amp;#34;&lt;/span&gt;
	
i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Concurrent&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;AtomicFixnum&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;incrementing_thread&lt;/span&gt;(i)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(
      &lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:on_blocking&lt;/span&gt;
    ) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#ae81ff&#34;&gt;1_000_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
        i&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;increment
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👋&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; incrementing_thread(i)
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; incrementing_thread(i)
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;)
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊&amp;#34;&lt;/span&gt;))
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊👊&amp;#34;&lt;/span&gt;))
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;i: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts thread_status(t)
puts thread_status(t2)
&lt;span style=&#34;color:#75715e&#34;&gt;# i: 1237719&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Wait, why? &lt;code&gt;i&lt;/code&gt; is &lt;code&gt;1237719&lt;/code&gt;? Why is &lt;code&gt;i&lt;/code&gt; not two million? What blocked?!&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 You’ll definitely see a different number. Sometimes you’ll see &lt;code&gt;1000000&lt;/code&gt;, sometimes you’ll see a higher number, but you’ll pretty much never see &lt;code&gt;2000000&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;As it turns out, &lt;code&gt;Concurrent::AtomicFixnum&lt;/code&gt; uses a &lt;code&gt;Mutex&lt;/code&gt; by default. If a &lt;code&gt;Mutex&lt;/code&gt; waits to acquire a lock it is considered a blocking operation! That means it qualifies for &lt;code&gt;:on_blocking&lt;/code&gt; and the error gets raised.&lt;/p&gt;
&lt;p&gt;As a specific fix for &lt;code&gt;AtomicFixnum&lt;/code&gt;, if you install the &lt;code&gt;concurrent-ruby-ext&lt;/code&gt; gem then you get native extensions which are lock-free, no longer use a &lt;code&gt;Mutex&lt;/code&gt;, and properly run our code.&lt;/p&gt;
&lt;p&gt;Once we install &lt;code&gt;concurrent-ruby-ext&lt;/code&gt;, we properly get &lt;code&gt;2000000&lt;/code&gt;!:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# You ran `gem install concurrent-ruby-ext`&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# It automatically loads if present&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;i: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts thread_status(t)
puts thread_status(t2)
&lt;span style=&#34;color:#75715e&#34;&gt;# i: 2000000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;But we also know that a &lt;code&gt;Mutex&lt;/code&gt; or any other locking/waiting behavior can cause our &lt;code&gt;:on_blocking&lt;/code&gt; interrupt to fire. So &lt;code&gt;:on_blocking&lt;/code&gt; can behave surprisingly if some internal detail of the code you call changes later.&lt;/p&gt;
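&lt;p&gt;Condensed down, the surprise looks something like this sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;LOCK = Mutex.new
i = 0

Thread.handle_interrupt(KnockItOffError =&amp;gt; :on_blocking) do
  i += 1              # pure CPU - a pending interrupt stays deferred
  LOCK.synchronize {} # but if another thread holds LOCK, waiting here
                      # releases the GVL and the interrupt can fire
end
&lt;/code&gt;&lt;/pre&gt;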
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;If the thread hasn’t started yet, &lt;code&gt;handle_interrupt&lt;/code&gt; won’t help you. The error will be raised immediately in the thread, before &lt;code&gt;handle_interrupt&lt;/code&gt; can be called:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(
    KnockItOffError =&amp;gt; :never
  ) do
    puts &amp;quot;welcome!&amp;quot;
    puts &amp;quot;later 👋&amp;quot;
  end
end
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
sleep 1
puts thread_status(t)
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: 👊&amp;quot; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
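&lt;p&gt;If you control both sides, one way to close that gap - a sketch, not an official pattern - is to have the thread signal once its handler is installed, and wait for that signal before you ever &lt;code&gt;raise&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ready = Queue.new

t = Thread.new do
  Thread.handle_interrupt(KnockItOffError =&amp;gt; :never) do
    ready &amp;lt;&amp;lt; true # handler installed, now safe to interrupt
    puts &amp;quot;welcome!&amp;quot;
    puts &amp;quot;later 👋&amp;quot;
  end
end

ready.pop # block until the handler is in place
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
# both lines print; the error is raised when the block exits
&lt;/code&gt;&lt;/pre&gt;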
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;What happens after &lt;code&gt;handle_interrupt&lt;/code&gt;? Once the error is allowed to raise, code directly after it won’t run:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(
    KnockItOffError =&amp;gt; :never
  ) do
    puts &amp;quot;can&#39;t stop won&#39;t stop&amp;quot;
    Thread.handle_interrupt(
      KnockItOffError =&amp;gt; :immediate
    ) do
      sleep 2
      puts &amp;quot;this won&#39;t run&amp;quot;
    end
    puts &amp;quot;this won&#39;t run either&amp;quot;
  end
end

t.join(1)
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
sleep 0.1
puts thread_status(t)
# can&#39;t stop won&#39;t stop
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: 👊&amp;quot; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;But code after the inner &lt;code&gt;handle_interrupt&lt;/code&gt; &lt;em&gt;could&lt;/em&gt; run; it just depends on whether the inner block raises. In this example, &lt;em&gt;all&lt;/em&gt; of the code runs successfully because we don’t raise an error during the inner block:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(
    KnockItOffError =&amp;gt; :never
  ) do
    puts &amp;quot;can&#39;t stop won&#39;t stop&amp;quot;
    Thread.handle_interrupt(
      KnockItOffError =&amp;gt; :immediate
    ) do
      sleep 2
      puts &amp;quot;this won&#39;t run&amp;quot;
    end
    sleep 2
    puts &amp;quot;this won&#39;t run either&amp;quot;
  end
end

t.join(3)
t.raise(KnockItOffError.new(&amp;quot;👊&amp;quot;))
t.join rescue nil # a plain join would re-raise the thread&#39;s error
puts thread_status(t)
# can&#39;t stop won&#39;t stop
# this won&#39;t run
# this won&#39;t run either
#&amp;lt;ThreadStatus status=&amp;quot;failed w/ error: 👊&amp;quot; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;But you’re better off guaranteeing that code after the block runs. Use an &lt;code&gt;ensure&lt;/code&gt; so your code still runs even if the inner block raises an error:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(
    &lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:never&lt;/span&gt;
  ) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;can&amp;#39;t stop won&amp;#39;t stop&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;handle_interrupt(
      &lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:immediate&lt;/span&gt;
    ) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;this won&amp;#39;t run&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;this will consistently run&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;raise(&lt;span style=&#34;color:#66d9ef&#34;&gt;KnockItOffError&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;👊&amp;#34;&lt;/span&gt;))
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;# can&amp;#39;t stop won&amp;#39;t stop&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# this will consistently run&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: 👊&amp;#34; error=#&amp;lt;KnockItOffError: 👊&amp;gt;&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;Can we even stop the unstoppable &lt;code&gt;Thread#kill&lt;/code&gt;? Yep! From the &lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-c-handle_interrupt-label-Inheritance+with+ExceptionClass&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt; docs:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;For handling all interrupts, use Object and not Exception as the ExceptionClass, as kill/terminate interrupts are not handled by Exception.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So we can handle it - but we have to use &lt;code&gt;Object&lt;/code&gt;, which looks a bit odd but works well:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(Object =&amp;gt; :never) do
    sleep 2
    puts &amp;quot;done!&amp;quot;
  end
end

t.join(1)
t.kill
t.join
puts thread_status(t)
# done!
#&amp;lt;ThreadStatus status=&amp;quot;finished&amp;quot; error=nil&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The reason for this odd syntax is that the &lt;code&gt;kill&lt;/code&gt;/&lt;code&gt;terminate&lt;/code&gt; interrupts are internally handled &lt;a href=&#34;https://bugs.ruby-lang.org/issues/15735&#34;&gt;not as Exception instances, but as integers&lt;/a&gt;. That means this would also work:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  Thread.handle_interrupt(
    Integer =&amp;gt; :never
  ) do
    sleep 2
    puts &amp;quot;done!&amp;quot;
  end
end

t.join(1)
t.kill
t.join
puts thread_status(t)
# done!
#&amp;lt;ThreadStatus status=&amp;quot;finished&amp;quot; error=nil&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Still, you’re better off using &lt;code&gt;Object&lt;/code&gt; so you aren’t relying on an implementation detail.&lt;/p&gt;
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;Can we stop the &lt;code&gt;timeout&lt;/code&gt; gem from raising at a bad time using &lt;code&gt;handle_interrupt&lt;/code&gt;? The Thread API docs used to &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Thread.html#method-c-handle_interrupt-label-Guarding+from+Timeout-3A-3AError&#34;&gt;specifically use &lt;code&gt;timeout&lt;/code&gt; as a use-case for &lt;code&gt;handle_interrupt&lt;/code&gt;&lt;/a&gt;, but &lt;a href=&#34;https://github.com/ruby/timeout/issues/41&#34;&gt;there’s a non-determinism bug&lt;/a&gt; around thread reuse within the &lt;code&gt;timeout&lt;/code&gt; gem.&lt;/p&gt;
&lt;p&gt;So once again, &lt;a href=&#34;#dont-use-timeout&#34;&gt;don’t use the &lt;code&gt;timeout&lt;/code&gt; gem&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I &lt;a href=&#34;https://github.com/ruby/ruby/pull/11474&#34;&gt;removed the example from the docs&lt;/a&gt; because it’s too broken, so on Ruby 3.4+, the docs no longer mention &lt;code&gt;handle_interrupt&lt;/code&gt; with the &lt;code&gt;timeout&lt;/code&gt; gem.&lt;/p&gt;
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;We’ve looked at many &lt;code&gt;handle_interrupt&lt;/code&gt; examples - what do real gems use it for?&lt;/p&gt;
&lt;p&gt;The &lt;a href=&#34;#&#34;&gt;&lt;code&gt;async&lt;/code&gt;&lt;/a&gt; gem &lt;a href=&#34;https://github.com/socketry/async/blob/main/lib/async/scheduler.rb&#34;&gt;uses &lt;code&gt;handle_interrupt&lt;/code&gt;&lt;/a&gt; to ignore &lt;code&gt;SignalException&lt;/code&gt; while it shuts down its child tasks:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Stop all children, including transient children, ignoring any signals.
def stop
  Thread.handle_interrupt(::SignalException =&amp;gt; :never) do
    @children&amp;amp;.each do |child|
      child.stop
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In &lt;code&gt;sidekiq&lt;/code&gt;, when a graceful shutdown attempt has failed and threads must be forced to finish, it raises a special error. That error extends &lt;code&gt;Interrupt&lt;/code&gt;, which means most &lt;code&gt;rescue&lt;/code&gt; blocks will not capture it because it is a child of &lt;code&gt;Exception&lt;/code&gt; rather than &lt;code&gt;StandardError&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;module Sidekiq
  class Shutdown &amp;lt; Interrupt; end
end
# later...
t.raise(Sidekiq::Shutdown)
&lt;/code&gt;&lt;/pre&gt;
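&lt;p&gt;That inheritance detail matters: a bare &lt;code&gt;rescue&lt;/code&gt; only catches &lt;code&gt;StandardError&lt;/code&gt; descendants. A sketch, with hypothetical &lt;code&gt;perform&lt;/code&gt; and &lt;code&gt;log_failure&lt;/code&gt; methods:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;begin
  perform(job)
rescue =&amp;gt; e
  # only StandardError descendants land here -
  # Sidekiq::Shutdown (an Interrupt) sails right past
  log_failure(e)
end
&lt;/code&gt;&lt;/pre&gt;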
&lt;p&gt;To avoid &lt;code&gt;Sidekiq::Shutdown&lt;/code&gt; breaking &lt;em&gt;everything&lt;/em&gt; (including its own internal code), Sidekiq also uses &lt;code&gt;handle_interrupt&lt;/code&gt; to ignore the error in a small piece of shutdown code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;IGNORE_SHUTDOWN_INTERRUPTS = {Sidekiq::Shutdown =&amp;gt; :never}
ALLOW_SHUTDOWN_INTERRUPTS = {Sidekiq::Shutdown =&amp;gt; :immediate}

def process(uow)
  jobstr = uow.job
  queue = uow.queue_name
     
  #... process logic
  ack = false
  Thread.handle_interrupt(
    IGNORE_SHUTDOWN_INTERRUPTS
  ) do
    Thread.handle_interrupt(
      ALLOW_SHUTDOWN_INTERRUPTS
    ) do
      dispatch(...) do |inst|
        config.server_middleware... do
          execute_job(...)
        end
      end
      ack = true
    rescue Sidekiq::Shutdown
      # Had to force kill this job because it didn&#39;t finish
      # within the timeout.  Don&#39;t acknowledge the work since
      # we didn&#39;t properly finish it.
    end
  ensure
    if ack
      uow.acknowledge
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;If this section hasn’t been enough for you, Ben Sheldon gives some additional interesting examples in his article &lt;a href=&#34;https://island94.org/2023/08/appropriately-using-rubys-thread-handle_interrupt&#34;&gt;Appropriately using Ruby’s thread &lt;code&gt;handle_interrupt&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;h4 id=&#34;ensure-success&#34;&gt;The way you &lt;code&gt;ensure&lt;/code&gt; success&lt;/h4&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/40e94602-9e2b-4dd2-a60f-3a407944cbe8.jpg&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;
&lt;p&gt;I’m pretty confident that, if it started from scratch, Ruby would not implement &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt; again. I don’t know &lt;em&gt;which&lt;/em&gt; model they would choose - but something like a &lt;a href=&#34;https://docs.oracle.com/javase/tutorial/essential/concurrency/interrupt.html&#34;&gt;Java interrupt&lt;/a&gt; would be a good start. And minimally, making &lt;a href=&#34;https://bsky.app/profile/headius.bsky.social/post/3lvnkypqewk2b&#34;&gt;all &lt;code&gt;ensure&lt;/code&gt; blocks uninterruptible&lt;/a&gt;, as well as &lt;a href=&#34;https://www.manageiq.org/blog/2017/09/finalizers-can-be-interrupted-from-time-to-time/&#34;&gt;all finalizers&lt;/a&gt;. I didn’t even get into finalizers - they’re a less common, but also important, area that you &lt;em&gt;really&lt;/em&gt; don’t want to interrupt.&lt;/p&gt;
&lt;p&gt;Ruby is one of the only programming languages that lets you kill a thread from outside of the thread. It’s powerful, but mostly, it’s dangerous. It’s one of the sharpest tools available to you, and it should be used &lt;em&gt;sparingly&lt;/em&gt;, or ideally not at all.&lt;/p&gt;
&lt;p&gt;In threaded code, the best offense is a good defense:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Consider how to safely manage your threads. If your thread is going to do a lot of work, or expensive work, make sure you have an escape hatch (like a boolean check for interrupts)&lt;/li&gt;
&lt;li&gt;Treat performance issues like a bug. If you have threads that are hogging resources and timing things out, you need to fix them. &lt;code&gt;kill&lt;/code&gt;ing them is not a long-term answer&lt;/li&gt;
&lt;li&gt;Ultimately, there is no way to guarantee the safety of your code 100% of the time. The OS could kill your program. Your server could get unplugged. You’re always going to have edge cases that you can’t foresee. But control what you can, and be aware of what can go wrong.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Now go forth, armed with the knowledge of what to do when good threads go bad.&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Unless you’re in SQLite, where apparently N+1 queries are a virtue 😲 &lt;a href=&#34;https://www.sqlite.org/np1queryprob.html&#34;&gt;https://www.sqlite.org/np1queryprob.html&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;In general, you shouldn’t do this directly&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;And processes.&lt;/p&gt;
&lt;p&gt;Or use Falcon. See me later in the series when we talk about Fibers 😏&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;There’s a small mention about how to handle Timeout errors, but it doesn’t explain much or warn at all&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2025/whengoodthreadsgobad.jpg)

&gt; 👋🏼 This is part of series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt; - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my!](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - When good threads go bad
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “When good threads go bad”. I’ll update the links as each part is released, and include these links in each post.

- [Threads out after curfew](#threads-curfew)
- [Stuck threads](#stuck-threads)
	- [Deadlocks](#deadlocks)
	- [Livelocks](#livelocks)
	- [Long-running CPU](#long-running-cpu)
	- [Long-running IO](#long-running-io)
	- [Native extensions](#native-extensions)
	- [What happened to our Puma server, anyways?](#what-happened-to-puma)
- [Thread shutdown](#thread-shutdown)
	- [`raise` and `kill`](#thread-raise-kill)
		- [Don’t `kill` threads](#dont-kill-threads)
		- [Interrupt your threads safely](#interrupt-safely)
		- [`ensure` cleanup](#ensure-cleanup)
		- [Don’t use `timeout`](#dont-use-timeout)
		- [Use `term_on_timeout` with `rack-timeout`](#use-term-on-timeout)
		- [Monitor your thread costs](#monitor-your-thread-costs)
		- [Get a `handle_interrupt` on things](#get-a-handle)
	- [`Thread.handle_interrupt`](#thread-handle-interrupt)
	- [The way you `ensure` success](#ensure-success)

&lt;h2 id=&#34;threads-curfew&#34;&gt;Threads out after curfew 🧵 &lt;/h2&gt;

It’s late, and you start getting alerts that requests to your web server are failing. You try to load a page and it hangs endlessly. The server isn’t responding to anything, and requests are continuing to queue up.

![](https://cdn.uploads.micro.blog/98548/2025/threads-do-you-know-where-your-threads-are.drawio-2.png)

&gt; It’s 10PM. Do you know where your &lt;s&gt;children&lt;/s&gt; *threads* are?
&gt; 
&gt; 📺 [iykyk](https://en.wikipedia.org/wiki/Do_you_know_where_your_children_are%3F)

Not knowing what else to do, you trigger a server restart. Even doing that, things remain unresponsive for another 30 seconds. Finally you see the server stop, and start up again. As if by magic, everything is running fine again.

You’re running Puma with threads. It seemed like every thread was unresponsive. What happened to those threads?!

&lt;h3 id=&#34;stuck-threads&#34;&gt;Stuck threads&lt;/h3&gt;

The reality of the situation is probably mundane.

1. Are you looking up data in a query? Did you remember to index your columns?
2. Do you have [N+1 queries?](https://www.namitjain.com/blog/n-plus-1-query-problem)[^1]
3. Did you remember to [set statement and lock timeouts](https://github.com/ankane/strong_migrations?tab=readme-ov-file#migration-timeouts) before running a schema migration?
4. Are you _sure_ it isn’t 1, 2 or 3?

If it _isn’t_ those things, there are other ways your threads can go rogue. Let’s look at some ways a thread can get stuck:

- Deadlocks
- Livelocks
- Long-running CPU
- Long-running IO
- Native extensions

&lt;h4 id=&#34;deadlocks&#34;&gt;&lt;a href=&#34;https://en.m.wikipedia.org/wiki/Deadlock_(computer_science)&#34;&gt;Deadlocks&lt;/a&gt;&lt;/h4&gt;

The conventional example of a deadlock is two [threads attempting to acquire a mutex](https://jpcamara.com/2024/08/26/the-thread-api.html#mutex) held by the other thread. 

They can never make progress, so they’re dead in the water:

	mutex_1 = Mutex.new
	mutex_2 = Mutex.new
	thread_1 = Thread.new do
	  mutex_1.synchronize do
	    sleep 1
	    mutex_2.lock
	  end
	end
	thread_2 = Thread.new do
	  mutex_2.synchronize do
	    sleep 1
	    mutex_1.lock
	  end
	end
	thread_1.join

In our example, `thread_1` acquires `mutex_1`. Then `thread_2` acquires `mutex_2`. Next, `thread_1` attempts to acquire `mutex_2` and blocks. Then `thread_2` attempts to acquire `mutex_1` and blocks. Neither can make progress; both are stuck in place.

![](https://cdn.uploads.micro.blog/98548/2025/whengoodthreadsgobad.gif)

It’s a traditional example, but Ruby is very good at detecting it! It checks whether any thread is still capable of making progress, and raises an error if none are:

```text
&#39;Thread#join&#39;: No live threads left. Deadlock? (fatal)
  3 threads, 3 sleeps current:0x0000000a35e90c00 main thread:0x0000000101262de0
  * #&lt;Thread:0x00000001001fafa8 sleep_forever&gt;
    rb_thread_t:0x0000000101262de0 native:0x0000000202cfc800 int:0
	   
  * #&lt;Thread:0x0000000121727c08 (irb):101 sleep_forever&gt;
  rb_thread_t:0x0000000a35e90e00 native:0x00000001700f7000 int:0 mutex:0x0000000a3638c000 cond:1
    depended by: tb_thread_id:0x0000000101262de0
	   
  * #&lt;Thread:0x00000001217279d8 (irb):107 sleep_forever&gt;
  rb_thread_t:0x0000000a35e90c00 native:0x0000000170203000 int:0
```
Ruby detects that `thread_1` and `thread_2` are sleeping, and the third thread is `Thread.main`, which sleeps waiting for `thread_1` to finish.

Most long-running programs are likely to have _some_ other thread running, and Ruby only detects the deadlock if _all_ threads are stuck. We can get our deadlock example to work by adding an extra thread in a work loop:

	mutex_1 = Mutex.new
	mutex_2 = Mutex.new
	thread_1 = Thread.new do
	  mutex_1.synchronize do
	    sleep 1
	    mutex_2.lock
	  end
	end
	thread_2 = Thread.new do
	  mutex_2.synchronize do
	    sleep 1
	    mutex_1.lock
	  end
	end
	
	thread_3 = Thread.new do
	  loop do
	    # process some work...
	  end
	end
	
	thread_1.join

Now threads 1 and 2 will never progress, and Ruby lets the program continue running because `thread_3` is still active.

&lt;h4 id=&#34;livelocks&#34;&gt;Livelocks&lt;/h4&gt;

While a deadlock means a thread has stopped processing, a livelock happens when a thread keeps running, but never makes any progress. Here’s another example using an alternative approach for acquiring a mutex:

	mutex_1 = Mutex.new
	mutex_2 = Mutex.new
	thread_1 = Thread.new do
	  mutex_1.synchronize do
	    sleep 0.1
	    while !mutex_2.try_lock; end
	  end
	end
	thread_2 = Thread.new do
	  mutex_2.synchronize do
	    sleep 0.1
	    while !mutex_1.try_lock; end
	  end
	end
	thread_1.join

This is similar to our deadlock example, but this time we use `#try_lock` instead of `#lock`. Unlike `#lock`, which blocks until the mutex is available, `#try_lock` returns `false` if attempting the lock fails. We do a short `sleep` in each thread to give them time to acquire the initial locks, then iterate infinitely attempting `#try_lock`. The locks will never be acquired, and the loops will run forever. Burn, CPU, burn 🔥.

Personally I’ve rarely encountered deadlocks and livelocks in threaded code. But I’ve definitely encountered them in databases!

	thread_1 = Thread.new do
	  User.find(1).with_lock do
	    sleep 0.1
	    User.find(2).lock!
	  end
	end
	
	thread_2 = Thread.new do
	  User.find(2).with_lock do
	    sleep 0.1
	    User.find(1).lock!
	  end
	end
	
	thread_1.join

PostgreSQL will detect this deadlock and raise an error:

```text
PG::TRDeadlockDetected: ERROR:  deadlock detected (ActiveRecord::Deadlocked)
DETAIL:  Process 581 waits for ShareLock on transaction 4003; blocked by process 582.
Process 582 waits for ShareLock on transaction 4002; blocked by process 581.
CONTEXT:  while locking tuple (0,1) in relation &#34;users&#34;
```
How do we solve deadlocks and livelocks? The answer is a consistent order for locking.

	mutex_1 = Mutex.new
	mutex_2 = Mutex.new
	thread_1 = Thread.new do
	  mutex_1.synchronize do
	    sleep 1
	    mutex_2.lock
	  end
	end
	thread_2 = Thread.new do
	  mutex_1.synchronize do
	    sleep 1
	    mutex_2.lock
	  end
	end
	
	thread_3 = Thread.new do
	  loop do
	    # process some work...
	  end
	end
	
	thread_1.join

This is the same example as before, but this time both threads attempt to acquire mutexes in _identical_ order. As long as you acquire in a consistent order, you should never hit deadlocks or livelocks.
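
One way to make that ordering hard to get wrong - a sketch of a hypothetical helper, not anything from the standard library - is to sort the mutexes into a single, stable order before locking:

	def lock_in_order(*mutexes)
	  # sorting by object_id gives every thread the same global order,
	  # so no thread can end up holding the other&#39;s &#34;first&#34; lock
	  mutexes.sort_by(&amp;:object_id).each(&amp;:lock)
	  yield
	ensure
	  mutexes.each { |m| m.unlock if m.owned? }
	end
	
	lock_in_order(mutex_1, mutex_2) do
	  # work with both resources safely
	end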

&lt;h4 id=&#34;long-running-cpu&#34;&gt;Long-running CPU&lt;/h4&gt;

As far as I know, this isn’t _actually_ possible in pure Ruby. What I mean by “pure” Ruby is a program that only runs Ruby code, and no C/Rust/Zig extensions. The CRuby runtime controls how pure Ruby code runs, and makes sure we can’t hog threads. _Mostly_.

In [Bitmasks, Ruby Threads and Interrupts, oh my!](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html), we dug into the [`TIMER_INTERRUPT_MASK`](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html#timer-interrupt-mask) and how it utilizes priority. It allows a thread to influence how large of a time slice it gets:

	def calculate_priority(priority, limit)
	  priority &gt; 0 ? limit &lt;&lt; priority : limit &gt;&gt; -priority
	end
		
	calculate_priority(0, 100)        # =&gt; 100ms
	calculate_priority(2, 100)        # =&gt; 400ms
	calculate_priority(-2, 100)       # =&gt; 25ms

At a default priority of 0, we get Ruby’s default time slice of 100ms. At -2, we get 25ms. At 2, we get 400ms. So, in theory, can we starve out other threads by increasing our priority?

	calculate_priority(15, 100)      # =&gt; 3276800ms

3,276,800 milliseconds is 54 minutes. Can we really block things for 54 minutes!?

	counts = Array.new(10, 0)
	done = false
	
	threads = 10.times.map do |i|
	  Thread.new do
	    j = 0
	    loop do
	      break if done
	      j += 1
	      counts[i] += 1 if j % 1_000_000 == 0
	    end
	  end.tap do |t|
	    t.priority = 15 if i == 5
	  end
	end
	
	sleep 10
	done = true
	counts.each_with_index { |c, i| puts &#34;#{i}: #{c}&#34; }

There’s a bunch going on here - but there are only a few details to focus on:

1. For the 6th thread (`i == 5`), we set the priority to 15. That’s the priority that in theory gives us a time slice of 54 minutes.
2. We set a count every million iterations in each thread.
3. We sleep for 10 seconds, then flip the killswitch for every thread by setting `done` to `true`.
4. We print how many times each thread was able to increment its slice of the array.

Here’s the output:

```text
0: 26
1: 27
2: 26
3: 27
4: 28
5: 189
6: 24
7: 24
8: 24
9: 24
```
As we can see - the priority made a difference! The thread at index `5` runs around 7 times as much as every other thread. _But_, it does not fully hog the CPU like we might have thought. This is only 10 seconds, well under 54 minutes, and the other threads still get prioritized by the scheduler.

This shows that we can _influence_ the scheduler, but we can’t completely hog the runtime from pure Ruby code.

In more practical Ruby code, the [Sidekiq](https://github.com/sidekiq/sidekiq) gem gives you the ability to set priority on the threads it creates:

	Sidekiq.configure_server do |cfg|
	  cfg.thread_priority = 15
	end

Interestingly, by default, Sidekiq sets its threads to a priority of -1, which is less than the `0` that Ruby uses by default. It describes the rationale:

&gt; Ruby’s default thread priority is 0, which uses 100ms time slices. This can lead to some surprising thread starvation; if using a lot of CPU-heavy concurrency, it may take several seconds before a Thread gets on the CPU.
&gt; 
&gt; Negative priorities lower the timeslice by half, so -1 = 50ms, -2 = 25ms, etc. With more frequent timeslices, we reduce the risk of unintentional timeouts and starvation.

Fascinating to see a real-world use-case of priority like this! Sidekiq has run trillions of jobs across hundreds of thousands of apps, and they made the decision to switch the priority.

Since Ruby 3.4, you can achieve the same thing globally by using [`RUBY_THREAD_TIMESLICE`](https://bugs.ruby-lang.org/issues/20861). You can set `RUBY_THREAD_TIMESLICE=50` and keep the priority the same, but now the time slice is 50ms.
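
For example (the script name here is just a placeholder):

```text
RUBY_THREAD_TIMESLICE=50 bundle exec ruby my_threaded_script.rb
```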

&lt;h4 id=&#34;long-running-io&#34;&gt;Long-running IO&lt;/h4&gt;

This is the likeliest scenario that will saturate your threads: long-running IO. In [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html), we discussed how threads are great at handing off work when blocked on IO:

&gt; As soon as you do any IO operation, it just parks that thread/fiber and resumes any other one that isn’t blocked on IO.

However, this only works as long as:

- You have other threads available to pickup work
- You’ve tuned your thread counts based on your workload. [Amdahl’s Law](https://youtu.be/6HaXuQJMcvs?si=kC9e7GWZF-PIqGde&amp;t=161) helps you decide how many threads you should run, based on how much IO is running in your code (see the sizing sketch just after this list)
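
A back-of-the-envelope sketch of that kind of sizing - using the common &#34;cores * (1 + wait/compute)&#34; heuristic with made-up numbers, not anything prescribed by the linked talk:

	cores = 4
	io_wait_ms = 90.0   # time a request spends blocked on IO
	compute_ms = 10.0   # time it spends on the CPU
	
	threads = (cores * (1 + io_wait_ms / compute_ms)).round
	puts threads # =&gt; 40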

Earlier we mentioned slow queries as a possible IO blocker. But let’s say you have a web server running with 5 threads, and you allow users to download files[^2]:

	class DownloadsController &lt; ApplicationController
	  def index
	    send_file Rails.root.join(&#34;public/large.txt&#34;),
	      filename: &#34;large.txt&#34;,
	      type: &#34;application/octet-stream&#34;
	  end
	end

We have a simple Rails controller action, which sends a file to the client making the request. It’s pretty straightforward! The `large.txt` file I tested with locally is about 130mb. I’ll run it in a basic Puma setup, using 5 threads:

```text
bundle exec puma -t 5:5
	
Puma starting in single mode...
 * Puma version: 7.1.0 (&#34;Neon Witch&#34;)
 * Ruby version: ruby 4.0.0 (2025-12-25 revision 553f1675f3) +PRISM [arm64-darwin23]
 *  Min threads: 5
 *  Max threads: 5
```
I’m running a Puma server with 5 threads. Let’s try some benchmarks against it. We’ll use [Apache Bench](https://httpd.apache.org/docs/2.4/programs/ab.html) (`ab`) to simulate traffic to Puma. `-n` means the number of total requests, and `-c` is how many concurrent requests to make. Let’s start with 1 request:

```text
ab -n 1 -c 1 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   0.070 seconds
Complete requests:      1
```
Cool - 0.07 seconds. How about 3?

```text
ab -n 3 -c 3 http://0.0.0.0:9292/downloads
	
Concurrency Level:      3
Time taken for tests:   0.074 seconds
Complete requests:      3
```
Not much change from a single request! How about 5? This would match the maximum number of simultaneous requests our server can currently support, using 5 threads:

```text
ab -n 5 -c 5 http://0.0.0.0:9292/downloads
	
Concurrency Level:      5
Time taken for tests:   0.134 seconds
Complete requests:      5
```
Things slow down a _little_ bit once we max out our threads. But still reasonable. _How about 50_?

```text
ab -n 50 -c 50 http://0.0.0.0:9292/downloads
	
Concurrency Level:      50
Time taken for tests:   0.820 seconds
Complete requests:      50
```
So far, so good. But we’re benefiting from a lot here: the file isn’t _particularly_ large, and there is _no_ latency. The server is responding quickly, and the client is consuming the response quickly. Let’s switch things up a bit - how well does it handle a client downloading slowly?

We’ll use `curl` to simulate a slow client. `curl ... --limit-rate 1000k &amp;` means we’ll download at a rate of 1000k per second, running in the background (`&amp;`). At that rate it will take our `curl` call over 2 minutes to download a 130mb file. At the same time, we’ll run another Apache Bench to see how things perform:

```text
curl http://0.0.0.0:9292/downloads -o /tmp/large.txt \
  --limit-rate 1000k &amp;
	
ab -n 1 -c 1 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   0.056 seconds
Complete requests:      1
```
Puma responds quickly. In this scenario, `curl` is occupying one thread, and `ab` occupies another. Let’s try running 4 curl commands:

```bash
for i in {1..4}; do
  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt \
    --limit-rate 1000k &amp;
done
	
ab -n 1 -c 1 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   0.059 seconds
Complete requests:      1
```
Puma still responds fine. `curl` is now occupying 4 threads, and `ab` uses the remaining 1 thread. Let’s add _one_ more request using `curl`. We also increase the default timeout of `ab` (which is 30 seconds) to _200_, for no particular reason…

```bash
for i in {1..5}; do
  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt \
    --limit-rate 1000k &amp;
done
	
ab -n 1 -c 1 -s 200 http://0.0.0.0:9292/downloads
	
Concurrency Level:      1
Time taken for tests:   124.477 seconds
Complete requests:      1
```
**Yikes**. That did **not** go well. The moment we had 5 slow running requests from our `curl` calls, we saturated all available threads. Our 6th request using `ab` sat around waiting, finally finishing 2 _minutes_ later!

This is a **critical** consideration - long-running web work is a throughput killer. Ideally, keep all work as fast as possible and offload long-running work to jobs/other services. Most commonly for a download you’d create a [presigned url](https://docs.aws.amazon.com/AmazonS3/latest/userguide/ShareObjectPreSignedURL.html) for a service like S3 and redirect to that URL.
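
As a sketch, that redirect could look like this (assuming the `aws-sdk-s3` gem; the bucket name and credentials setup are illustrative):

```ruby
require &#34;aws-sdk-s3&#34;

class DownloadsController &lt; ApplicationController
  def index
    object = Aws::S3::Object.new(&#34;my-bucket&#34;, &#34;large.txt&#34;)
    # The web thread only spends time generating a URL - the slow transfer
    # happens between the client and S3, not our server
    redirect_to object.presigned_url(:get, expires_in: 300),
      allow_other_host: true
  end
end
```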

If you _need_ to run long-running IO, you need to allocate many more threads[^3].
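
In Puma terms, that means raising the thread ceiling, trading more memory and more GVL contention for IO headroom:

```text
bundle exec puma -t 5:50
```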

&lt;h4 id=&#34;native-extensions&#34;&gt;Native extensions&lt;/h4&gt;

In our [Long-running CPU](#long-running-cpu) example, we couldn’t get pure Ruby code to completely hog the runtime. We can prioritize a thread higher than other threads, but work still continues to be distributed.

Once you start running native extensions, the Ruby runtime has more limited influence. It’s up to the extension to properly interface with Ruby and yield control back. Here’s a simple example that will block all other threads, using the standard `openssl` gem and its C-implemented `pbkdf2_hmac` function:

	require &#34;openssl&#34;
	
	Thread.new do
	  loop do
	    puts &#34;tick #{Time.now}&#34;
	  end
	end
	
	t = Thread.new do
	  loop do
	    OpenSSL::KDF.pbkdf2_hmac(&#34;passphrase&#34;, salt: &#34;salt&#34;, iterations: 12_800_000, length: 32, hash: &#34;sha256&#34;)
	  end
	end
	
	t.join

We have two threads running - one infinitely printing the time in a loop, and one infinitely calling `pbkdf2_hmac` in a loop. Here I give `pbkdf2_hmac` a ludicrous number of iterations to force the function to run, in C, for a long period of time:

```text
tick 2025-12-29 15:20:30 -0500
tick 2025-12-29 15:20:45 -0500
tick 2025-12-29 15:20:59 -0500
```
The results show that despite each thread having the same priority, the thread with long-running C extension code hogs all of the runtime. The printing thread is able to print a timestamp roughly every 15 seconds.

There’s nothing the extension code is doing _wrong_ per se, but because it runs purely in C, without yielding back to Ruby until it’s done, Ruby can’t do anything to keep work distribution fair.

In most cases, well-developed/mature native extensions won’t hit this issue. There are many popular gems that are, or include, native extensions. But if you _do_ hit an expensive path in a C extension, be aware the Ruby runtime will not be able to control it. If you _know_ you are interfacing with a slow piece of code in a native extension, keep it off the hot path, same as our long-running IO example.
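
For instance, rather than calling something like `pbkdf2_hmac` on a web thread, you could push it into a background job - a hypothetical sketch, with illustrative names:

```ruby
require &#34;openssl&#34;

class DeriveKeyJob &lt; ApplicationJob
  # Runs the expensive C call on a worker, keeping web threads free
  def perform(passphrase, salt)
    key = OpenSSL::KDF.pbkdf2_hmac(
      passphrase,
      salt: salt,
      iterations: 600_000, # a realistic count, not our ludicrous example
      length: 32,
      hash: &#34;sha256&#34;
    )
    # ...store or use the derived key...
  end
end
```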

&lt;h4 id=&#34;what-happened-to-puma&#34;&gt;What happened to our Puma server, anyways?&lt;/h4&gt;

Remember our production panic scenario from earlier?

&gt; Not knowing what else to do, you trigger a server restart. Even doing that, things remain unresponsive for another 30 seconds. Finally you see the server stop, and start up again. As if by magic, everything is running fine again

You triggered a server restart, and still had to wait 30 seconds? Why wasn’t Puma able to stop sooner? Let’s reuse our download example from earlier, and explain some default Puma behaviors!

First, let’s start Puma, and see how quickly we can stop it:

	bundle exec puma -t 5:5
	
	Puma starting in single mode...
	* Puma version: 7.1.0 (&#34;Neon Witch&#34;)
	* Ruby version: ruby 4.0.0 (2025-12-25 revision 553f1675f3) +PRISM [arm64-darwin23]
	*  Min threads: 5
	*  Max threads: 5
	
	### hit ctrl + c to issue a SIGINT signal
	
	Gracefully stopping, waiting for requests to finish
	=== puma shutdown: 2025-12-29 23:45:21 -0500 ===
	- Goodbye!

We instantly see two things:

- `Gracefully stopping, waiting for requests to finish`
- `Goodbye!`

Puma tells us it is shutting down “gracefully”. With no activity, it is able to instantly stop.

Now let’s use our `DownloadsController` again:

	class DownloadsController &lt; ApplicationController
	  def index
	    send_file Rails.root.join(&#34;public/large.txt&#34;),
	      filename: &#34;large.txt&#34;,
	      type: &#34;application/octet-stream&#34;
	  end
	end

We start Puma again, then occupy each thread with a request:

	bundle exec puma -t 5:5
	
	for i in {1..5}; do
	  curl http://0.0.0.0:9292/downloads -o /tmp/large_$i.txt \
	    --limit-rate 1000k &amp;
	done

Now let’s try issuing `INT` using `ctrl+c` again:

	Puma starting in single mode...
	...
	Use Ctrl-C to stop
	
	^C &lt;-- ctrl + c
	
	... ~120 seconds pass, all downloads finish ...
	
	- Gracefully stopping, waiting for requests to finish
	=== puma shutdown: 2025-12-30 00:04:23 -0500 ===
	- Goodbye!

Ok, so we issued our interruption. But… it waited for every request to _completely_ finish! We had to wait around 120 seconds before our server shut down. That’s even worse than earlier!

This isn’t a blog post about Puma specifically, but I’ll discuss a few factors:

1. Running `puma -t 5:5` puts you in Puma “single” mode. This is a lightweight way of running Puma, at the cost of less control. By default, single-mode Puma does not kill any requests. Like it tells us, it is simply “waiting for requests to finish”
2. Alternatively, you can run Puma in “cluster” mode. This is done by adding any worker count at all using the `-w` flag. Even a `-w 1` will move you into cluster mode. Cluster mode has a parent worker which monitors and controls child workers. This costs more memory, but it means the parent worker can kill child workers more reliably

Let’s try this one more time, starting in cluster mode by setting `-w 1`:

	bundle exec puma -t 5:5 -w 1
	
	[39363] Puma starting in cluster mode...
	[39363] *  Min threads: 5
	[39363] *  Max threads: 5
	[39363] *   Master PID: 39363
	[39363] *      Workers: 1
	[39363] Use Ctrl-C to stop
	^C &lt;-- ctrl + c
	[45110] - Gracefully shutting down workers...
	
	... ~30 seconds pass, downloads are cut early ...
	
	[45110] === puma shutdown: 2025-12-30 00:24:21 -0500 ===
	[45110] - Goodbye!

This time we replicate our earlier behavior - 30 seconds pass, and the server is shut down. Now that we’re running in cluster mode, Puma respects a configuration called `worker_shutdown_timeout`, which defaults to 30 seconds. If you have a configuration file, you can set it yourself to something longer or shorter:

	worker_shutdown_timeout 25 # instead of 30

As well, by default Puma _never_ kills threads. In a moment we’re going to be talking about ways to kill a thread. Puma plays it extremely safe, and offers _no_ ability to kill individual threads. And even when shutting down, it defaults to a thread shutdown policy of `:forever`, which means the only way the threads are killed is when the server is entirely shutdown, which shuts down the worker the threads live in.

You _can_ change this. In the same configuration file you’d set `worker_shutdown_timeout` you can set `force_shutdown_after`:

	force_shutdown_after 1 # an integer, :forever, or :immediate

Still - this doesn’t do much. It still only impacts full server shutdown. But with this setting, the internal Puma thread pool will `raise` on all threads, and then eventually run `kill` on them.

What all this means is that by default in Puma:

- The _primary_ way to fix misbehaving threads is to restart your server
- This will take up to 30 seconds running in cluster mode, unless you change the `worker_shutdown_timeout`
- Puma can’t kill long-running or stuck threads without a full server shutdown. Technically there is [a control server](https://github.com/puma/puma#controlstatus-server) you can setup, which can manage individual workers. But it cannot kill individual threads/requests

We’ll talk in-depth later about the available options for killing long-running threads in Puma.

&gt; 📝 if you want to dig deeper into how Puma works, I highly recommend [Dissecting Puma: Anatomy of a Ruby Web Server](https://dansvetlov.me/puma-internals/). The [source of Puma](https://github.com/puma/puma) is also pretty readable

&lt;h4 id=&#34;dont-guess&#34;&gt;Don&#39;t guess, measure&lt;/h4&gt;

All of these thread issues are _possibilities_, but they mean nothing without empirical data. Measure, then decide your course of action.

&lt;h3 id=&#34;thread-shutdown&#34;&gt;Thread Shutdown&lt;/h3&gt;

Ok, enough about _how_ threads get stuck. Once they’re stuck - is there a way to stop them?

&lt;h4 id=&#34;thread-raise-kill&#34;&gt;&lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-i-raise&#34;&gt;&lt;code&gt;raise&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-i-kill&#34;&gt;&lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/dd43f6992d.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;

&gt; You want to kill… me? 🥺

&gt; ⚠️ TL;DR You shouldn’t use these methods unless you _really_ know what you’re doing. Instead, [interrupt your thread safely](#interrupt-safely). Incidentally, you should also [avoid the timeout module](#dont-use-timeout).
&gt; 
&gt; If you’re writing a generic threaded framework you may need them - for custom one-off threads you can probably manage without them

Sometimes a thread is running and you need to shut it down. There are two primary methods for achieving that: `raise` and `kill`.

`raise` will raise an error inside of the target thread. If the thread hasn’t started yet, in most cases it is killed before running anything:

```ruby
t = Thread.new do
  puts &#34;never runs&#34;
end
t.raise(&#34;knock it off&#34;)
sleep 0.01
puts thread_status(t)
#&lt;ThreadStatus status=&#34;failed w/ error: knock it off&#34;, error=#&lt;RuntimeError: knock it off&gt;
```
&gt; 📝 `thread_status` is the helper we defined in the “Thread API” section on [`status`](https://jpcamara.com/2024/08/26/the-thread-api.html#thread-statuses)

The error isn’t raised instantly - only at the point the thread is scheduled next. We `sleep 0.01` to give thread `t` an opportunity to start. The thread scheduler starts it, and it immediately raises our “knock it off” error, effectively running right _before_ `puts &#34;never runs&#34;`.

If the thread gets a chance to start, the error will be raised on whatever line happened to be running last:

	t = Thread.new do
	  sleep 5
	end
	t.join(1)
	# We&#39;re only one second into the thread&#39;s sleep at this point, so &#34;knock it off&#34; is raised from the `sleep 5` line
	t.raise(&#34;knock it off, sleepyhead!&#34;)
	sleep 0.1
	puts thread_status(t)
	#&lt;ThreadStatus status=&#34;failed w/ error: knock it off, sleepyhead!&#34;, error=#&lt;RuntimeError: knock it off, sleepyhead!&gt;

Because it raises whatever error is provided (a `RuntimeError` if just a string is provided), we can actually rescue the error, ignore it, and [`retry`](https://docs.ruby-lang.org/en/master/syntax/exceptions_rdoc.html) 😱:

	class KnockItOffError &lt; StandardError; end
	
	t = Thread.new do
	  sleep 5
	  puts &#34;✌️&#34;
	rescue KnockItOffError =&gt; e
	  puts &#34;Nice try #{e}&#34;
	  retry
	end
	
	t.join(1)
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	t.join
	
	puts thread_status(t)
	# Nice try 👊 
	# ✌️
	#&lt;ThreadStatus status=&#34;finished&#34; error=nil&gt;

With `raise` and `kill`, issues start to creep in when errors are thrown in an `ensure`. Here we use a `ConditionVariable` (we dug into those in [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html#condition-variable)) to guarantee we raise from the `ensure` block:

```ruby
mutex = Mutex.new
condition = ConditionVariable.new
t = Thread.new do
  sleep 5
  puts &#34;✌️&#34;
ensure
  mutex.synchronize do
    # Signal the condition.wait in the main thread
    condition.signal
    # ...perform some cleanup...
    condition.wait(mutex)
    puts &#34;I&#39;ll never fire 😔&#34;
  end
end
	
mutex.synchronize do
  # This will wait until our Thread ensure runs and signals us
  condition.wait(mutex)
end
	
t.raise(KnockItOffError.new(&#34;👊&#34;))
	
mutex.synchronize do
  # We&#39;ve enqueued our error, now signal so condition#wait fires in the ensure block
  condition.signal
end
sleep 0.1
	
puts thread_status(t)
# ✌️
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34;, error=#&lt;KnockItOffError: 👊&gt;&gt;
```
We don’t see “I’ll never fire 😔”. What happens to our cleanup? Shouldn’t `ensure`, erm, umm, _ensure_ that things finish…

Moving on from `raise`: `kill` stops the thread from running any more instructions, no matter what it’s doing. `raise` can be `rescue`’d, `kill` can’t.

```ruby
t = Thread.new do
  sleep 5
  puts &#34;✌️&#34;
# Exception is the root of Ruby&#39;s exception hierarchy. If you can&#39;t rescue it with this, you can&#39;t rescue it
rescue Exception
  puts &#34;#kill cannot be stopped...&#34;
ensure
  puts &#34;This will still run&#34;
end
	
t.join(1)
t.kill
sleep 0.1
	
puts thread_status(t)
# This will still run
# A #kill&#39;d thread gives no indication it was terminated 😔
#&lt;ThreadStatus status=&#34;finished&#34;, error=nil&gt;
```
&gt; 📝 _Technically_, `kill` can be ignored, we’ll explain that when discussing `handle_interrupt`. 
&gt; 
&gt; ⚠️ Don’t rescue `Exception`, it&#39;s a bad idea and you could accidentally rescue things like a `NoMemoryError` 😬

Because `kill` doesn’t raise an error, you actually can’t even tell that the thread was `kill`ed. We just get the normal `false` `status`, represented in our example by “finished”.

Like `raise`, `kill` can also disrupt your `ensure` methods:

```ruby
mutex = Mutex.new
condition = ConditionVariable.new
t = Thread.new do
  sleep 5
  puts &#34;✌️&#34;
ensure
  mutex.synchronize do
    condition.signal
    # perform some cleanup
    condition.wait(mutex)
    puts &#34;I&#39;ll never fire 😔&#34;
  end
end
	
mutex.synchronize do
  condition.wait(mutex)
end
	
t.kill
	
mutex.synchronize do
  condition.signal
end
sleep 0.1
	
puts thread_status(t)
# ✌️
#&lt;ThreadStatus status=&#34;finished&#34; error=nil&gt;
```
There are a few aliases for `kill` to be aware of as well:

```ruby
t = Thread.new do
  Thread.exit # internally gets the current thread, and calls `kill`
end
	
t2 = Thread.new { sleep }
Thread.kill(t) # class method `kill`
t2.exit        # alias for `kill`
t2.terminate   # alias for `kill`
```
Case closed. Feel free to use `raise` and `kill` on your threads. No harm no foul… oh what’s this here?

![](https://cdn.uploads.micro.blog/98548/2025/05bee6d9-e6ae-4398-bde8-8f0bc387e600.jpg)

[Ruby&#39;s Thread#raise, Thread#kill, timeout.rb, and net/protocol.rb libraries are broken](http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html)

![](https://cdn.uploads.micro.blog/98548/2025/952b1448-5504-4072-a1b6-ad7ef9d548de.jpg)

[Why Ruby’s Timeout is dangerous (and Thread.raise is terrifying)](https://jvns.ca/blog/2015/11/27/why-rubys-timeout-is-dangerous-and-thread-dot-raise-is-terrifying/)

![](https://cdn.uploads.micro.blog/98548/2025/ebeeaeb7-9cd9-4586-88cf-a17e305cc962.jpg)

[The Oldest Bug In Ruby - Why Rack::Timeout Might Hose your Server](https://www.schneems.com/2017/02/21/the-oldest-bug-in-ruby-why-racktimeout-might-hose-your-server/)

![](https://cdn.uploads.micro.blog/98548/2025/7b8d7333-60d7-4c2f-907c-d571ab76fa26.jpg)

[Timeout: Ruby’s Most Dangerous API](https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/)

Oh…

![](https://media1.tenor.com/m/MYZgsN2TDJAAAAAC/this-is.gif)

Strangely, the thread docs say _nothing_ about the dangers of these methods[^4]. These articles are from 2008, 2015 and 2017. Surely no one uses it anymore, considering all that?

![](https://cdn.uploads.micro.blog/98548/2025/16ceaf75-1492-4520-b80e-d88e626580e0.jpg)

&gt; Nah.

In fairness to threaded gems that use these methods, they are using the official way you shutdown a thread. And they’re usually taking as many precautions as possible, prior to calling them. For the most part, gems use them as a shutdown mechanism, and give plenty of room for the thread to finish normally first.

The basic problem is this: `raise` and `kill` force your code to die at any point, with no guarantee of properly cleaning up. 

You might ask: “Couldn’t `ctrl+c` do the same thing?”. Yes, an OS signal could kill your process or program before an `ensure` runs, but then all related state is also removed - it can cause other issues, but at least your program cannot limp along in a corrupted state.

	ensure
	  # but, but Ruby - you _promised_ me this would run 😭
	  @value = false
	end

So are they pure evil? An occasional necessity? Somewhere in between? I’ll leave that discussion to the code philosophers… in the practical realm, follow these rules:

&gt; 📝 A small slice of this next section may look familiar. I included a bit of it in [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html#thread-raise-kill). This goes _much_ more in-depth

1. Don’t use `Thread.kill`, `kill` or `raise` unless you really, really know what you’re doing. Same applies for the `kill` aliases `exit` and `terminate`
2. If you need to stop a thread, you want it to tear down safely. Make it safely interruptible by adding in a condition that can allow your thread to finish early
3. Perform resource cleanups in an `ensure`
4. Don’t use the `timeout` module
5. If you use `rack-timeout`, you really should use `term_on_timeout`
6. When you use threads, even implicitly (aka, via configuration in Puma, Sidekiq, Falcon and SolidQueue), you can end up in weird states from them shutting down. It should be rare, but misbehaving threads (very long transactions, runaway CPU) are the most likely to experience this. Pay attention to long running queries or runaway CPU usage and treat it as an important bug
7. If you have something very critical that must properly clean up 100% of the time, you need [`handle_interrupt`](#thread-handle-interrupt)

&lt;h4 id=&#34;dont-kill-threads&#34;&gt;(1) Don&#39;t kill threads&lt;/h4&gt;

I’m watching you. Step away from that method, slowly, and no threads have to get hurt.

&lt;h4 id=&#34;interrupt-safely&#34;&gt;(2) Interrupt your thread safely&lt;/h4&gt;

Instead of killing your thread, set it up to be interruptible. Most mature, threaded frameworks operate this way.

	require &#34;concurrent&#34;
	
	still_kickin = Concurrent::AtomicBoolean.new(true)
	Thread.new do
	  while still_kickin.true?
	    # more work!
	  end
	end
	
	still_kickin.make_false
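
If your thread waits between units of work rather than spinning, a `Queue` gives you an interruptible sleep - `pop` with a timeout wakes as soon as anything is pushed. A small sketch of my own (the `timeout:` keyword requires Ruby 3.2+):

```ruby
stop_queue = Queue.new

t = Thread.new do
  loop do
    # pop returns nil after 5 seconds, or the pushed value immediately
    break if stop_queue.pop(timeout: 5)
    puts &#34;doing periodic work&#34;
  end
end

sleep 12           # let it run a couple of cycles
stop_queue &lt;&lt; true # wakes the thread right away, no waiting out the timer
t.join
```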

&lt;h4 id=&#34;ensure-cleanup&#34;&gt;(3) Ensure cleanup&lt;/h4&gt;

Whenever you _need_ something to run before a method finishes, you should always use an `ensure` block. `ensure` is kind of like a method lifeguard - even if something goes wrong, it’s there for you. It’s the place code goes to _ensure_ it’s run before the method finishes (even when an error is raised).

	def read_some_data
	  read_it
	  close_it # bad
	end
	
	def read_some_data
	  read_it
	ensure
	  close_it # good
	end

We _know_ `ensure` is not a silver bullet. `Thread#raise` and `Thread#kill` do not always respect it. But an `ensure` block is still where your cleanup is most likely to actually run.

&lt;h4 id=&#34;dont-use-timeout&#34;&gt;(4) Don’t use &lt;code&gt;timeout&lt;/code&gt;&lt;/h4&gt;

If you see this in code, be concerned:

	require &#34;timeout&#34;
	
	Timeout.timeout(1) do
	  # 😱 
	end

For some reason, the [`timeout`](https://github.com/ruby/timeout) gem itself doesn’t warn about any issues. But [Mike Perham summarizes it best](https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/):

![](https://cdn.uploads.micro.blog/98548/2025/727697a3-b461-4705-a1f1-8d0ed257857e.jpg)

There’s nothing that exactly matches what `timeout` offers: a blanket way of timing out _any_ operation after a specified time limit. But most gems and Ruby features offer their own, safer timeout mechanisms - there is a repository called [The Ultimate Guide to Ruby Timeouts](https://github.com/ankane/the-ultimate-guide-to-ruby-timeouts) which details everything you need to know. It shows you how to set timeouts safely for basically _every_ blocking operation you could care about timing out. For instance, how to properly handle timeouts using the [`redis`](https://github.com/redis/redis-rb) gem:

```ruby
Redis.new(
  connect_timeout: 1, 
  timeout: 1,
  #...
)
```
The one piece mentioned in that repository you should leave alone: `Net::HTTP` `open_timeout`. Behind the scenes it uses the `timeout` module 🙅‍♂️. Leave the 60 second default, it should almost never impact you, and you’re probably worse off lowering it.

Primarily people use the `timeout` gem to manage IO timeouts. In the unlikely case you want to timeout CPU-bound code, it’s up to you to implement it in your processing.
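
A minimal sketch of what that can look like - a self-imposed deadline checked between batches of work (the workload here is illustrative):

```ruby
deadline = Process.clock_gettime(Process::CLOCK_MONOTONIC) + 5

(1..10_000_000).each_slice(100_000) do |batch|
  batch.sum # stand-in for real CPU-bound work
  if Process.clock_gettime(Process::CLOCK_MONOTONIC) &gt; deadline
    raise &#34;went past the deadline, bailing out at a safe point&#34;
  end
end
```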

&lt;h4 id=&#34;use-term-on-timeout&#34;&gt;(5) If you use &lt;a href=&#34;https://github.com/zombocom/rack-timeout&#34;&gt;&lt;code&gt;rack-timeout&lt;/code&gt;&lt;/a&gt;, you really should use &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/doc/settings.md#term-on-timeout&#34;&gt;&lt;code&gt;term_on_timeout&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;

`rack-timeout` works similarly to the `timeout` module. And I already told you not to use that. So what gives? It will call `raise` on your threads - isn’t that bad?

The short answer is yes, it’s still bad.

_But_, `rack-timeout` is the only real option you have for timing out a web request in Puma. It’s meant as a last resort. From their docs:

&gt; `rack-timeout` is not a solution to the problem of long-running requests, it&#39;s a debug and remediation tool. App developers should track rack-timeout&#39;s data and address recurring instances of particular timeouts, for example by refactoring code so it runs faster or offsetting lengthy work to happen asynchronously.

On top of that, you should have your own lower level timeouts set so that they would fire before `rack-timeout`.

&gt; You&#39;ll want to set all relevant timeouts to something lower than `Rack::Timeout`&#39;s `service_timeout`. Generally you want them to be at least 1s lower, so as to account for time spent elsewhere during the request&#39;s lifetime while still giving libraries a chance to time out before Rack::Timeout.

The core issue of any thread `raise`/`kill` based solution is corrupted state. When using `rack-timeout`, you should be using [`term_on_timeout`](https://github.com/zombocom/rack-timeout/blob/main/doc/settings.md#term-on-timeout), ideally set to `1`.

`term_on_timeout` will send a `SIGTERM` to the worker the thread is running in, which for most servers indicates a need for a graceful shutdown of that process - any potential corrupted state is isolated to that process and will be cleaned up once the process is shutdown.

`term_on_timeout` only works properly if you’ve got multiple processes serving your requests. And if you get lots and lots of timeouts, it could potentially cause performance problems. See the docs for proper configuration!
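
Settings can come from environment variables, so a setup along these lines is possible (double-check the settings doc for your version):

```text
RACK_TIMEOUT_SERVICE_TIMEOUT=15 RACK_TIMEOUT_TERM_ON_TIMEOUT=1 \
  bundle exec puma -t 5:5 -w 2
```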

There _is_ an alternative idea floating around out there for achieving a “Safer Timeout”, at least in Rails apps: [https://web.meetcleo.com/blog/safer-timeouts](https://web.meetcleo.com/blog/safer-timeouts). Maybe I’ll detail it more in the future, but in the meantime, if you’re in a Rails app I would give it a read.

&lt;h4 id=&#34;monitor-your-thread-costs&#34;&gt;(6) Monitor your thread cost&lt;/h4&gt;

Having threads that do not stop easily is a bug. If you’re seeing rack-timeout errors, or jobs that can’t be shut down, track it like any other bug and allocate time to fix it.

&lt;h4 id=&#34;get-a-handle&#34;&gt;(7) Get a &lt;code&gt;handle_interrupt&lt;/code&gt; on things&lt;/h4&gt;

`Thread.handle_interrupt` is One Weird Trick `Thread#kill` Calls Don’t Want You To Know™. If we’re gonna discuss it, might as well go deep…

&lt;h4 id=&#34;thread-handle-interrupt&#34;&gt;&lt;a href=&#34;https://docs.ruby-lang.org/en/4.0/Thread.html#method-c-handle_interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;

A thread can be externally “interrupted” by a few things:

1. `Thread#kill`
2. `Thread#raise`
3. Your program being exited
4. A signal, like Ctrl+C

`handle_interrupt` gives you the ability to control how your program reacts to 1-3. And it means you can define blocks of code which will _guarantee_ their `ensure` blocks run.

`handle_interrupt` is a low-level interface and it’s also the one you are least likely to ever need. You’ll see it used in things like threaded web and job servers where low-level control and better cleanup guarantees are helpful. You’ll find examples of it in Sidekiq, the Async fiber scheduler, Homebrew, the parallel gem and more.

When you need the strongest guarantees possible about cleaning up your code in response to “interruption”, `handle_interrupt` is what you need.

Let’s look at a simple example:

	class KnockItOffError &lt; StandardError; end
	
	t = Thread.new do
	  sleep 2
	  puts &#34;done!&#34;
	end
	t.join(1)
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	sleep 0.1
	
	puts thread_status(t)
	#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;

Run that code 👆 and you’ll never see “done!” print. This is the same type of code we saw in the [`raise` and `kill`](#thread-raise-kill) section. What can `handle_interrupt` do for us?

	t = Thread.new do
	  Thread.handle_interrupt(KnockItOffError =&gt; :never) do
	    sleep 2
	    puts &#34;done!&#34;
	  end
	end
	
	t.join(1)
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	sleep 0.1
	
	puts thread_status(t)
	# done!
	#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;

Now we see “done!” printed! To be clear, the error will still be raised _eventually_. `handle_interrupt` only impacts the section it encloses, so the error is raised right after the block exits.

What’s with the interface - what does `KnockItOffError =&gt; :never` mean? Let’s break it down:

- `handle_interrupt` takes a hash. Each key is an exception class object, and each value is a symbol representing how to respond to the exception
- `KnockItOffError` in our example represents the class that will be handled. Descendants are included, so the setting applies to `KnockItOffError` and any of its subclasses.
- There are three different symbols allowed as values:
	- `:never` indicates that the exception will “never” interrupt any of the code inside of the block.
	- `:on_blocking` indicates that the exception can only be raised during a “blocking” operation. This includes things like IO read/write, `sleep`, and waiting on mutexes. Anything that releases the GVL.
	- `:immediate` indicates that the exception should be handled “immediately”. This is effectively the default behavior, so you would generally use this to re-apply an exception ignored at a higher level.

Based on that knowledge, let’s demonstrate a more complex example:

```ruby
t = Thread.new do
  Thread.handle_interrupt(KnockItOffError =&gt; :never) do
    # KnockItOffError can &#34;never&#34; run here
    Thread.handle_interrupt(KnockItOffError =&gt; :immediate) do
      # KnockItOffError runs &#34;immediate&#34;ly here
      sleep 2
      puts &#34;done!&#34;
    end
  # KnockItOffError can &#34;never&#34; run here
  ensure
    puts &#34;Can&#39;t touch this!&#34;
  end
end
	
t.join(1)
t.raise(KnockItOffError.new(&#34;👊&#34;))
sleep 0.1
	
puts thread_status(t)
# Can&#39;t touch this!
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;
```
In this example, “done!” is never printed because it is in the `:immediate` block. But we successfully print out the “Can’t touch this!” message in our ensure, because we’re within the `:never` block for `KnockItOffError`. `ensure` is now… _ensured_.

———

We’ve used `:never` and `:immediate`, what about `:on_blocking`?

```ruby
i = 0
t = Thread.new do
  Thread.handle_interrupt(
    KnockItOffError =&gt; :on_blocking
  ) do
    1_000_000.times do
      i += 1
    end
    puts &#34;👋&#34;
  end
end
t.join(0.01)
t.raise(KnockItOffError.new(&#34;👊&#34;))
sleep 1
puts &#34;i: #{i}&#34;
puts thread_status(t)
# i: 1000000
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;
```
Our increments work fine, as indicated by `i` being one million. But our `puts` is a “blocking” call so it gets the boot.

Should we have used a thread-safe counter? Let’s try it again using `Concurrent::AtomicFixnum` from `concurrent-ruby`, and two threads. We should see `i` as two million afterwards:

```ruby
require &#34;concurrent&#34;
	
i = Concurrent::AtomicFixnum.new(0)
	
def incrementing_thread(i)
  Thread.new do
    Thread.handle_interrupt(
      KnockItOffError =&gt; :on_blocking
    ) do
      1_000_000.times do
        i.increment
      end
      puts &#34;👋&#34;
    end
  end
end
	
t = incrementing_thread(i)
t2 = incrementing_thread(i)
	
t.join(0.01)
t2.join(0.01)
t.raise(KnockItOffError.new(&#34;👊&#34;))
t2.raise(KnockItOffError.new(&#34;👊👊&#34;))
sleep 1
puts &#34;i: #{i.value}&#34;
puts thread_status(t)
puts thread_status(t2)
# i: 1237719
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;
#&lt;ThreadStatus status=&#34;failed w/ error: 👊👊&#34; error=#&lt;KnockItOffError: 👊👊&gt;&gt;
```
Wait, why? `i` is `1237719`? Why is `i` not two million? What blocked?!

&gt; 📝 You’ll definitely see a different number. Sometimes you’ll see `1000000`, sometimes you’ll see a higher number, but you’ll pretty much never see `2000000`

As it turns out, `Concurrent::AtomicFixnum` uses a `Mutex` by default. If a `Mutex` waits to acquire a lock it is considered a blocking operation! That means it qualifies for `:on_blocking` and the error gets raised. 

As a specific fix for `AtomicFixnum`, if you install the `concurrent-ruby-ext` gem then you get native extensions which are lock-free, no longer use a `Mutex`, and properly run our code.

Once we install `concurrent-ruby-ext`, we properly get `2000000`!

```ruby
# You ran `gem install concurrent-ruby-ext`
# It automatically loads if present
# ...
puts &#34;i: #{i.value}&#34;
puts thread_status(t)
puts thread_status(t2)
# i: 2000000
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;
#&lt;ThreadStatus status=&#34;failed w/ error: 👊👊&#34; error=#&lt;KnockItOffError: 👊👊&gt;&gt;
```
But we also know that a `Mutex`, or any other locking/waiting behavior, can cause our `:on_blocking` interrupt to fire. So `:on_blocking` can behave surprisingly if some internal detail of the code changes later.

———

If the thread hasn’t started yet, `handle_interrupt` won’t help you. The error will be raised immediately in the thread, before `handle_interrupt` can be called:

	t = Thread.new do
	  Thread.handle_interrupt(
	    KnockItOffError =&gt; :never
	  ) do
	    puts &#34;welcome!&#34;
	    puts &#34;later 👋&#34;
	  end
	end
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	sleep 1
	puts thread_status(t)
	#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;

———

What happens after `handle_interrupt`? Once the error is allowed to raise, code directly after it won’t run:

	t = Thread.new do
	  Thread.handle_interrupt(
	    KnockItOffError =&gt; :never
	  ) do
	    puts &#34;can&#39;t stop won&#39;t stop&#34;
	    Thread.handle_interrupt(
	      KnockItOffError =&gt; :immediate
	    ) do
	      sleep 2
	      puts &#34;this won&#39;t run&#34;
	    end
	    puts &#34;this won&#39;t run either&#34;
	  end
	end
	
	t.join(1)
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	sleep 0.1
	puts thread_status(t)
	# can&#39;t stop won&#39;t stop
	#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;

But code after the inner `handle_interrupt` _could_ run - it just depends on whether the previous block raises. In this example, _all_ of the code runs successfully, because we don’t raise until after the inner block has finished:

	t = Thread.new do
	  Thread.handle_interrupt(
	    KnockItOffError =&gt; :never
	  ) do
	    puts &#34;can&#39;t stop won&#39;t stop&#34;
	    Thread.handle_interrupt(
	      KnockItOffError =&gt; :immediate
	    ) do
	      sleep 2
	      puts &#34;this won&#39;t run&#34;
	    end
	    sleep 2
	    puts &#34;this won&#39;t run either&#34;
	  end
	end
	
	t.join(3)
	t.raise(KnockItOffError.new(&#34;👊&#34;))
	t.join rescue nil # join re-raises the thread&#39;s error in the caller, so we swallow it here
	puts thread_status(t)
	# can&#39;t stop won&#39;t stop
	# this won&#39;t run
	# this won&#39;t run either
	#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;

But you’re better off guaranteeing that code after the inner block runs. Use an `ensure`, so that your code still runs even if the inner block raises an error:

```ruby
t = Thread.new do
  Thread.handle_interrupt(
    KnockItOffError =&gt; :never
  ) do
    puts &#34;can&#39;t stop won&#39;t stop&#34;
    Thread.handle_interrupt(
      KnockItOffError =&gt; :immediate
    ) do
      sleep 2
      puts &#34;this won&#39;t run&#34;
    end
  ensure
    puts &#34;this will consistently run&#34;
  end
end
	
t.join(1)
t.raise(KnockItOffError.new(&#34;👊&#34;))
sleep 0.1
puts thread_status(t)
# can&#39;t stop won&#39;t stop
# this will consistently run
#&lt;ThreadStatus status=&#34;failed w/ error: 👊&#34; error=#&lt;KnockItOffError: 👊&gt;&gt;
```
———

Can we even stop the unstoppable `Thread#kill`? Yep! From the [`Thread.handle_interrupt`](https://docs.ruby-lang.org/en/4.0/Thread.html#method-c-handle_interrupt-label-Inheritance+with+ExceptionClass) docs:

&gt; For handling all interrupts, use Object and not Exception as the ExceptionClass, as kill/terminate interrupts are not handled by Exception.

So we can handle it - but we have to use `Object`, which looks a bit odd but works well:

	t = Thread.new do
	  Thread.handle_interrupt(Object =&gt; :never) do
	    sleep 2
	    puts &#34;done!&#34;
	  end
	end
	
	t.join(1)
	t.kill
	t.join
	puts thread_status(t)
	# done!
	#&lt;ThreadStatus status=&#34;finished&#34; error=nil&gt;

The reason for this odd syntax is that the `kill`/`terminate` interrupts are internally handled [not as Exception instances, but as integers](https://bugs.ruby-lang.org/issues/15735). That means this would also work:

	t = Thread.new do
	  Thread.handle_interrupt(
	    Integer =&gt; :never
	  ) do
	    sleep 2
	    puts &#34;done!&#34;
	  end
	end
	
	t.join(1)
	t.kill
	t.join
	puts thread_status(t)
	# done!
	#&lt;ThreadStatus status=&#34;finished&#34; error=nil&gt;

Still, you’re better off using `Object` to avoid the implementation detail.

———

Can we stop the `timeout` gem from raising at a bad time using `handle_interrupt`? The Thread API docs used to [specifically use `timeout` as a use-case for `handle_interrupt`](https://docs.ruby-lang.org/en/3.3/Thread.html#method-c-handle_interrupt-label-Guarding+from+Timeout-3A-3AError), but [there’s a non-determinism bug](https://github.com/ruby/timeout/issues/41) around thread reuse within the `timeout` gem. 

So once again, [don’t use the `timeout` gem](#dont-use-timeout). 

I [removed the example from the docs](https://github.com/ruby/ruby/pull/11474) because it’s too broken, so on Ruby 3.4+, the docs no longer mention `handle_interrupt` with the `timeout` gem.

———

We’ve looked at many `handle_interrupt` examples - what do real gems use it for?

The [`async`](https://github.com/socketry/async) gem [uses `handle_interrupt`](https://github.com/socketry/async/blob/main/lib/async/scheduler.rb) to ignore `SignalException` while it shuts down its child tasks:

	# Stop all children, including transient children, ignoring any signals.
	def stop
	  Thread.handle_interrupt(::SignalException =&gt; :never) do
	    @children&amp;.each do |child|
	      child.stop
	    end
	  end
	end

In `sidekiq`, when it has gracefully attempted a shutdown and is forcing threads to finish, it raises a special error. That error extends `Interrupt`, which means most `rescue` blocks will not capture it because it is a child of `Exception` rather than `StandardError`:

	module Sidekiq
	  class Shutdown &lt; Interrupt; end
	end
	# later...
	t.raise(Sidekiq::Shutdown)
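
You can convince yourself why that matters with a tiny sketch - a typical `rescue` never sees it:

```ruby
begin
  raise Sidekiq::Shutdown
rescue StandardError
  puts &#34;never reached - a bare rescue only catches StandardError&#34;
rescue Interrupt
  puts &#34;caught it - you have to opt into Exception-level rescues&#34;
end
```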

To avoid `Sidekiq::Shutdown` breaking _everything_ (including its own internal code), Sidekiq also uses `handle_interrupt` to ignore the error in a small piece of shutdown code:

	IGNORE_SHUTDOWN_INTERRUPTS = {Sidekiq::Shutdown =&gt; :never}
	ALLOW_SHUTDOWN_INTERRUPTS = {Sidekiq::Shutdown =&gt; :immediate}
	
	def process(uow)
	  jobstr = uow.job
	  queue = uow.queue_name
	     
	  #... process logic
	  ack = false
	  Thread.handle_interrupt(
	    IGNORE_SHUTDOWN_INTERRUPTS
	  ) do
	    Thread.handle_interrupt(
	      ALLOW_SHUTDOWN_INTERRUPTS
	    ) do
	      dispatch(...) do |inst|
	        config.server_middleware... do
	          execute_job(...)
	        end
	      end
	      ack = true
	    rescue Sidekiq::Shutdown
	      # Had to force kill this job because it didn&#39;t finish
	      # within the timeout.  Don&#39;t acknowledge the work since
	      # we didn&#39;t properly finish it.
	    end
	  ensure
	    if ack
	      uow.acknowledge
	    end
	  end
	end

If this section hasn’t been enough for you, Ben Sheldon gives some additional interesting examples in his article [Appropriately using Ruby’s thread `handle_interrupt`](https://island94.org/2023/08/appropriately-using-rubys-thread-handle_interrupt).

&lt;h4 id=&#34;ensure-success&#34;&gt;The way you &lt;code&gt;ensure&lt;/code&gt; success&lt;/h4&gt;

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/40e94602-9e2b-4dd2-a60f-3a407944cbe8.jpg&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;

I’m pretty confident that, started from scratch, Ruby would not be implemented with `raise` and `kill` again. I don’t know _which_ model they would choose - but something like a [Java interrupt](https://docs.oracle.com/javase/tutorial/essential/concurrency/interrupt.html) would be a good start. And minimally, making [all `ensure` blocks uninterruptible](https://bsky.app/profile/headius.bsky.social/post/3lvnkypqewk2b), as well as [all finalizers](https://www.manageiq.org/blog/2017/09/finalizers-can-be-interrupted-from-time-to-time/). I didn’t even get into finalizers - they’re a less common, but also important area that you _really_ don’t want to interrupt.

Ruby is one of the only programming languages that lets you kill a thread from outside of the thread. It’s powerful, but mostly, it’s dangerous. It’s one of the sharpest tools available to you, and it should be used _sparingly_, or ideally not at all.

In threaded code, the best offense is a good defense:

- Consider how to safely manage your threads. If your thread is going to do a lot of work, or expensive work, make sure you have an escape hatch (like a boolean check for interrupts)
- Treat performance issues like a bug. If you have threads that are hogging resources and timing things out, you need to fix them. `kill`ing them is not a long-term answer
- Ultimately, there is no way to guarantee the safety of your code 100% of the time. The OS could kill your program. Your server could get unplugged. You’re always going to have edge cases that you can’t foresee. But control what you can, and be aware of what can go wrong.

Now go forth, armed with the knowledge on what to do when good threads go bad.

[^1]:	Unless you’re in SQLite, where apparently N+1 queries are a virtue 😲 [https://www.sqlite.org/np1queryprob.html](https://www.sqlite.org/np1queryprob.html)

[^2]:	In general, you shouldn’t do this directly

[^3]:	And processes.

    Or use Falcon. See me later in the series when we talk about Fibers 😏

[^4]:	There’s a small mention about how to handle Timeout errors, but it doesn’t explain much or warn at all 
</source:markdown>
    </item>
    
    <item>
      <title>Bitmasks, Ruby Threads and Interrupts, oh my!</title>
      <link>https://jpcamara.com/2025/10/21/bitmasks-threads-and-interrupts.html</link>
      <pubDate>Tue, 21 Oct 2025 21:12:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2025/10/21/bitmasks-threads-and-interrupts.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-copy-of-pending-errors.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is part of series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my!&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “Bitmasks, Threads and Interrupts, oh my!&amp;quot;. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#interrupting-threads&#34;&gt;Interrupting Threads&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#managing-threads&#34;&gt;Managing threads&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#important-interruption&#34;&gt;An important interruption&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#integer-masks&#34;&gt;Why integer masks?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#checking-for-interrupts&#34;&gt;Checking for interrupts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#jumping-to-instructions&#34;&gt;Jumping to instructions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#pervasive-interrupts&#34;&gt;Pervasive interrupts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#interrupt-masks&#34;&gt;Interrupt masks&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#masks-all-the-way&#34;&gt;It’s masks all the way down&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#weve-been-waiting&#34;&gt;The &lt;code&gt;interrupt&lt;/code&gt;ion we’ve all been waiting for&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#timer-interrupt-mask&#34;&gt;&lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#trap-interrupt-mask&#34;&gt;&lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#pending-interrupt-mask&#34;&gt;&lt;code&gt;PENDING_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#terminate-interrupt-mask&#34;&gt;&lt;code&gt;TERMINATE_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#postponed-job-interrupt-mask&#34;&gt;&lt;code&gt;POSTPONED_JOB_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#vm-barrier-interrupt-mask&#34;&gt;&lt;code&gt;VM_BARRIER_INTERRUPT_MASK&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;interrupting-threads&#34;&gt;Interrupting Threads 🧵&lt;/h2&gt;
&lt;p&gt;The Ruby thread scheduler is &lt;em&gt;rude&lt;/em&gt;. There, I said it.&lt;/p&gt;
&lt;p&gt;It’s always telling your threads how to run, when to run, how long they can run - it stops them whenever it wants and then tells &lt;em&gt;other&lt;/em&gt; threads &lt;em&gt;they&lt;/em&gt; have to start working. It’s bossy as hell. On top of that, it’s not even polite about it. It feels free to &lt;em&gt;interrupt&lt;/em&gt; your threads - it decides when, and your threads have to just play along and listen.&lt;/p&gt;
&lt;p&gt;But threads put up with it. They even &lt;em&gt;benefit&lt;/em&gt; from it, if you can believe that. The thread scheduler may be abrupt, but it&amp;rsquo;s looking out for the runtime. If that’s the case - there must be a good reason… Why do threads put up with these scheduler shenanigans?&lt;/p&gt;
&lt;h3 id=&#34;managing-threads&#34;&gt;Managing threads 👷🏼‍♂️👷🏻‍♀️&lt;/h3&gt;
&lt;p&gt;An important purpose of a thread scheduler is efficiency and fairness. It manages what threads are running, for how long, and what context each thread gets loaded with.&lt;/p&gt;
&lt;p&gt;In normal operation this comes down to a few things:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Time sharing: every thread, under normal circumstances, gets roughly 100ms of CPU runtime&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;. Since only one thread can run Ruby code at a time&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;, this keeps a single thread from dominating the program&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;Blocking operations: certain operations will “block” a thread. Most forms of IO, and &lt;code&gt;sleep&lt;/code&gt;, for instance. When the thread blocks, the thread scheduler allows other threads to run&lt;/li&gt;
&lt;li&gt;Priority: we can suggest a priority for our threads, and the thread scheduler will take it into consideration when choosing what to run next and for how long&lt;/li&gt;
&lt;li&gt;Passing control: we can suggest actions to the thread scheduler, like &lt;code&gt;pass&lt;/code&gt;ing control or &lt;code&gt;stop&lt;/code&gt;ping the current thread so other threads can take over&lt;/li&gt;
&lt;li&gt;Locking: we can synchronize access to resources, and the thread scheduler chooses the order of access to the resource&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;How does it handle all this?&lt;/p&gt;
&lt;h3 id=&#34;important-interruption&#34;&gt;An important interruption&lt;/h3&gt;
&lt;p&gt;The scheduler isn’t a single object - it is the behavior produced by specific VM checks and functions which opt-in to thread scheduling behavior. In this post, we’ll focus on time sharing, priority, and interrupting threads - which are managed through a concept deeply woven into CRuby - a set of functions the VM calls at specific checkpoints.&lt;/p&gt;
&lt;p&gt;In the CRuby runtime, this concept revolves around “interrupts”&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;. It contains a set of possible events that could “interrupt” the flow of a Ruby program in general, and different threads in particular. There are several interrupt events possible:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Timer interrupt&lt;/li&gt;
&lt;li&gt;Trap interrupt&lt;/li&gt;
&lt;li&gt;Pending interrupt&lt;/li&gt;
&lt;li&gt;Terminate interrupt&lt;/li&gt;
&lt;li&gt;VM barrier interrupt&lt;/li&gt;
&lt;li&gt;Postponed job interrupt&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the CRuby internals, these are represented by an integer mask. If we took the C code and represented it in Ruby, it would look like this (each mask is a hex value):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;TIMER_INTERRUPT_MASK&lt;/span&gt;         &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x01&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;PENDING_INTERRUPT_MASK&lt;/span&gt;       &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x02&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;POSTPONED_JOB_INTERRUPT_MASK&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x04&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;TRAP_INTERRUPT_MASK&lt;/span&gt;          &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x08&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;TERMINATE_INTERRUPT_MASK&lt;/span&gt;     &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x10&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;VM_BARRIER_INTERRUPT_MASK&lt;/span&gt;    &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x20&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;integer-masks&#34;&gt;Why integer masks?&lt;/h3&gt;
&lt;p&gt;What is an integer mask, and why would CRuby represent internal states this way?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 “mask” is a conventional name for a pattern to isolate specific bits. A “mask” sounds like something that would cover up something else (ie, a mask covering your face). In a sense, these serve a similar purpose - the mask is put on or taken off, clearing or setting bits and testing them efficiently.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Integer masks provide a compact and efficient way to represent multiple program states within a single number. Each state is stored as a single bit - representing a power of 2 - so in concept a 64-bit integer can represent up to 64 independent on/off states. Here’s a visualization of the first 8 bits in a number (a byte):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;0 0 0 0 0 0 0 0
| | | | | | | |
| | | | | | | +- 1        (2^0)     0x01
| | | | | | +--- 2        (2^1)     0x02
| | | | | +----- 4        (2^2)     0x04
| | | | +------- 8        (2^3)     0x08
| | | +--------- 16       (2^4)     0x10
| | +----------- 32       (2^5)     0x20
| +------------- 64       (2^6)     0x40
+--------------- 128      (2^7)     0x80
      Byte       Decimal  Power     Hex
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;This is an adaptation of a visual from a blog post about bitwise operations: &lt;a href=&#34;https://www.hendrik-erz.de/post/bitwise-flags-are-beautiful-and-heres-why&#34;&gt;https://www.hendrik-erz.de/post/bitwise-flags-are-beautiful-and-heres-why&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Being able to represent all this information in a compact way is convenient, but checking whether it matches a particular mask is also &lt;em&gt;very&lt;/em&gt; fast. CPUs are well optimized for this sort of thing.&lt;/p&gt;
&lt;p&gt;There are several operators for performing these checks called “bitwise” operators. Here’s a table of the operators, and the impact they have on bits:&lt;/p&gt;
&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;&lt;/th&gt;
			&lt;th&gt;&lt;/th&gt;
			&lt;th&gt;
				AND
			&lt;/th&gt;
			&lt;th&gt;
				OR
			&lt;/th&gt;
			&lt;th&gt;
				XOR
			&lt;/th&gt;
			&lt;th&gt;
				NOT
			&lt;/th&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;th&gt;
				A
			&lt;/th&gt;
			&lt;th&gt;
				B
			&lt;/th&gt;
			&lt;th&gt;
				A &amp; B
			&lt;/th&gt;
			&lt;th&gt;
				A | B
			&lt;/th&gt;
			&lt;th&gt;
				A ^ B
			&lt;/th&gt;
			&lt;th&gt;
				~A
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;We can demonstrate using the Ruby binary syntax&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Bitwise AND:&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000101&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;
  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000110&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0b00000100&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;Bitwise OR:&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000101&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000110&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0b00000111&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;Bitwise XOR:&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000101&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;^&lt;/span&gt;
  &lt;span style=&#34;color:#ae81ff&#34;&gt;0b00000110&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0b00000011&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;Bitwise NOT:&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;  (&lt;span style=&#34;color:#f92672&#34;&gt;~&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0b11111000&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0xFF&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0b00000111&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;The &lt;code&gt;&amp;amp; 0xFF&lt;/code&gt; forces us into 8 bits to demonstrate the &lt;code&gt;NOT&lt;/code&gt; correctly&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;These aren’t relevant to masks specifically, but for completeness, you can also shift bits left or right to change values:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;  0b00000101 &amp;lt;&amp;lt; 1 # Left shift
# 0b00001010

  0b00000101 &amp;gt;&amp;gt; 1 # Right shift
# 0b00000010
&lt;/code&gt;&lt;/pre&gt;
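&lt;p&gt;Putting the operators together: here’s a minimal sketch of packing three on/off states into a single integer, then checking, setting and clearing them (the flag names are made up for illustration):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;BOLD      = 0b001
ITALIC    = 0b010
UNDERLINE = 0b100

style = BOLD | UNDERLINE # set two bits  =&amp;gt; 0b101
(style &amp;amp; ITALIC) != 0    # check a bit   =&amp;gt; false
style &amp;amp;= ~BOLD           # clear a bit   =&amp;gt; 0b100
style ^= UNDERLINE       # toggle a bit  =&amp;gt; 0b000
&lt;/code&gt;&lt;/pre&gt;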
&lt;h3 id=&#34;checking-for-interrupts&#34;&gt;Checking for interrupts&lt;/h3&gt;
&lt;p&gt;The reason these efficient mask checks matter is that these interrupts are checked &lt;em&gt;a lot&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Here’s a program that simply iterates for a little while, incrementing a counter&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; i &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;500_000&lt;/span&gt;
  i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This will check interrupts &lt;em&gt;five hundred thousand times&lt;/em&gt;&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;, one check for each iteration of the loop. That’s a lot. And if your programming language is going to do something a lot, it needs to be efficient. The overhead of checking for interrupts should be undetectable in your Ruby program. As discussed earlier, bit mask checks are one of the most efficient checks you can make.&lt;/p&gt;
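&lt;p&gt;Those per-iteration checks are also what make a CPU-bound loop interruptible at all. As a sketch, here’s a bigger version of the loop being interrupted from another thread - the &lt;code&gt;RuntimeError&lt;/code&gt; from &lt;code&gt;Thread#raise&lt;/code&gt; gets delivered at one of those checkpoints:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new do
  i = 0
  i += 1 while i &amp;lt; 500_000_000
end
t.report_on_exception = false # quiet the default termination warning

sleep 0.1
t.raise(&amp;#34;interrupted mid-loop&amp;#34;)

begin
  t.join
rescue =&amp;gt; e
  puts e.message # interrupted mid-loop
end
&lt;/code&gt;&lt;/pre&gt;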
&lt;p&gt;But why does this innocuous program need to check for interrupts so often? It’s part of the thread scheduler opt-in! The Ruby virtual machine is filled with checkpoints where it is safe for Ruby internals to check for interruptions in the program. One of those checkpoints is an &lt;code&gt;if&lt;/code&gt; statement (did you think I’d say &lt;code&gt;while&lt;/code&gt; loop?!).&lt;/p&gt;
&lt;p&gt;Let’s &lt;a href=&#34;https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html#inside-the-vm&#34;&gt;disassemble this into Ruby bytecode&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;puts &lt;span style=&#34;color:#66d9ef&#34;&gt;RubyVM::InstructionSequence&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;compile(
  &lt;span style=&#34;color:#66d9ef&#34;&gt;DATA&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read
)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;disassemble
	
&lt;span style=&#34;color:#75715e&#34;&gt;__END__
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;i = 0
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;while i &amp;lt; 500_000
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  i += 1
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;end
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Which gives us the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;0000 putobject_INT2FIX_0_
0001 setlocal_WC_0           i@0    # | i = 0
0003 jump                    16     # | jump to while i &amp;lt; ... 
...
0009 getlocal_WC_0           i@0    # | i += 1
0011 putobject_INT2FIX_1_           # |
0012 opt_plus                       # |
0014 setlocal_WC_0           i@0    # |____________
0016 getlocal_WC_0           i@0    # | i &amp;lt; 500_000
0018 putobject               500000 # |
0020 opt_lt                         # |
0022 branchif                9      # | jump to instruction 9,
                                    # | which is i += 1
                                    # |____________
0024 putnil
0025 leave
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;For the moment you can trust me that &lt;code&gt;branchif&lt;/code&gt; is the critical instruction here. Let’s see how &lt;code&gt;branchif&lt;/code&gt; is defined:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;DEFINE_INSN
branchif
(OFFSET dst)
(VALUE val)
()
{
    if (RTEST(val)) {
        RUBY_VM_CHECK_INTS(ec);
        JUMP(dst);
    }
}
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;❗️Woah! What the heck is that weird syntax? Is that Ruby? Is that C?&lt;/p&gt;
&lt;p&gt;It’s neither! This is a special, CRuby internal specific DSL that is &lt;em&gt;similar&lt;/em&gt; to C. In CRuby, there is a file called &lt;code&gt;insns.def&lt;/code&gt; which defines every instruction the Ruby Virtual Machine (YARV) can run.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;DEFINE_INSN&lt;/code&gt; tells us we are defining an instruction&lt;/li&gt;
&lt;li&gt;&lt;code&gt;branchif&lt;/code&gt; is the instruction name&lt;/li&gt;
&lt;li&gt;&lt;code&gt;OFFSET dst&lt;/code&gt; is the argument - 9 in our case, which would take us to &lt;code&gt;0009 getlocal_WC_0&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;VALUE val&lt;/code&gt; is the last value pushed on the stack - the result of &lt;code&gt;i &amp;lt; 500_000&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;()&lt;/code&gt; that last empty set of parens is the optional return value - we don’t have one - we jump if &lt;code&gt;val&lt;/code&gt; is true, or we fall through&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;p&gt;Interesting! A couple things stick out to me here when &lt;code&gt;RTEST(val)&lt;/code&gt; (our &lt;code&gt;while&lt;/code&gt; condition) is true:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;We’re running &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt; anytime we call an &lt;code&gt;if&lt;/code&gt; statement. &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt; is a key function for checking the interrupt queue. It’s embedded within VM instructions themselves!&lt;/li&gt;
&lt;li&gt;We &lt;code&gt;JUMP&lt;/code&gt; to a destination&lt;sup id=&#34;fnref:8&#34;&gt;&lt;a href=&#34;#fn:8&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;8&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;blockquote&gt;
&lt;p&gt;Fun fact: one of the places &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt; is called is from the &lt;code&gt;once&lt;/code&gt; bytecode instruction. An unexpected callback to my article &lt;a href=&#34;https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html&#34;&gt;The o in Ruby regex stands for “oh the humanity!”&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&#34;jumping-to-instructions&#34;&gt;Jumping to instructions&lt;/h3&gt;
&lt;p&gt;In the typical case of a &lt;code&gt;branchif&lt;/code&gt;, it would jump to the appropriate part of an &lt;code&gt;if&lt;/code&gt; statement:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; is_it_true?
  &lt;span style=&#34;color:#75715e&#34;&gt;# if it is true, jump here!&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# if it isn&amp;#39;t, jump here!&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;What caught my eye is that internally an &lt;code&gt;if&lt;/code&gt; statement basically acts like &lt;code&gt;goto&lt;/code&gt;! Sorry &lt;a href=&#34;https://en.wikipedia.org/wiki/Considered_harmful&#34;&gt;Dijkstra&lt;/a&gt;&lt;sup id=&#34;fnref:9&#34;&gt;&lt;a href=&#34;#fn:9&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;And because &lt;code&gt;branchif&lt;/code&gt; can jump anywhere you tell it, that also means it can jump to &lt;em&gt;previous&lt;/em&gt; code as well. In the case of a &lt;code&gt;while&lt;/code&gt; loop, &lt;code&gt;branchif&lt;/code&gt; truly takes on its &lt;code&gt;goto&lt;/code&gt; roots. Instead of jumping to a future piece of code, it reruns the content of the &lt;code&gt;while&lt;/code&gt; loop by jumping back to earlier instructions!&lt;/p&gt;
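&lt;p&gt;Here’s a toy dispatch loop - emphatically &lt;em&gt;not&lt;/em&gt; CRuby, just a sketch - showing why a backwards jump makes a loop. &lt;code&gt;branchif&lt;/code&gt; just moves the “program counter”, and moving it to an earlier instruction re-runs the body:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;i  = 0
pc = 0
loop do
  case pc
  when 0 then i += 1; pc = 1     # 0009: i += 1
  when 1 then pc = i &amp;lt; 3 ? 0 : 2 # 0022: branchif, jump back to 0009
  when 2 then break              # 0025: leave
  end
end
i # =&amp;gt; 3
&lt;/code&gt;&lt;/pre&gt;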
&lt;h3 id=&#34;pervasive-interrupts&#34;&gt;Pervasive interrupts&lt;/h3&gt;
&lt;p&gt;Want to double the number of checks from our example? Let’s &lt;code&gt;add&lt;/code&gt; a method:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;add&lt;/span&gt;(a, b)
  a &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; b
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
j &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; i &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;500_000&lt;/span&gt;
  i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
  j &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; add(i, j)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Now CRuby checks the interrupt queue &lt;em&gt;one million times&lt;/em&gt;&lt;sup id=&#34;fnref:10&#34;&gt;&lt;a href=&#34;#fn:10&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;10&lt;/a&gt;&lt;/sup&gt;! That’s because of the &lt;code&gt;opt_send_without_block&lt;/code&gt; instruction, which is one of the instructions for Ruby method calls:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;DEFINE_INSN
opt_send_without_block
(CALL_DATA cd)
(...)
(VALUE val)
{
    // ...
    val = vm_sendish(ec, GET_CFP(), cd, bh, mexp_search_method);
    // Before returning from exec, int check!
    JIT_EXEC(ec, val);
    // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;👆More of that fancy CRuby DSL&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We know that interrupt checks are woven into VM instructions themselves in &lt;code&gt;insns.def&lt;/code&gt;, but &lt;code&gt;insns.def&lt;/code&gt; isn&amp;rsquo;t the only place. Interrupts are checked throughout CRuby. For example, in:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;IO&lt;/li&gt;
&lt;li&gt;Threads&lt;/li&gt;
&lt;li&gt;Processes&lt;/li&gt;
&lt;li&gt;The Regex engine&lt;/li&gt;
&lt;li&gt;Bignum&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;And you&amp;rsquo;ll find the checks in various forms: &lt;code&gt;RUBY_VM_CHECK_INTS_BLOCKING&lt;/code&gt;, &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt;, &lt;code&gt;rb_thread_check_ints&lt;/code&gt;, &lt;code&gt;vm_check_ints_blocking&lt;/code&gt;, &lt;code&gt;vm_check_ints&lt;/code&gt;, etc.&lt;/p&gt;
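&lt;p&gt;That pervasiveness has a practical payoff: because the regex engine checks interrupts mid-match, even a catastrophically backtracking pattern stays interruptible. Here’s a contrived sketch - Ctrl+C raises an &lt;code&gt;Interrupt&lt;/code&gt; instead of hanging until the match finishes:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# (a+)+ followed by an unmatchable &amp;#34;b&amp;#34; backtracks exponentially (the
# backreference keeps Ruby 3.2+&amp;#39;s regex memoization from defusing it).
# The match churns for ages, but Ctrl+C still lands mid-match:
(&amp;#34;a&amp;#34; * 28) =~ /\A(a+)+b\1\z/
&lt;/code&gt;&lt;/pre&gt;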
&lt;p&gt;We know &lt;em&gt;what&lt;/em&gt; gets called to check for interrupts - but how do these &amp;ldquo;ints&amp;rdquo; get set by CRuby?&lt;/p&gt;
&lt;h3 id=&#34;interrupt-masks&#34;&gt;Interrupt masks&lt;/h3&gt;
&lt;p&gt;CRuby has macros for setting each of the interrupt flags:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_TIMER_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, TIMER_INTERRUPT_MASK)
&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, PENDING_INTERRUPT_MASK)
&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_POSTPONED_JOB_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, POSTPONED_JOB_INTERRUPT_MASK)
&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_TRAP_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, TRAP_INTERRUPT_MASK)
&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_TERMINATE_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, TERMINATE_INTERRUPT_MASK)
&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_SET_VM_BARRIER_INTERRUPT(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  ATOMIC_OR((ec)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_flag, VM_BARRIER_INTERRUPT_MASK)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 &lt;code&gt;ec&lt;/code&gt; in these examples refers to the “execution context”, which contains per-thread information about the running Ruby program&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This &lt;code&gt;ATOMIC_OR&lt;/code&gt; macro is an abstraction on top of bitwise operations that stays roughly as efficient, but makes sure the operations run atomically. Multiple operating system threads can run this at the same time - &lt;code&gt;ATOMIC_OR&lt;/code&gt; helps to avoid &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write&#34;&gt;read-modify-write&lt;/a&gt; issues.&lt;/p&gt;
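&lt;p&gt;To see what &lt;code&gt;ATOMIC_OR&lt;/code&gt; guards against, here’s &lt;code&gt;|=&lt;/code&gt; decomposed into its three steps, in plain Ruby:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;flag = 0
mask = 0b0001

# flag |= mask is really a read-modify-write:
tmp  = flag # 1. read
tmp |= mask # 2. modify
flag = tmp  # 3. write back

# If another thread writes to flag between steps 1 and 3, its bits get
# silently overwritten. ATOMIC_OR runs all three steps indivisibly.
&lt;/code&gt;&lt;/pre&gt;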
&lt;p&gt;Those abstractions obscure the actual bitwise operation. We learned a bit about Ruby bitwise operations earlier, so let’s show these macros in Ruby form for clarity:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ExecutionContext&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;

  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_timer_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;TIMER_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;PENDING_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_postponed_job_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;POSTPONED_JOB_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_trap_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;TRAP_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_terminate_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;TERMINATE_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_set_vm_barrier_interrupt&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;VM_BARRIER_INTERRUPT_MASK&lt;/span&gt;
    self
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Let&amp;rsquo;s also add some binary conversion methods so we can conveniently print our results:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;to_b&lt;/span&gt;(number)
  number&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;to_s(&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;rjust(&lt;span style=&#34;color:#ae81ff&#34;&gt;8&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;0&amp;#39;&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ExecutionContext&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;interrupt_to_b&lt;/span&gt;
    to_b(@interrupt_flag)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 &lt;a href=&#34;https://docs.ruby-lang.org/en/master/Integer.html#method-i-to_s&#34;&gt;Integer#to_s&lt;/a&gt; can be handed a &lt;code&gt;base&lt;/code&gt;, which converts to the specified base before returning as a string. In our case, we are converting it to base 2 to show it as binary. We then &lt;code&gt;rjust&lt;/code&gt; to pad the left side with 0’s. So, for instance, &lt;code&gt;to_b(2)&lt;/code&gt; returns &lt;code&gt;00000010&lt;/code&gt;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;For reference, here are the CRuby interrupt masks and which bits they set in our &lt;code&gt;interrupt_flag&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;0 0 0 0 0 0 0 0 = 0x0 = interrupt_flag
    | | | | | |
    | | | | | +- TIMER_INTERRUPT_MASK
    | | | | +--- PENDING_INTERRUPT_MASK
    | | | +----- POSTPONED_JOB_INTERRUPT_MASK
    | | +------- TRAP_INTERRUPT_MASK
    | +--------- TERMINATE_INTERRUPT_MASK
    +----------- VM_BARRIER_INTERRUPT_MASK
&lt;/code&gt;&lt;/pre&gt;
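&lt;p&gt;One housekeeping note: our Ruby sketch references mask constants that CRuby defines as C macros. To make the &lt;code&gt;ExecutionContext&lt;/code&gt; class actually runnable, define them to mirror the bit layout above:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;TIMER_INTERRUPT_MASK         = 1 &amp;lt;&amp;lt; 0 # 0b00000001
PENDING_INTERRUPT_MASK       = 1 &amp;lt;&amp;lt; 1 # 0b00000010
POSTPONED_JOB_INTERRUPT_MASK = 1 &amp;lt;&amp;lt; 2 # 0b00000100
TRAP_INTERRUPT_MASK          = 1 &amp;lt;&amp;lt; 3 # 0b00001000
TERMINATE_INTERRUPT_MASK     = 1 &amp;lt;&amp;lt; 4 # 0b00010000
VM_BARRIER_INTERRUPT_MASK    = 1 &amp;lt;&amp;lt; 5 # 0b00100000
&lt;/code&gt;&lt;/pre&gt;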
&lt;p&gt;Knowing that, and equipped with our &lt;code&gt;ExecutionContext&lt;/code&gt; class, let&amp;rsquo;s set some flags!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;new_ec&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ExecutionContext&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_timer_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00000001&amp;#34;&lt;/span&gt;
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00000010&amp;#34;&lt;/span&gt;
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_postponed_job_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00000100&amp;#34;&lt;/span&gt;
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_trap_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00001000&amp;#34;&lt;/span&gt;
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_terminate_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00010000&amp;#34;&lt;/span&gt;
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_vm_barrier_interrupt&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00100000&amp;#34;&lt;/span&gt;
	
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_timer_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_postponed_job_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_trap_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_terminate_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_vm_barrier_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00111111&amp;#34;&lt;/span&gt;
	
new_ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_timer_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_trap_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_vm_barrier_interrupt
  &lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;interrupt_to_b
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; &amp;#34;00101011&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It&amp;rsquo;s nice that we understand how to set them, but the flags don’t do anything on their own. They must be interpreted by one of the opt-in functions. We’ve been beating around the bush long enough. We’re opting in, great. What do these opt-in functions actually &lt;em&gt;do&lt;/em&gt;?&lt;/p&gt;
&lt;h3 id=&#34;masks-all-the-way&#34;&gt;It&#39;s masks all the way down&lt;/h3&gt;
&lt;p&gt;Let’s start with &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt;. This is a macro that gets replaced with a function call to &lt;code&gt;rb_vm_check_ints&lt;/code&gt;. Inside of &lt;code&gt;rb_vm_check_ints&lt;/code&gt;, it calls &lt;code&gt;RUBY_VM_INTERRUPTED_ANY&lt;/code&gt;, and if that is true it calls &lt;code&gt;rb_threadptr_execute_interrupts&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;#define RUBY_VM_CHECK_INTS(ec) rb_vm_check_ints(ec)
static inline void
rb_vm_check_ints(rb_execution_context_t *ec)
{
  if (UNLIKELY(RUBY_VM_INTERRUPTED_ANY(ec))) {
    rb_threadptr_execute_interrupts(rb_ec_thread_ptr(ec), 0);
  }
}
&lt;/code&gt;&lt;/pre&gt;
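&lt;p&gt;In Ruby terms the shape is simple: a cheap mask check guards the more expensive work. A sketch for our &lt;code&gt;ExecutionContext&lt;/code&gt; - &lt;code&gt;execute_interrupts&lt;/code&gt; is a stand-in name for &lt;code&gt;rb_threadptr_execute_interrupts&lt;/code&gt;, and &lt;code&gt;ruby_vm_interrupted_any?&lt;/code&gt; is the predicate we’re about to look at:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ExecutionContext
  # ...
  def ruby_vm_check_ints
    execute_interrupts if ruby_vm_interrupted_any?
  end
end
&lt;/code&gt;&lt;/pre&gt;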
&lt;p&gt;We want to get to &lt;code&gt;rb_threadptr_execute_interrupts&lt;/code&gt;, but what does &lt;code&gt;RUBY_VM_INTERRUPTED_ANY&lt;/code&gt; do?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;static inline bool
RUBY_VM_INTERRUPTED_ANY(rb_execution_context_t *ec)
{
	// ...
    return ATOMIC_LOAD_RELAXED(ec-&amp;gt;interrupt_flag) &amp;amp; ~(ec)-&amp;gt;interrupt_mask;
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Seems simple. Let’s translate the code into our Ruby &lt;code&gt;ExecutionContext&lt;/code&gt; class:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ExecutionContext
  # ...
  def ruby_vm_interrupted_any?
    # (flag &amp;amp; ~mask) != 0
    (@interrupt_flag &amp;amp; ~@interrupt_mask) != 0
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Then set a flag and try it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;new_ec.ruby_vm_set_interrupt.ruby_vm_interrupted_any?
# `ruby_vm_interrupted_any?&#39;: undefined method `~&#39; for nil
# (@interrupt_flag &amp;amp; ~@interrupt_mask) != 0
#                    ^
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Oops! I didn’t define &lt;code&gt;@interrupt_mask&lt;/code&gt;. What is that exactly!? Looks like it’s defined alongside the &lt;code&gt;interrupt_flag&lt;/code&gt; on the execution context.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;struct rb_execution_context_struct {
  // ...
  rb_atomic_t interrupt_flag;
  rb_atomic_t interrupt_mask;
  // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;👺 It’s a mask! It’s a flag! It’s a… confusing mental model…&lt;/p&gt;
&lt;p&gt;We have &lt;code&gt;interrupt_flag&lt;/code&gt;, we have the various &lt;code&gt;*_INTERRUPT_MASK&lt;/code&gt; constants, and now &lt;code&gt;interrupt_mask&lt;/code&gt;. Getting a little lost? I was.&lt;/p&gt;
&lt;p&gt;I think it’s helpful to think of &lt;code&gt;interrupt_flag&lt;/code&gt; as &lt;code&gt;interrupt_pending&lt;/code&gt;, and &lt;code&gt;interrupt_mask&lt;/code&gt; as &lt;code&gt;interrupt_blocked&lt;/code&gt;. &lt;code&gt;interrupt_flag&lt;/code&gt; contains operations waiting to run, and &lt;code&gt;interrupt_mask&lt;/code&gt; contains operations that are currently &lt;em&gt;blocked&lt;/em&gt; from running.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;What is that &lt;code&gt;&amp;amp; ~&lt;/code&gt; business? Remember that &lt;code&gt;&amp;amp;&lt;/code&gt; is Bitwise AND, and will only return 1 if both bits are 1. &lt;code&gt;~&lt;/code&gt; is Bitwise NOT, and will change 1s to 0s, and 0s to 1s. As an example, using the &lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt; 0 0 0 0 0 0 0 0 = 0x0 = interrupt_flag
 0 0 0 0 0 0 0 0 = 0x0 = interrupt_mask
         |
         +------- TRAP_INTERRUPT_MASK

 0 0 0 0 1 0 0 0 &amp;amp;     # interrupt_flag
~0 0 0 0 1 0 0 0       # interrupt_mask

 0 0 0 0 1 0 0 0 &amp;amp;     # interrupt_flag
 1 1 1 1 0 1 1 1       # interrupt_mask

 0 0 0 0 0 0 0 0       # TRAP_INTERRUPT_MASK is blocked!
&lt;/code&gt;&lt;/pre&gt;
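&lt;p&gt;We can verify that math in plain Ruby:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;flag = 0b00001000 # TRAP pending
mask = 0b00001000 # TRAP blocked

(flag &amp;amp; ~mask) != 0 # =&amp;gt; false - nothing deliverable right now
&lt;/code&gt;&lt;/pre&gt;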
&lt;p&gt;The &lt;code&gt;interrupt_mask&lt;/code&gt; is only used in a few places - but it serves a role on critical paths, like preventing recursive calls within &lt;code&gt;Signal#trap&lt;/code&gt; handlers:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;static int
signal_exec(VALUE cmd, int sig)
{
    rb_execution_context_t *ec = GET_EC();
    volatile rb_atomic_t old_interrupt_mask = ec-&amp;gt;interrupt_mask;
    // ...
    ec-&amp;gt;interrupt_mask |= TRAP_INTERRUPT_MASK;
    // run signal handlers like Signal#trap
    ec-&amp;gt;interrupt_mask = old_interrupt_mask;
    // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Because the &lt;code&gt;interrupt_mask&lt;/code&gt; matches the &lt;code&gt;interrupt_flag&lt;/code&gt;, &lt;code&gt;RUBY_VM_INTERRUPTED_ANY&lt;/code&gt; won’t allow us to recursively trigger a signal handler. If we were to remove the &lt;code&gt;interrupt_mask&lt;/code&gt; check, this code would call itself recursively forever and stack overflow:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;pid &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fork &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Signal&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;trap(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TERM&amp;#34;&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TERM&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pid)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TERM&amp;#34;&lt;/span&gt;, pid)
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;waitall
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Process.kill&amp;#39;: stack level too deep (SystemStackError)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;But &lt;em&gt;with&lt;/em&gt; the interrupt block code, it just runs forever, endlessly queueing up another trap. It’s kind of hard to find a compelling example of this mask - it mostly seems like very defensive programming!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;If you’ve ever written a signal handler in Rails and tried using &lt;code&gt;Rails.logger&lt;/code&gt;, you’ve hit the &lt;code&gt;interrupt_mask&lt;/code&gt;.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;trap(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TERM&amp;#34;&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Rails&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;logger&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;info(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TRAP fired!&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;kill(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TERM&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pid)
&lt;span style=&#34;color:#75715e&#34;&gt;# log writing failed. can&amp;#39;t be called from trap context&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This is because &lt;code&gt;Mutex#lock&lt;/code&gt; raises an error if it is used inside an interrupt trap. If the interrupt mask has &lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt; set, it means we&amp;rsquo;re running in a &lt;code&gt;trap&lt;/code&gt; and blocking any more trap interrupts from firing:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;static VALUE
do_mutex_lock(VALUE self, int interruptible_p)
{
  rb_execution_context_t *ec = GET_EC();
  rb_thread_t *th = ec-&amp;gt;thread_ptr;

  if (th-&amp;gt;ec-&amp;gt;interrupt_mask &amp;amp; TRAP_INTERRUPT_MASK) {
    rb_raise(rb_eThreadError, &amp;quot;can&#39;t be called from trap context&amp;quot;);
  }
  // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Internally, &lt;code&gt;Rails.logger&lt;/code&gt; is a &lt;code&gt;Logger&lt;/code&gt; instance from the &lt;a href=&#34;https://rubygems.org/gems/logger/versions/1.7.0?locale=en&#34;&gt;&lt;code&gt;logger&lt;/code&gt;&lt;/a&gt; gem. That &lt;code&gt;Logger&lt;/code&gt; writes to logs using a &lt;code&gt;LogDevice&lt;/code&gt;. The &lt;code&gt;LogDevice&lt;/code&gt; includes &lt;code&gt;MonitorMixin&lt;/code&gt;, which gives it a built-in &lt;code&gt;Monitor&lt;/code&gt; instance to &lt;code&gt;synchronize&lt;/code&gt; with:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Logger&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;LogDevice&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;include&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;MonitorMixin&lt;/span&gt;
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;write&lt;/span&gt;(message)
      handle_write_errors(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;writing&amp;#34;&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
        synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# We can&amp;#39;t lock a mutex in a signal!&lt;/span&gt;
          &lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 You can learn more about &lt;code&gt;Monitor&lt;/code&gt;s and &lt;code&gt;synchronize&lt;/code&gt; in my post on &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#monitor&#34;&gt;The Thread API&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;For completeness, let’s add &lt;code&gt;@interrupt_mask&lt;/code&gt; to our &lt;code&gt;ExecutionContext&lt;/code&gt; class:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ExecutionContext&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;
    @interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
    @interrupt_mask &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;with_interrupt_mask&lt;/span&gt;(mask)
    old_interrupt_mask &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; @interrupt_mask
    @interrupt_mask &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; mask
    &lt;span style=&#34;color:#66d9ef&#34;&gt;yield&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# run the block with the mask applied&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
    @interrupt_mask &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; old_interrupt_mask    
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_vm_interrupted_any?&lt;/span&gt;
    &lt;span style=&#34;color:#75715e&#34;&gt;# (flag &amp;amp; ~mask) != 0&lt;/span&gt;
    (@interrupt_flag &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;~&lt;/span&gt;@interrupt_mask) &lt;span style=&#34;color:#f92672&#34;&gt;!=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;mask_to_b&lt;/span&gt;
    to_b(@interrupt_mask)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Now &lt;code&gt;#ruby_vm_interrupted_any?&lt;/code&gt; should work! And we can create sections where we block certain interrupts from being fired:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;ec &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ExecutionContext&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_set_trap_interrupt
ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_interrupted_any? &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; true&lt;/span&gt;
ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;with_interrupt_mask(&lt;span style=&#34;color:#66d9ef&#34;&gt;TRAP_INTERRUPT_MASK&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  ec&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;ruby_vm_interrupted_any? &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; false&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;weve-been-waiting&#34;&gt;The &lt;code&gt;interrupt&lt;/code&gt;ion we&#39;ve all been waiting for&lt;/h3&gt;
&lt;p&gt;Ok, now we know how to check and block the flags, we know the general places they are checked, and we know why it’s valuable for those checks to be efficient. Let’s look at what this has all led up to. What actually happens when an interrupt is detected? We’ll break it down piece-by-piece, but here’s the full function to start. Understanding this function gives us insight into when Ruby decides to yield, raise exceptions, and deliver signals:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;rb_threadptr_execute_interrupts&lt;/span&gt;(rb_thread_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;th, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; blocking_timing)
{
  rb_atomic_t interrupt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; postponed_job_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; ret &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; FALSE;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;raised_flag) &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; ret;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; ((interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; threadptr_get_interrupts(th)) &lt;span style=&#34;color:#f92672&#34;&gt;!=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;) {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; sig;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; timer_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; pending_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; trap_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; terminate_interrupt;
	
    timer_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TIMER_INTERRUPT_MASK;
    pending_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; PENDING_INTERRUPT_MASK;
    postponed_job_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; POSTPONED_JOB_INTERRUPT_MASK;
    trap_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TRAP_INTERRUPT_MASK;
    terminate_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TERMINATE_INTERRUPT_MASK; &lt;span style=&#34;color:#75715e&#34;&gt;// request from other ractors
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; VM_BARRIER_INTERRUPT_MASK) {
      RB_VM_LOCKING();
    }
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (postponed_job_interrupt) {
      rb_postponed_job_flush(th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;vm);
    }
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (trap_interrupt) {
      &lt;span style=&#34;color:#75715e&#34;&gt;/* signal handling */&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;vm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ractor.main_thread) {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; rb_thread_status prev_status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status;

        th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; THREAD_RUNNABLE;
        {
          &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; ((sig &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_get_next_signal()) &lt;span style=&#34;color:#f92672&#34;&gt;!=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;) {
            ret &lt;span style=&#34;color:#f92672&#34;&gt;|=&lt;/span&gt; rb_signal_exec(th, sig);
          }
        }
        th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; prev_status;
      }
	
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;ccan_list_empty(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;interrupt_exec_tasks)) {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; rb_thread_status prev_status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status;

        th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; THREAD_RUNNABLE;
        {
          threadptr_interrupt_exec_exec(th);
        }
        th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; prev_status;
      }
    }
	
    &lt;span style=&#34;color:#75715e&#34;&gt;/* exception from another thread */&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (pending_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; threadptr_pending_interrupt_active_p(th)) {
      VALUE err &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_threadptr_pending_interrupt_deque(th, blocking_timing &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; INTERRUPT_ON_BLOCKING : INTERRUPT_NONE);
      ret &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; TRUE;
	
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (UNDEF_P(err)) {
        &lt;span style=&#34;color:#75715e&#34;&gt;/* no error */&lt;/span&gt;
      }
      &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (err &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; RUBY_FATAL_THREAD_KILLED        &lt;span style=&#34;color:#75715e&#34;&gt;/* Thread#kill received */&lt;/span&gt;   &lt;span style=&#34;color:#f92672&#34;&gt;||&lt;/span&gt;
               err &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; RUBY_FATAL_THREAD_TERMINATED   &lt;span style=&#34;color:#75715e&#34;&gt;/* Terminate thread */&lt;/span&gt;       &lt;span style=&#34;color:#f92672&#34;&gt;||&lt;/span&gt;
               err &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; INT2FIX(TAG_FATAL) &lt;span style=&#34;color:#75715e&#34;&gt;/* Thread.exit etc. */&lt;/span&gt;         ) {
        terminate_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;;
      }
      &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (err &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;vm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;special_exceptions[ruby_error_stream_closed]) {
          &lt;span style=&#34;color:#75715e&#34;&gt;/* the only special exception to be queued across thread */&lt;/span&gt;
          err &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ruby_vm_special_exception_copy(err);
        }
        &lt;span style=&#34;color:#75715e&#34;&gt;/* set runnable if th was slept. */&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; THREAD_STOPPED &lt;span style=&#34;color:#f92672&#34;&gt;||&lt;/span&gt;
            th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; THREAD_STOPPED_FOREVER)
          th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; THREAD_RUNNABLE;
        rb_exc_raise(err);
      }
    }
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (terminate_interrupt) {
      rb_threadptr_to_kill(th);
    }
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (timer_interrupt) {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;uint32_t&lt;/span&gt; limits_us &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; thread_default_quantum_ms &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1000&lt;/span&gt;;
	
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
        limits_us &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;=&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt;
        limits_us &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&amp;gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority;
	
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; THREAD_RUNNABLE)
        th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;running_time_us &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1000&lt;/span&gt;; &lt;span style=&#34;color:#75715e&#34;&gt;// 10ms = 10_000us
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;	
      EXEC_EVENT_HOOK(th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ec, RUBY_INTERNAL_EVENT_SWITCH, th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;cfp&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;self,
                      &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, Qundef);
	
      rb_thread_schedule_limits(limits_us);
    }
  }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; ret;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There it is. Guess we’re done here! See you next time!&lt;/p&gt;
&lt;p&gt;What a joker I am 🙄.&lt;/p&gt;
&lt;p&gt;Let’s start walking through the function. Most of the function lives inside a &lt;code&gt;while&lt;/code&gt; loop. The &lt;code&gt;while&lt;/code&gt; loop sets &lt;code&gt;interrupt&lt;/code&gt; to the return value of &lt;code&gt;threadptr_get_interrupts&lt;/code&gt;. That function gets the current &lt;code&gt;interrupt_flag &amp;amp; ~interrupt_mask&lt;/code&gt;, clearing out everything in &lt;code&gt;ec-&amp;gt;interrupt_flag&lt;/code&gt; in the process (except what was hidden by &lt;code&gt;ec-&amp;gt;interrupt_mask&lt;/code&gt;). We continue to iterate as long as the interrupt flag doesn’t come back as &lt;code&gt;0&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;int
rb_threadptr_execute_interrupts(rb_thread_t *th, int blocking_timing)
{
  rb_atomic_t interrupt;
  int postponed_job_interrupt = 0;
  int ret = FALSE;

  // ...

  while ((interrupt = threadptr_get_interrupts(th)) != 0) {
    // ...
  }
}
&lt;/code&gt;&lt;/pre&gt;
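&lt;p&gt;Here’s a single-threaded Ruby sketch of what &lt;code&gt;threadptr_get_interrupts&lt;/code&gt; does, added to our &lt;code&gt;ExecutionContext&lt;/code&gt; (the real thing does this with an atomic compare-and-swap loop):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ExecutionContext
  # ...
  def get_interrupts
    # Grab the deliverable bits, then clear them from the flag -
    # leaving any masked (blocked) bits pending for later
    interrupt = @interrupt_flag &amp;amp; ~@interrupt_mask
    @interrupt_flag &amp;amp;= @interrupt_mask
    interrupt
  end
end
&lt;/code&gt;&lt;/pre&gt;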
&lt;p&gt;Why are we using &lt;code&gt;while&lt;/code&gt; on a single int field, where we check every mask at once? Because while we’re processing the values we pulled from the &lt;code&gt;interrupt_flag&lt;/code&gt;, new bits may have been set. If another mask gets set while we’re handling the current interrupts, we keep looping until &lt;code&gt;threadptr_get_interrupts&lt;/code&gt; returns &lt;code&gt;0&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;Next up we use a Bitwise AND to check which masks are currently set. If a mask is set, the &lt;code&gt;int&lt;/code&gt; will be a non-zero value (truthy in C), otherwise &lt;code&gt;0&lt;/code&gt; (falsey in C - unlike Ruby, where &lt;code&gt;0&lt;/code&gt; is truthy). We’ll use those for the &lt;code&gt;if&lt;/code&gt; statements later on:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;rb_threadptr_execute_interrupts&lt;/span&gt;(rb_thread_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;th, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; blocking_timing)
{
  &lt;span style=&#34;color:#75715e&#34;&gt;// ...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; ((interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; threadptr_get_interrupts(th)) &lt;span style=&#34;color:#f92672&#34;&gt;!=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;) {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; sig;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; timer_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; pending_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; trap_interrupt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; terminate_interrupt;
	
    timer_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TIMER_INTERRUPT_MASK;
    pending_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; PENDING_INTERRUPT_MASK;
    postponed_job_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; POSTPONED_JOB_INTERRUPT_MASK;
    trap_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TRAP_INTERRUPT_MASK;
    terminate_interrupt &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; interrupt &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt; TERMINATE_INTERRUPT_MASK; &lt;span style=&#34;color:#75715e&#34;&gt;// request from other ractors
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// ...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  }
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h4 id=&#34;timer-interrupt-mask&#34;&gt;&lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt;&lt;/h4&gt;
&lt;p&gt;Now we start checking for work. Let’s start with time slice and priority handling with &lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt;. In &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt; I discussed how context gets switched between threads in Ruby:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;There are two common reasons context gets switched between threads in CRuby, which can result in operations only partially completing (ie, setting the proper result, then checking that result):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;~100ms of Ruby processing have elapsed&lt;/li&gt;
&lt;li&gt;A blocking operation has been invoked&lt;/li&gt;
&lt;/ol&gt;
&lt;/blockquote&gt;
&lt;p&gt;The &lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt; condition is where we check for that processing time&lt;sup id=&#34;fnref:11&#34;&gt;&lt;a href=&#34;#fn:11&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;11&lt;/a&gt;&lt;/sup&gt;. On Linux/Unix, CRuby maintains a timer thread which typically checks for work every 10ms. As part of that, it calls &lt;code&gt;RUBY_VM_SET_TIMER_INTERRUPT&lt;/code&gt;, which sets the &lt;code&gt;TIMER_INTERRUPT_MASK&lt;/code&gt;.&lt;/p&gt;
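&lt;p&gt;To make that concrete, here’s a rough Ruby sketch using the &lt;code&gt;ExecutionContext&lt;/code&gt; model from earlier - a toy stand-in, not the actual CRuby timer thread:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ec = ExecutionContext.new

# Toy timer thread: flag the execution context every ~10ms
Thread.new do
  loop do
    sleep 0.01
    ec.ruby_vm_set_timer_interrupt
  end
end
&lt;/code&gt;&lt;/pre&gt;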
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-timer-diagram.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The timer interrupt is fairly straightforward:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Get the current “quantum” (the CRuby name for the amount of time each thread can run before being context switched)&lt;/li&gt;
&lt;li&gt;Priority is used to increase or decrease the amount of time a thread can run above or below the default&lt;/li&gt;
&lt;li&gt;It is assumed the thread ran 10ms before this code, so it adds 10ms to the &lt;code&gt;running_time&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;An event hook is fired notifying interested plugins that a thread context switch is happening&lt;/li&gt;
&lt;li&gt;Finally, &lt;code&gt;rb_thread_schedule_limits&lt;/code&gt; is called&lt;/li&gt;
&lt;/ol&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (timer_interrupt) {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;uint32_t&lt;/span&gt; limits_us &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; thread_default_quantum_ms &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1000&lt;/span&gt;;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
    limits_us &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;=&lt;/span&gt; th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt;
    limits_us &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&amp;gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;priority;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;status &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; THREAD_RUNNABLE)
    th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;running_time_us &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1000&lt;/span&gt;; &lt;span style=&#34;color:#75715e&#34;&gt;// 10ms = 10_000us
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;	
  EXEC_EVENT_HOOK(th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ec,
    RUBY_INTERNAL_EVENT_SWITCH, th&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;cfp&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;self,
    &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, Qundef);
	
  rb_thread_schedule_limits(limits_us);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;Since Ruby 3.4, you can set your own &lt;code&gt;thread_default_quantum_ms&lt;/code&gt; using the env variable &lt;a href=&#34;https://bugs.ruby-lang.org/issues/20861&#34;&gt;&lt;code&gt;RUBY_THREAD_TIMESLICE&lt;/code&gt;&lt;/a&gt;. This means the long-held CRuby constant of 100ms time slices is now adjustable, and folks have been adjusting it to handle &lt;a href=&#34;https://github.com/sidekiq/sidekiq/discussions/5039#discussioncomment-14064274&#34;&gt;different CPU saturated workloads&lt;/a&gt;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;code&gt;rb_thread_schedule_limits&lt;/code&gt; checks if the thread is over its allotted running time, and yields if so:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;static void
rb_thread_schedule_limits(uint32_t limits_us)
{
  rb_thread_t *th = GET_THREAD();

  if (th-&amp;gt;running_time_us &amp;gt;= limits_us) {
    thread_sched_yield(TH_SCHED(th), th);
    rb_ractor_thread_switch(th-&amp;gt;ractor, th, true);
  }
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We’ve discussed bit manipulation quite a bit - it would feel negligent not to briefly discuss the right and left bit shifts used for priority 🤷🏻‍♂️.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (th-&amp;gt;priority &amp;gt; 0)
  limits_us &amp;lt;&amp;lt;= th-&amp;gt;priority;
else
  limits_us &amp;gt;&amp;gt;= -th-&amp;gt;priority;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;If &lt;code&gt;th-&amp;gt;priority&lt;/code&gt; is greater than zero, we shift every bit left. If not, we negate the priority (so negative priorities turn positive) and shift every bit right. We can easily demonstrate how this works in Ruby, using milliseconds (&lt;code&gt;ms&lt;/code&gt;) instead of microseconds (&lt;code&gt;us&lt;/code&gt;) for simplicity:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;calculate_priority&lt;/span&gt;(priority, limit)
  priority &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; limit &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; priority : limit &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;priority
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
calculate_priority(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)        &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 100&lt;/span&gt;
calculate_priority(&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)        &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 400&lt;/span&gt;
calculate_priority(&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)       &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 25&lt;/span&gt;
to_b(&lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)                         &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 01100100  = 100&lt;/span&gt;
to_b(calculate_priority(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;))  &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 01100100  = 100&lt;/span&gt;
to_b(calculate_priority(&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;))  &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 110010000 = 400&lt;/span&gt;
to_b(calculate_priority(&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)) &lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 00011001  = 25&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;#  01100100    01100100&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#    &amp;lt;&amp;lt; 2        &amp;gt;&amp;gt; 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 110010000    00011001&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;That means that at the default quantum of 100ms, if you give a CRuby thread a priority of 2, it will be given 400ms of runtime before being forced to switch! And -2 means your thread will only run for 25ms at a time. Shifting left multiplies the value by a power of two, while shifting right divides it (dropping any bits that fall off the end) - which is why the value is lower.&lt;/p&gt;
&lt;h4 id=&#34;trap-interrupt-mask&#34;&gt;`TRAP_INTERRUPT_MASK`&lt;/h4&gt;
&lt;p&gt;Now we’re onto signal handling using &lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt;. The first part is what you might expect from a “trap” interrupt - signal handling. According to this code - you’ll only &lt;em&gt;ever&lt;/em&gt; run trap handlers on the main thread. If there is a trap mask and we aren’t on the main thread, we ignore it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;/* signal handling */
if (th == th-&amp;gt;vm-&amp;gt;ractor.main_thread) {
  // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The thread in this Ruby example will always equal &lt;code&gt;Thread.main&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;trap(&amp;quot;INT&amp;quot;) do
  puts &amp;quot;hello from #{Thread.current}: #{Thread.current == Thread.main}&amp;quot;
  # =&amp;gt; hello from #&amp;lt;Thread:0x000000010445b2a8 run&amp;gt;: true
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Next we iterate through each available signal. If multiple signals are waiting to be processed, we process them all here. &lt;code&gt;rb_signal_exec&lt;/code&gt; internally calls &lt;code&gt;signal_exec&lt;/code&gt;, which we looked at earlier:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;while ((sig = rb_get_next_signal()) != 0) {
  ret |= rb_signal_exec(th, sig);
}
&lt;/code&gt;&lt;/pre&gt;
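&lt;p&gt;As a rough Ruby sketch of that drain loop (a toy model - &lt;code&gt;rb_get_next_signal&lt;/code&gt; actually pulls from a C-level buffer):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Toy model: drain every buffered signal, running each handler
pending_signals = [:TERM, :HUP] # pretend these arrived while we were busy
handlers = {
  TERM: -&amp;gt; { puts &amp;quot;shutting down&amp;quot; },
  HUP: -&amp;gt; { puts &amp;quot;reloading config&amp;quot; }
}

until pending_signals.empty?
  sig = pending_signals.shift
  handlers[sig]&amp;amp;.call
end
&lt;/code&gt;&lt;/pre&gt;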
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-signal-handling.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Prior to Ruby 3.4, that was the exclusive purpose of &lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt;. Ruby 3.4+ also uses it to alert other threads that there is work for them to execute. You put work into the thread’s &lt;code&gt;interrupt_exec_tasks&lt;/code&gt; list, and call &lt;code&gt;threadptr_interrupt_exec_exec&lt;/code&gt; on each thread:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (!ccan_list_empty(&amp;amp;th-&amp;gt;interrupt_exec_tasks)) {
  // ...
  threadptr_interrupt_exec_exec(th);
  // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;code&gt;threadptr_interrupt_exec_exec&lt;/code&gt; runs the requested task (a function), either in a new thread, or inline:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (task-&amp;gt;flags &amp;amp; rb_interrupt_exec_flag_new_thread) {
  rb_thread_create(task-&amp;gt;func, task-&amp;gt;data);
}
else {
  (*task-&amp;gt;func)(task-&amp;gt;data);
}
&lt;/code&gt;&lt;/pre&gt;
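&lt;p&gt;Here’s a loose Ruby sketch of that dispatch - the &lt;code&gt;Task&lt;/code&gt; struct and its fields are made up for illustration:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Toy model of threadptr_interrupt_exec_exec
Task = Struct.new(:func, :data, :new_thread)

def interrupt_exec_exec(task)
  if task.new_thread
    Thread.new { task.func.call(task.data) } # run in a fresh thread
  else
    task.func.call(task.data) # run inline, on the interrupted thread
  end
end

interrupt_exec_exec(Task.new(-&amp;gt;(data) { puts data }, &amp;quot;hello&amp;quot;, false))
# =&amp;gt; hello
&lt;/code&gt;&lt;/pre&gt;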
&lt;p&gt;It seems generally handy, but it was introduced for a specific purpose: supporting &lt;code&gt;require&lt;/code&gt; and &lt;code&gt;autoload&lt;/code&gt; inside of Ractors:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Ruby &amp;lt; 3.4
Ractor.new { pp &amp;quot;hey there!&amp;quot; } # autoloads `pp`
# =&amp;gt; `require&#39;: can not access non-shareable objects in constant Kernel::RUBYGEMS_ACTIVATION_MONITOR by non-main ractor. (Ractor::IsolationError)
Ractor.new {
  require &amp;quot;json&amp;quot;
  puts JSON.parse(&#39;&amp;quot;hey there!&amp;quot;&#39;)
}
# =&amp;gt; `require&#39;: can not access non-shareable objects in constant Kernel::RUBYGEMS_ACTIVATION_MONITOR by non-main ractor. (Ractor::IsolationError)

# Ruby &amp;gt;= 3.4
Ractor.new { pp &amp;quot;hey there!&amp;quot; }
# =&amp;gt; &amp;quot;hey there!&amp;quot;
Ractor.new {
  require &amp;quot;json&amp;quot;
  puts JSON.parse(&#39;&amp;quot;hey there!&amp;quot;&#39;)
}
# =&amp;gt; hey there!
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Requiring a gem means accessing non-shareable objects, and Ractors cannot access any state that is non-shareable. The only Ractor with access to these non-shareable objects is the main Ractor, &lt;code&gt;Ractor.main&lt;/code&gt;. To get around this, non-main Ractors add a task to the &lt;code&gt;interrupt_exec_tasks&lt;/code&gt; list on the main Ractor thread, and set &lt;code&gt;TRAP_INTERRUPT_MASK&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;rb_ractor_t *main_r = GET_VM()-&amp;gt;ractor.main_ractor;
// for `require` calls
rb_ractor_interrupt_exec(main_r, ractor_require_func...)
// for autoloading, like when calling `pp`
rb_ractor_interrupt_exec(main_r, ractor_autoload_load_func...)

// ...
// ractor_require_func/ractor_autoload_load_func are
//   referenced in task-&amp;gt;node
ccan_list_add_tail(&amp;amp;th-&amp;gt;interrupt_exec_tasks, &amp;amp;task-&amp;gt;node);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-ractor-require.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;h4 id=&#34;pending-interrupt-mask&#34;&gt;`PENDING_INTERRUPT_MASK`&lt;/h4&gt;
&lt;p&gt;Now we’ve got the heavy-hitter of thread interrupts - &lt;code&gt;PENDING_INTERRUPT_MASK&lt;/code&gt;. It’s not clear from the name, but this mask gets set by &lt;code&gt;Thread#raise&lt;/code&gt; and &lt;code&gt;Thread#kill&lt;/code&gt;. It doesn’t get much more interruptive than arbitrarily raising an error within, or killing a thread.&lt;/p&gt;
&lt;p&gt;The reason it’s called &lt;code&gt;PENDING_INTERRUPT_MASK&lt;/code&gt; is that it indicates there are errors waiting to be evaluated in the thread’s &lt;code&gt;pending_interrupt_queue&lt;/code&gt;. Every thread has a &lt;code&gt;pending_interrupt_queue&lt;/code&gt;, which manages the interrupts that have been enqueued by calls like &lt;code&gt;Thread#raise&lt;/code&gt; and &lt;code&gt;Thread#kill&lt;/code&gt;. Sometimes those interrupts are actual error instances (&lt;code&gt;Thread#raise&lt;/code&gt;), and sometimes they are integer flags (&lt;code&gt;Thread#kill&lt;/code&gt;).&lt;/p&gt;
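&lt;p&gt;A toy Ruby model of that queue might look like this (the real queue lives in C on the thread struct - this is just to illustrate the mixed contents):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Toy model: Thread#raise enqueues errors, Thread#kill enqueues a flag
class ToyThread
  KILL_FLAG = :fatal_thread_killed

  def initialize
    @pending_interrupt_queue = []
  end

  def raise_interrupt(error)
    @pending_interrupt_queue &amp;lt;&amp;lt; error
  end

  def kill_interrupt
    @pending_interrupt_queue &amp;lt;&amp;lt; KILL_FLAG
  end

  def dequeue_interrupt
    @pending_interrupt_queue.shift
  end
end

t = ToyThread.new
t.raise_interrupt(RuntimeError.new(&amp;quot;boom&amp;quot;))
t.kill_interrupt
t.dequeue_interrupt # =&amp;gt; #&amp;lt;RuntimeError: boom&amp;gt;
t.dequeue_interrupt # =&amp;gt; :fatal_thread_killed
&lt;/code&gt;&lt;/pre&gt;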
&lt;p&gt;We start off by checking if there are any active pending interrupts in the queue. If there are, we dequeue the first available interrupt. The &lt;code&gt;blocking_timing&lt;/code&gt; relates to the &lt;code&gt;#handle_interrupt&lt;/code&gt; method, and we’ll dig into those next time in “When good threads go bad”. For now, just know it gives you the ability to defer the thread being interrupted:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;/* exception from another thread */
if (pending_interrupt &amp;amp;&amp;amp; threadptr_pending_interrupt_active_p(th)) {
  VALUE err = rb_threadptr_pending_interrupt_deque(th, blocking_timing ? INTERRUPT_ON_BLOCKING : INTERRUPT_NONE);
  // ...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/threads-pending-errors.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;We check if the dequeued interrupt is one of the flags set by &lt;code&gt;Thread#kill&lt;/code&gt;/&lt;code&gt;Thread#terminate&lt;/code&gt;/&lt;code&gt;Thread#exit&lt;/code&gt;, all representing that the thread should be killed immediately. We set the &lt;code&gt;terminate_interrupt&lt;/code&gt;, which later in the function triggers &lt;code&gt;rb_threadptr_to_kill&lt;/code&gt;. This kills the thread and cannot be rescued:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (/* Thread#kill received */
    err == RUBY_FATAL_THREAD_KILLED ||
    /* Terminate thread */
    err == RUBY_FATAL_THREAD_TERMINATED ||
    /* Thread.exit etc. */
    err == INT2FIX(TAG_FATAL)) {
  terminate_interrupt = 1;
}

// ...
// outside of the pending interrupt if statement
if (terminate_interrupt) {
  rb_threadptr_to_kill(th);
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;If the error isn&amp;rsquo;t one of the &lt;code&gt;Thread#kill&lt;/code&gt; flags, it must be an actual Ruby exception. We make sure the thread is in a running state. Then we force it to raise an error at whatever point in the code it goes to execute next. This raises whatever error we set with &lt;code&gt;Thread#raise&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;/* set runnable if th was slept. */
if (th-&amp;gt;status == THREAD_STOPPED ||
    th-&amp;gt;status == THREAD_STOPPED_FOREVER)
  th-&amp;gt;status = THREAD_RUNNABLE;
rb_exc_raise(err);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Personally, I was surprised to find that these interrupts are stored in a queue! Can we prove it in our Ruby code? Let’s try:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;CatchyError = Class.new(StandardError)

class ErrorCatcher
  def self.===(exception)
    exception.message =~ /1|2|3/
  end
end

t = Thread.new do
  sleep
rescue ErrorCatcher
  redo
rescue CatchyError
  raise
end

sleep 0.1
t.raise(CatchyError.new(&#39;1&#39;))
t.raise(CatchyError.new(&#39;2&#39;))
t.raise(CatchyError.new(&#39;3&#39;))
t.raise(CatchyError.new(&#39;4&#39;))
t.join
# =&amp;gt; #&amp;lt;Thread:0x0000000120961420 (irb):84 run&amp;gt; terminated with exception (report_on_exception is true):
#   in &#39;Kernel#sleep&#39;: 4 (CatchyError)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In the above code:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;We set up a dynamic error matcher so we can raise the same error, but catch it differently depending on the message&lt;sup id=&#34;fnref:12&#34;&gt;&lt;a href=&#34;#fn:12&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;12&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;We &lt;code&gt;rescue&lt;/code&gt; and &lt;code&gt;redo&lt;/code&gt; if we get a &lt;code&gt;CatchyError&lt;/code&gt; with &lt;code&gt;1&lt;/code&gt;, &lt;code&gt;2&lt;/code&gt;, or &lt;code&gt;3&lt;/code&gt; as the message&lt;/li&gt;
&lt;li&gt;Even though we &lt;code&gt;t.raise&lt;/code&gt; four times, only the fourth &lt;code&gt;CatchyError&lt;/code&gt; is raised. If you change the regex to match &lt;code&gt;/1|2/&lt;/code&gt;, it will fail on the third error instead&lt;/li&gt;
&lt;li&gt;It really is running through the queue of errors!&lt;/li&gt;
&lt;/ul&gt;
&lt;h4 id=&#34;terminate-interrupt-mask&#34;&gt;`TERMINATE_INTERRUPT_MASK`&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;TERMINATE_INTERRUPT_MASK&lt;/code&gt; is pretty niche. You’ll remember this code from earlier, triggered by &lt;code&gt;Thread#kill&lt;/code&gt; via the &lt;code&gt;pending_interrupt_queue&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (terminate_interrupt) {
  rb_threadptr_to_kill(th);
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;There are two ways to trigger that code:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Using &lt;code&gt;Thread#kill&lt;/code&gt;, as we already learned&lt;/li&gt;
&lt;li&gt;When a Ruby process is shutting down. As part of that shutdown, all Ractors are terminated, which sets &lt;code&gt;TERMINATE_INTERRUPT_MASK&lt;/code&gt; on each of their threads&lt;/li&gt;
&lt;/ol&gt;
&lt;h4 id=&#34;postponed-job-interrupt-mask&#34;&gt;`POSTPONED_JOB_INTERRUPT_MASK`&lt;/h4&gt;
&lt;p&gt;Still in the niche-zone, we’ve got &lt;code&gt;POSTPONED_JOB_INTERRUPT_MASK&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;This mask is used when work needs to be performed, but can’t safely run in its current context. By making it an interrupt mask, the work can be inserted into a safe point for execution in the CRuby runtime:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (postponed_job_interrupt) {
  rb_postponed_job_flush(th-&amp;gt;vm);
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The &lt;code&gt;rb_postponed_job_flush&lt;/code&gt; function iterates through work in the &lt;code&gt;postponed_job_queue&lt;/code&gt;, calling each function in the queue.&lt;/p&gt;
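&lt;p&gt;In Ruby terms, you can picture it as a queue of callables being drained (a sketch, not the real implementation):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Toy model: flush a queue of postponed jobs
postponed_job_queue = []
postponed_job_queue &amp;lt;&amp;lt; -&amp;gt; { puts &amp;quot;deferred work&amp;quot; }
postponed_job_queue &amp;lt;&amp;lt; -&amp;gt; { puts &amp;quot;more deferred work&amp;quot; }

def postponed_job_flush(queue)
  queue.shift.call until queue.empty?
end

postponed_job_flush(postponed_job_queue)
# =&amp;gt; deferred work
# =&amp;gt; more deferred work
&lt;/code&gt;&lt;/pre&gt;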
&lt;p&gt;In CRuby, I can only find references to it in the &lt;code&gt;TracePoint&lt;/code&gt; source code. In concept, it seems very similar to the &lt;code&gt;interrupt_exec_tasks&lt;/code&gt; used for &lt;code&gt;Ractor#require&lt;/code&gt;. I’m sure there is a CRuby committer who could explain this further - I’d be curious to understand it better!&lt;/p&gt;
&lt;h4 id=&#34;vm-barrier-interrupt-mask&#34;&gt;`VM_BARRIER_INTERRUPT_MASK`&lt;/h4&gt;
&lt;p&gt;Not to be outdone by &lt;code&gt;TERMINATE_INTERRUPT_MASK&lt;/code&gt; and &lt;code&gt;POSTPONED_JOB_INTERRUPT_MASK&lt;/code&gt;, we’ve got king-niche: &lt;code&gt;VM_BARRIER_INTERRUPT_MASK&lt;/code&gt;. When set, it runs &lt;code&gt;RB_VM_LOCKING()&lt;/code&gt; on the thread:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if (interrupt &amp;amp; VM_BARRIER_INTERRUPT_MASK) {
  RB_VM_LOCKING();
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’s niche, but it seems to play an important role: giving an operation exclusive access to the entire VM. This appears to have been introduced with Ractors in Ruby 3. That makes sense - Ractors are the first truly parallel unit of execution in Ruby.&lt;/p&gt;
&lt;p&gt;Certain operations, like YJIT compiling bytecode, require exclusive access to the VM when running. For instance, when &lt;code&gt;rb_yjit_compile_iseq&lt;/code&gt; is called, the first thing it does is call &lt;code&gt;rb_vm_barrier&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;void
rb_yjit_compile_iseq(const rb_iseq_t *iseq, rb_execution_context_t *ec, bool jit_exception)
{
    RB_VM_LOCKING() {
        rb_vm_barrier();
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;code&gt;rb_vm_barrier&lt;/code&gt; sets the &lt;code&gt;VM_BARRIER_INTERRUPT_MASK&lt;/code&gt; on all running threads across Ractors, then waits for each to stop at the barrier:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;// interrupts all running threads
rb_thread_t *ith;
ccan_list_for_each(&amp;amp;vm-&amp;gt;ractor.sched.running_threads, ith, sched.node.running_threads) {
  if (ith-&amp;gt;ractor != cr) {
    RUBY_VM_SET_VM_BARRIER_INTERRUPT(ith-&amp;gt;ec);
  }
}

// wait for other ractors
while (!ractor_sched_barrier_completed_p(vm)) {
  ractor_sched_set_unlocked(vm, cr);
  rb_native_cond_wait(&amp;amp;vm-&amp;gt;ractor.sched.barrier_complete_cond, &amp;amp;vm-&amp;gt;ractor.sched.lock);
  ractor_sched_set_locked(vm, cr);
}
&lt;/code&gt;&lt;/pre&gt;
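&lt;p&gt;Conceptually, it’s a classic barrier. Here’s a minimal Ruby sketch of the waiting half, using a &lt;code&gt;Mutex&lt;/code&gt; and &lt;code&gt;ConditionVariable&lt;/code&gt; in place of the VM’s native locks:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Toy barrier: every thread waits until all expected threads arrive
class ToyBarrier
  def initialize(expected)
    @expected = expected
    @arrived = 0
    @lock = Mutex.new
    @cond = ConditionVariable.new
  end

  def wait
    @lock.synchronize do
      @arrived += 1
      if @arrived == @expected
        @cond.broadcast # everyone has stopped - release them all
      else
        @cond.wait(@lock) while @arrived &amp;lt; @expected
      end
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Once every thread has hit &lt;code&gt;wait&lt;/code&gt;, they’re all paused at a known safe point - which is exactly what an operation like YJIT compilation needs before touching shared VM state.&lt;/p&gt;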
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/untitled-diagram-2025-10-22-051842.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;😮‍💨 We dug &lt;em&gt;deep&lt;/em&gt; in this one. Bitmasks, CRuby internals, Thread management - what could be next? With all this knowledge, we’re primed and ready to dig into what to do when a thread goes rogue. See you next time in “When good threads go bad” 👋🏼&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;With &lt;a href=&#34;https://bugs.ruby-lang.org/issues/20861&#34;&gt;Aaron Patterson&amp;rsquo;s PR&lt;/a&gt; to have a configurable quantum, this can be configured now. But whatever it’s set to is still static during a programs execution, and still defaults to 100ms&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs&#34;&gt;https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It can still happen, but it’s less likely. We’ll discuss ways it can happen later in the series&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;JRuby does as well! &lt;a href=&#34;https://github.com/jruby/jruby/blob/master/core/src/main/java/org/jruby/RubyThread.java#L822&#34;&gt;https://github.com/jruby/jruby/blob/master/core/src/main/java/org/jruby/RubyThread.java#L822&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The different number syntaxes in Ruby (binary, octal, decimal, hex) are really just sugar on the Integer class. So when you run these code snippets you’ll actually see a decimal integer rather than the binary I show in the comment&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;If you don’t want to use a &lt;code&gt;while&lt;/code&gt; loop, there’s a great alternative here: &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3m2ntcgzhe22c&#34;&gt;https://bsky.app/profile/jpcamara.com/post/3m2ntcgzhe22c&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;1,001,102, to be exact&lt;/p&gt;
&lt;p&gt;See &lt;a href=&#34;https://jpcamara.com/2024/11/28/counting-c-method.html&#34;&gt;https://jpcamara.com/2024/11/28/counting-c-method.html&lt;/a&gt; for how I got that number&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:8&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;If we simplified the loop to be &lt;code&gt;while(true); end&lt;/code&gt;, we’d actually exclusively use the &lt;code&gt;jump&lt;/code&gt; instruction: &lt;a href=&#34;https://redgetan.cc/understanding-timeouts-in-cruby/#6-working-examples&#34;&gt;https://redgetan.cc/understanding-timeouts-in-cruby/#6-working-examples&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:8&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:9&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;But interesting way to think about an if statement!&amp;#160;&lt;a href=&#34;#fnref:9&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:10&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Fun fact - if you run this example with YJIT it goes back down to five hundred thousand. YJIT seems to inline the method call and that bypasses the ints check&amp;#160;&lt;a href=&#34;#fnref:10&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:11&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This particular check is technically linux/unix specific. On Windows, &lt;code&gt;thread_win32.c&lt;/code&gt; is used and it maintains its own timer thread and priority controls specific to Windows.&amp;#160;&lt;a href=&#34;#fnref:11&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:12&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Thanks to the Honeybadger blog for the tip on dynamic exception matchers!&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://www.honeybadger.io/blog/level-up-ruby-rescue-with-dynamic-exception-matchers/&#34;&gt;https://www.honeybadger.io/blog/level-up-ruby-rescue-with-dynamic-exception-matchers/&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:12&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2025/threads-copy-of-pending-errors.drawio.png)

&gt; 👋🏼 This is part of a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt; - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my!](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “Bitmasks, Threads and Interrupts, oh my!&#34;. I’ll update the links as each part is released, and include these links in each post.

- [Interrupting Threads](#interrupting-threads)
	- [Managing threads](#managing-threads)
	- [An important interruption](#important-interruption)
	- [Why integer masks?](#integer-masks)
	- [Checking for interrupts](#checking-for-interrupts)
	- [Jumping to instructions](#jumping-to-instructions)
	- [Pervasive interrupts](#pervasive-interrupts)
	- [Interrupt masks](#interrupt-masks)
	- [It’s masks all the way down](#masks-all-the-way)
	- [The `interrupt`ion we’ve all been waiting for](#weve-been-waiting)
		- [`TIMER_INTERRUPT_MASK`](#timer-interrupt-mask)
		- [`TRAP_INTERRUPT_MASK`](#trap-interrupt-mask)
		- [`PENDING_INTERRUPT_MASK`](#pending-interrupt-mask)
		- [`TERMINATE_INTERRUPT_MASK`](#terminate-interrupt-mask)
		- [`POSTPONED_JOB_INTERRUPT_MASK`](#postponed-job-interrupt-mask)
		- [`VM_BARRIER_INTERRUPT_MASK`](#vm-barrier-interrupt-mask)

&lt;h2 id=&#34;interrupting-threads&#34;&gt;Interrupting Threads 🧵&lt;/h2&gt;

The Ruby thread scheduler is _rude_. There, I said it. 

It’s always telling your threads how to run, when to run, how long they can run - it stops them whenever it wants and then tells _other_ threads _they_ have to start working. It’s bossy as hell. On top of that, it’s not even polite about it. It feels free to _interrupt_ your threads - it decides when, and your threads have to just play along and listen.

But threads put up with it. They even _benefit_ from it, if you can believe that. The thread scheduler may be abrupt, but it&#39;s looking out for the runtime. If that’s the case - there must be a good reason… Why do threads put up with these scheduler shenanigans?

&lt;h3 id=&#34;managing-threads&#34;&gt;Managing threads 👷🏼‍♂️👷🏻‍♀️&lt;/h3&gt;

An important purpose of a thread scheduler is efficiency and fairness. It manages what threads are running, for how long, and what context each thread gets loaded with.

In normal operation this comes down to a few things:

- Time sharing: every thread, under normal circumstances, gets roughly 100ms of CPU runtime[^1]. Since only one thread can run Ruby code at a time[^2], this keeps a single thread from dominating the program[^3]
- Blocking operations: certain operations will “block” a thread. Most forms of IO, and `sleep`, for instance. When the thread blocks, the thread scheduler allows other threads to run
- Priority: we can suggest a priority for our threads, and the thread scheduler will take it into consideration when choosing what to run next and for how long
- Passing control: we can suggest actions to the thread scheduler, like `pass`ing control or `stop`ping the current thread so other threads can take over
- Locking: we can synchronize access to resources, and the thread scheduler chooses the order of access to the resource

How does it handle all this?

&lt;h3 id=&#34;important-interruption&#34;&gt;An important interruption&lt;/h3&gt;

The scheduler isn’t a single object - it is the behavior produced by specific VM checks and functions which opt-in to thread scheduling behavior. In this post, we’ll focus on time sharing, priority, and interrupting threads - which are managed through a concept deeply woven into CRuby - a set of functions the VM calls at specific checkpoints.

In the CRuby runtime, this concept revolves around “interrupts”[^4] - a set of possible events that could “interrupt” the flow of a Ruby program in general, and of individual threads in particular. There are several possible interrupt events:

- Timer interrupt
- Trap interrupt
- Pending interrupt
- Terminate interrupt
- VM barrier interrupt
- Postponed job interrupt

In the CRuby internals, these are represented by an integer mask. If we took the C code and represented it in Ruby, it would look like this (each mask is a hex value):

```ruby
TIMER_INTERRUPT_MASK         = 0x01
PENDING_INTERRUPT_MASK       = 0x02
POSTPONED_JOB_INTERRUPT_MASK = 0x04
TRAP_INTERRUPT_MASK          = 0x08
TERMINATE_INTERRUPT_MASK     = 0x10
VM_BARRIER_INTERRUPT_MASK    = 0x20
```
&lt;h3 id=&#34;integer-masks&#34;&gt;Why integer masks?&lt;/h3&gt;

What is an integer mask, and why would CRuby represent internal states this way?

&gt; 📝 “mask” is a conventional name for a pattern to isolate specific bits. A “mask” sounds like something that would cover up something else (ie, a mask covering your face). In a sense, these serve a similar purpose - the mask is put on or taken off, clearing or setting bits and testing them efficiently.

Integer masks provide a compact and efficient way to represent multiple program states within a single number. Each state is stored as a single bit - representing a power of 2 - so in concept a 64-bit integer can represent up to 64 independent on/off states. Here’s a visualization of the first 8 bits in a number (a byte):

	0 0 0 0 0 0 0 0
	| | | | | | | |
	| | | | | | | +- 1        (2^0)     0x01
	| | | | | | +--- 2        (2^1)     0x02
	| | | | | +----- 4        (2^2)     0x04
	| | | | +------- 8        (2^3)     0x08
	| | | +--------- 16       (2^4)     0x10
	| | +----------- 32       (2^5)     0x20
	| +------------- 64       (2^6)     0x40
	+--------------- 128      (2^7)     0x80
	      Byte       Decimal  Power     Hex

&gt; This is an adaptation of a visual from a blog post about bitwise operations: [https://www.hendrik-erz.de/post/bitwise-flags-are-beautiful-and-heres-why](https://www.hendrik-erz.de/post/bitwise-flags-are-beautiful-and-heres-why)

Being able to represent all this information in a compact way is convenient, but checking whether it matches a particular mask is also _very_ fast. CPUs are well optimized for this sort of thing.

There are several operators for performing these checks called “bitwise” operators. Here’s a table of the operators, and the impact they have on bits:

&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;&lt;/th&gt;
			&lt;th&gt;&lt;/th&gt;
			&lt;th&gt;
				AND
			&lt;/th&gt;
			&lt;th&gt;
				OR
			&lt;/th&gt;
			&lt;th&gt;
				XOR
			&lt;/th&gt;
			&lt;th&gt;
				NOT
			&lt;/th&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;th&gt;
				A
			&lt;/th&gt;
			&lt;th&gt;
				B
			&lt;/th&gt;
			&lt;th&gt;
				A &amp; B
			&lt;/th&gt;
			&lt;th&gt;
				A | B
			&lt;/th&gt;
			&lt;th&gt;
				A ^ B
			&lt;/th&gt;
			&lt;th&gt;
				~A
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				1
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
			&lt;td&gt;
				0
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

We can demonstrate using the Ruby binary syntax[^5]:

- Bitwise AND:

```ruby
  0b00000101 &amp;
  0b00000110
# 0b00000100
```
- Bitwise OR:

```ruby
  0b00000101 |
  0b00000110
# 0b00000111
```
- Bitwise XOR:

```ruby
  0b00000101 ^
  0b00000110
# 0b00000011
```
- Bitwise NOT:

```ruby
  (~0b11111000) &amp; 0xFF
# 0b00000111
```
&gt; The `&amp; 0xFF` forces us into 8-bits to demonstrate the `NOT` correctly

These aren’t relevant to masks specifically, but for completeness, you can also shift bits left or right to change values:

	  0b00000101 &lt;&lt; 1 # Left shift
	# 0b00001010

	  0b00000101 &gt;&gt; 1 # Right shift
	# 0b00000010

&lt;h3 id=&#34;checking-for-interrupts&#34;&gt;Checking for interrupts&lt;/h3&gt;

These efficient mask checks matter because interrupts are checked _a lot_.

Here’s a program that simply iterates for a little while, incrementing a counter[^6]:

```ruby
i = 0
while i &lt; 500_000
  i += 1
end
```
This will check interrupts _five hundred thousand times_[^7], one check for each iteration of the loop. That’s a lot. And if your programming language is going to do something a lot, it needs to be efficient. The overhead of checking for interrupts should be undetectable in your Ruby program. As discussed earlier, bit mask checks are one of the most efficient checks you can make.

But why does this innocuous program need to check for interrupts so often? It’s part of the thread scheduler opt-in! The Ruby virtual machine is filled with checkpoints where it is safe for Ruby internals to check for interruptions in the program. One of those checkpoints is an `if` statement (did you think I’d say `while` loop?!).

Let’s [disassemble this into Ruby bytecode](https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html#inside-the-vm):

```ruby
puts RubyVM::InstructionSequence.compile(
  DATA.read
).disassemble
	
__END__
i = 0
while i &lt; 500_000
  i += 1
end
```
Which gives us the following:

	0000 putobject_INT2FIX_0_
	0001 setlocal_WC_0           i@0    # | i = 0
	0003 jump                    16     # | jump to while i &lt; ... 
	...
	0009 getlocal_WC_0           i@0    # | i += 1
	0011 putobject_INT2FIX_1_           # |
	0012 opt_plus                       # |
	0014 setlocal_WC_0           i@0    # |____________
	0016 getlocal_WC_0           i@0    # | i &lt; 500_000
	0018 putobject               500000 # |
	0020 opt_lt                         # |
	0022 branchif                9      # | jump to instruction 9,
	                                    # | which is i += 1
	                                    # |____________
	0024 putnil
	0025 leave

For the moment you can trust me that `branchif` is the critical section here. Let’s see how `branchif` is defined:

	DEFINE_INSN
	branchif
	(OFFSET dst)
	(VALUE val)
	()
	{
	    if (RTEST(val)) {
	        RUBY_VM_CHECK_INTS(ec);
	        JUMP(dst);
	    }
	}

&gt; ❗️Woah! What the heck is that weird syntax? Is that Ruby? Is that C?
&gt; 
&gt; It’s neither! This is a special, CRuby internal specific DSL that is _similar_ to C. In CRuby, there is a file called `insns.def` which defines every instruction the Ruby Virtual Machine (YARV) can run. 
&gt; 
&gt; - `DEFINE_INSN` tells us we are defining an instruction
&gt; - `branchif` is the instruction name
&gt; - `OFFSET dst` is the argument - 9 in our case, which would take us to `0009 getlocal_WC_0`
&gt; - `VALUE val` is the last value pushed on the stack - the result of `i &lt; 500_000`
&gt; - `()` that last empty set of parens is the optional return value - we don’t have one - we jump if `val` is true, or we fall through

Interesting! A couple of things stick out to me here when `RTEST(val)` (our `while` condition) is true:

1. We’re running `RUBY_VM_CHECK_INTS` anytime we call an `if` statement. `RUBY_VM_CHECK_INTS` is a key function for checking the interrupt queue. It’s embedded within VM instructions themselves!
2. We `JUMP` to a destination[^8]

&gt; Fun fact: one of the places `RUBY_VM_CHECK_INTS` is called is from the `once` bytecode instruction. An unexpected callback to my article [The o in Ruby regex stands for “oh the humanity!”](https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html)

&lt;h3 id=&#34;jumping-to-instructions&#34;&gt;Jumping to instructions&lt;/h3&gt;

In the typical case of a `branchif`, it would jump to the appropriate part of an `if` statement:

```ruby
if is_it_true?
  # if it is true, jump here!
else
  # if it isn&#39;t, jump here!
end
```
What caught my eye is that internally an `if` statement basically acts like `goto`! Sorry [Dijkstra](https://en.wikipedia.org/wiki/Considered_harmful)[^9].

And because `branchif` can jump anywhere you tell it, that also means it can jump to _previous_ code as well. In the case of a `while` loop, `branchif` truly takes on its `goto` roots. Instead of jumping to a future piece of code, it reruns the content of the `while` loop by jumping back to earlier instructions!
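
To see the `if` flavor of this yourself, disassemble a plain conditional (output trimmed, and the exact instructions can vary by Ruby version):

```ruby
puts RubyVM::InstructionSequence.compile(&lt;&lt;~RUBY).disassemble
  if rand &lt; 0.5
    :heads
  else
    :tails
  end
RUBY
# Look for the branch instruction (branchunless here) deciding
# which arm of the conditional to jump to
```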

&lt;h3 id=&#34;pervasive-interrupts&#34;&gt;Pervasive interrupts&lt;/h3&gt;

Want to double the number of checks from our example? Let’s `add` a method:

```ruby
def add(a, b)
  a + b
end
	
i = 0
j = 0
while i &lt; 500_000
  i += 1
  j = add(i, j)
end
```
Now CRuby checks the interrupt queue _one million times_[^10]! That’s because of the `opt_send_without_block` instruction, which is one of the instructions for Ruby method calls:

	DEFINE_INSN
	opt_send_without_block
	(CALL_DATA cd)
	(...)
	(VALUE val)
	{
	    // ...
	    val = vm_sendish(ec, GET_CFP(), cd, bh, mexp_search_method);
	    // Before returning from exec, int check!
	    JIT_EXEC(ec, val);
	    // ...
	}

&gt; 👆More of that fancy CRuby DSL

We know that interrupts are woven into VM instructions themselves in `insns.def`, but that&#39;s not the only place. Interrupts are checked throughout CRuby. For example, in:

- IO
- Threads
- Processes
- The Regex engine
- Bignum


And you&#39;ll find the checks in various forms: `RUBY_VM_CHECK_INTS_BLOCKING`, `RUBY_VM_CHECK_INTS`, `rb_thread_check_ints`, `vm_check_ints_blocking`, `vm_check_ints`, etc.


We know _what_ gets called to check for interrupts - but how do these &#34;ints&#34; get set by CRuby? 

&lt;h3 id=&#34;interrupt-masks&#34;&gt;Interrupt masks&lt;/h3&gt;

CRuby has macros for setting each of the interrupt flags:

```c
#define RUBY_VM_SET_TIMER_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, TIMER_INTERRUPT_MASK)
#define RUBY_VM_SET_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, PENDING_INTERRUPT_MASK)
#define RUBY_VM_SET_POSTPONED_JOB_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, POSTPONED_JOB_INTERRUPT_MASK)
#define RUBY_VM_SET_TRAP_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, TRAP_INTERRUPT_MASK)
#define RUBY_VM_SET_TERMINATE_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, TERMINATE_INTERRUPT_MASK)
#define RUBY_VM_SET_VM_BARRIER_INTERRUPT(ec)
  ATOMIC_OR((ec)-&gt;interrupt_flag, VM_BARRIER_INTERRUPT_MASK)
```
&gt; 📝 `ec` in these examples refers to the “execution context”, which contains per-thread information about the running Ruby program

This `ATOMIC_OR` macro is an abstraction on top of bitwise operations that stays roughly as efficient, but makes sure the operations run atomically. Multiple operating system threads can run this at the same time - `ATOMIC_OR` helps to avoid [read-modify-write](https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write) issues.
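
To get a feel for what the atomicity buys us, here’s a minimal Ruby sketch - a `Mutex` standing in for the CPU-level atomic instruction the real macro uses:

```ruby
# Sketch: an &#34;atomic OR&#34; emulated with a lock. Without it, two
# threads could both read the same old value, OR in their own mask,
# and one write would clobber the other.
class AtomicFlag
  attr_reader :value

  def initialize
    @lock = Mutex.new
    @value = 0
  end

  def atomic_or(mask)
    @lock.synchronize { @value |= mask }
  end
end
```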

Those abstractions obscure the actual bitwise operation - we learned a bit about Ruby bitwise operations earlier, so let’s show these macros in Ruby form for clarity:

```ruby
class ExecutionContext
  def initialize
    @interrupt_flag = 0
  end

  def ruby_vm_set_timer_interrupt
    @interrupt_flag |= TIMER_INTERRUPT_MASK
    self
  end
	
  def ruby_vm_set_interrupt
    @interrupt_flag |= PENDING_INTERRUPT_MASK
    self
  end
	
  def ruby_vm_set_postponed_job_interrupt
    @interrupt_flag |= POSTPONED_JOB_INTERRUPT_MASK
    self
  end
	
  def ruby_vm_set_trap_interrupt
    @interrupt_flag |= TRAP_INTERRUPT_MASK
    self
  end
	
  def ruby_vm_set_terminate_interrupt
    @interrupt_flag |= TERMINATE_INTERRUPT_MASK
    self
  end
	
  def ruby_vm_set_vm_barrier_interrupt
    @interrupt_flag |= VM_BARRIER_INTERRUPT_MASK
    self
  end
end
```
Let&#39;s also add some binary conversion methods so we can conveniently print our results:

```ruby
def to_b(number)
  number.to_s(2).rjust(8, &#39;0&#39;)
end
	
class ExecutionContext
  # ...
  def interrupt_to_b
    to_b(@interrupt_flag)
  end
end
```
&gt;  📝 [Integer#to_s](https://docs.ruby-lang.org/en/master/Integer.html#method-i-to_s) can be handed a `base`, which converts to the specified base before returning as a string. In our case, we are converting it to base 2 to show it as binary. We then `rjust` to pad the left side with 0’s. So, for instance, `to_b(2)` returns `00000010`.

For reference, here are the CRuby interrupt masks and which bits they set in our `interrupt_flag`:

	0 0 0 0 0 0 0 0 = 0x0 = interrupt_flag
	    | | | | | |
	    | | | | | +- TIMER_INTERRUPT_MASK
	    | | | | +--- PENDING_INTERRUPT_MASK
	    | | | +----- POSTPONED_JOB_INTERRUPT_MASK
	    | | +------- TRAP_INTERRUPT_MASK
	    | +--------- TERMINATE_INTERRUPT_MASK
	    +----------- VM_BARRIER_INTERRUPT_MASK

Knowing that, and equipped with our `ExecutionContext` class, let&#39;s set some flags!

```ruby
def new_ec
  ExecutionContext.new
end
	
new_ec.ruby_vm_set_timer_interrupt.interrupt_to_b
# =&gt; &#34;00000001&#34;
new_ec.ruby_vm_set_interrupt.interrupt_to_b
# =&gt; &#34;00000010&#34;
new_ec.ruby_vm_set_postponed_job_interrupt.interrupt_to_b
# =&gt; &#34;00000100&#34;
new_ec.ruby_vm_set_trap_interrupt.interrupt_to_b
# =&gt; &#34;00001000&#34;
new_ec.ruby_vm_set_terminate_interrupt.interrupt_to_b
# =&gt; &#34;00010000&#34;
new_ec.ruby_vm_set_vm_barrier_interrupt.interrupt_to_b
# =&gt; &#34;00100000&#34;
	
new_ec.ruby_vm_set_timer_interrupt
  .ruby_vm_set_interrupt
  .ruby_vm_set_postponed_job_interrupt
  .ruby_vm_set_trap_interrupt
  .ruby_vm_set_terminate_interrupt
  .ruby_vm_set_vm_barrier_interrupt
  .interrupt_to_b
# =&gt; &#34;00111111&#34;
	
new_ec.ruby_vm_set_timer_interrupt
  .ruby_vm_set_interrupt
  .ruby_vm_set_trap_interrupt
  .ruby_vm_set_vm_barrier_interrupt
  .interrupt_to_b
# =&gt; &#34;00101011&#34;
```
It&#39;s nice that we understand how to set them, but they don’t do anything on their own. They must be interpreted by one of the opt-in functions. We’ve been beating around the bush long enough. We’re opting in, great. What do these opt-in functions actually _do_?

&lt;h3 id=&#34;masks-all-the-way&#34;&gt;It&#39;s masks all the way down&lt;/h3&gt;

Let’s start with `RUBY_VM_CHECK_INTS`. This is a macro that gets replaced with a call to `rb_vm_check_ints`. Inside `rb_vm_check_ints`, we call `RUBY_VM_INTERRUPTED_ANY`, and if that returns true we call `rb_threadptr_execute_interrupts`:

	#define RUBY_VM_CHECK_INTS(ec) rb_vm_check_ints(ec)
	static inline void
	rb_vm_check_ints(rb_execution_context_t *ec)
	{
	  if (UNLIKELY(RUBY_VM_INTERRUPTED_ANY(ec))) {
	    rb_threadptr_execute_interrupts(rb_ec_thread_ptr(ec), 0);
	  }
	}

We want to get to `rb_threadptr_execute_interrupts`, but what does `RUBY_VM_INTERRUPTED_ANY` do?

	static inline bool
	RUBY_VM_INTERRUPTED_ANY(rb_execution_context_t *ec)
	{
		// ...
	    return ATOMIC_LOAD_RELAXED(ec-&gt;interrupt_flag) &amp; ~(ec)-&gt;interrupt_mask;
	}

Seems simple. Let’s translate the code into our Ruby `ExecutionContext` class:

	class ExecutionContext
	  # ...
	  def ruby_vm_interrupted_any?
	    # (flag &amp; ~mask) != 0
	    (@interrupt_flag &amp; ~@interrupt_mask) != 0
	  end
	end

Then set a flag and try it:

	new_ec.ruby_vm_set_interrupt.ruby_vm_interrupted_any?
	# `ruby_vm_interrupted_any?&#39;: undefined method `~&#39; for nil
	# (@interrupt_flag &amp; ~@interrupt_mask) != 0
	#                    ^

Oops! I didn’t define `@interrupt_mask`. What is that exactly!? Looks like it’s defined alongside the `interrupt_flag` on the execution context.

	struct rb_execution_context_struct {
	  // ...
	  rb_atomic_t interrupt_flag;
	  rb_atomic_t interrupt_mask;
	  // ...
	}

&gt; 👺 It’s a mask! It’s a flag! It’s a… confusing mental model…
&gt; 
&gt; We have `interrupt_flag`, we have the various `*_INTERRUPT_MASK` constants, and now `interrupt_mask`. Getting a little lost? I was.
&gt; 
&gt; I think it’s helpful to think of `interrupt_flag` as `interrupt_pending`, and `interrupt_mask` as `interrupt_blocked`. `interrupt_flag` contains operations waiting to run, and `interrupt_mask` contains operations that are currently _blocked_ from running.

What is that `&amp; ~` business? Remember that `&amp;` is Bitwise AND, and will only return 1 if both bits are 1. `~` is Bitwise NOT, and will change 1s to 0s, and 0s to 1s. As an example, using the `TRAP_INTERRUPT_MASK`:

	 0 0 0 0 0 0 0 0 = 0x0 = interrupt_flag
	 0 0 0 0 0 0 0 0 = 0x0 = interrupt_mask
	         |
	         +------- TRAP_INTERRUPT_MASK
	
	 0 0 0 0 1 0 0 0 &amp;     # interrupt_flag
	~0 0 0 0 1 0 0 0       # interrupt_mask
	
	 0 0 0 0 1 0 0 0 &amp;     # interrupt_flag
	 1 1 1 1 0 1 1 1       # interrupt_mask
	
	 0 0 0 0 0 0 0 0       # TRAP_INTERRUPT_MASK is blocked!
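
Or, in Ruby:

```ruby
flag = 0b00001000 # TRAP_INTERRUPT_MASK is pending...
mask = 0b00001000 # ...but also blocked

(flag &amp; ~mask) != 0       # =&gt; false - the interrupt is masked
(flag &amp; ~0b00000000) != 0 # =&gt; true - with nothing masked, it fires
```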

It’s only used in a few places - but it seems to serve a role on critical paths, like preventing recursive calls within `Signal.trap` handlers:

	static int
	signal_exec(VALUE cmd, int sig)
	{
	    rb_execution_context_t *ec = GET_EC();
	    volatile rb_atomic_t old_interrupt_mask = ec-&gt;interrupt_mask;
	    // ...
	    ec-&gt;interrupt_mask |= TRAP_INTERRUPT_MASK;
	    // run signal handlers like Signal#trap
	    ec-&gt;interrupt_mask = old_interrupt_mask;
	    // ...
	}

Because the `interrupt_mask` matches the `interrupt_flag`, `RUBY_VM_INTERRUPTED_ANY` won’t allow us to recursively trigger a signal handler. If we were to remove the `interrupt_mask` check, this code would call itself recursively forever and stack overflow:

```ruby
pid = fork do
  Signal.trap(&#34;TERM&#34;) do
    Process.kill(&#34;TERM&#34;, Process.pid)
  end
end
Process.kill(&#34;TERM&#34;, pid)
Process.waitall
	
# Process.kill&#39;: stack level too deep (SystemStackError)
```
&gt; But _with_ the interrupt block code, it just runs forever, endlessly queueing up another trap. It’s kind of hard to find a compelling example of this mask - it mostly seems like very defensive programming!

If you’ve ever written a signal handler in Rails and tried using `Rails.logger`, you’ve hit the `interrupt_mask`.

```ruby
trap(&#34;TERM&#34;) do
  Rails.logger.info(&#34;TRAP fired!&#34;)
end
Process.kill(&#34;TERM&#34;, Process.pid)
# log writing failed. can&#39;t be called from trap context
```
This is because `Mutex#lock` raises an error if it is used inside an interrupt trap. If the interrupt mask has `TRAP_INTERRUPT_MASK` set, it means we&#39;re running in a `trap` and blocking any more trap interrupts from firing:

	static VALUE
	do_mutex_lock(VALUE self, int interruptible_p)
	{
	  rb_execution_context_t *ec = GET_EC();
	  rb_thread_t *th = ec-&gt;thread_ptr;
	
	  if (th-&gt;ec-&gt;interrupt_mask &amp; TRAP_INTERRUPT_MASK) {
	    rb_raise(rb_eThreadError, &#34;can&#39;t be called from trap context&#34;);
	  }
	  // ...
	}

Internally, `Rails.logger` is a `Logger` instance from the [`logger`](https://rubygems.org/gems/logger/versions/1.7.0?locale=en) gem. That `Logger` writes to logs using a `LogDevice`, which includes `MonitorMixin` - giving it a built-in `Monitor` instance to `synchronize` with:

```ruby
class Logger
  class LogDevice
    include MonitorMixin
	
    def write(message)
      handle_write_errors(&#34;writing&#34;) do
        synchronize do # We can&#39;t lock a mutex in a signal!
          # ...
        end
      end
    end
  end
end
```
&gt; 📝 You can learn more about `Monitor`s and `synchronize` in my post on [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html#monitor)

For completeness, let’s add `@interrupt_mask` to our `ExecutionContext` class:

```ruby
class ExecutionContext
  def initialize
    @interrupt_flag = 0
    @interrupt_mask = 0
  end
  # ...
	
  def with_interrupt_mask(mask)
    old_interrupt_mask = @interrupt_mask
    @interrupt_mask |= mask
    yield # run the block with the mask applied
  ensure
    @interrupt_mask = old_interrupt_mask
  end
	
  def ruby_vm_interrupted_any?
    # (flag &amp; ~mask) != 0
    (@interrupt_flag &amp; ~@interrupt_mask) != 0
  end
	
  def mask_to_b
    to_b(@interrupt_mask)
  end
end
```
Now `#ruby_vm_interrupted_any?` should work! And we can create sections where we block certain interrupts from being fired:

```ruby
ec = ExecutionContext.new
ec.ruby_vm_set_trap_interrupt
ec.ruby_vm_interrupted_any? # =&gt; true
ec.with_interrupt_mask(TRAP_INTERRUPT_MASK) do
  ec.ruby_vm_interrupted_any? # =&gt; false
end
```
&lt;h3 id=&#34;weve-been-waiting&#34;&gt;The `interrupt`ion we&#39;ve all been waiting for&lt;/h3&gt;

Ok, now we know how to check and block the flags, we know the general places they are checked, and we know why it’s valuable for those checks to be efficient. Let’s look at what this has all led up to. What actually happens when an interrupt is detected? We’ll break it down piece-by-piece, but here’s the full function to start. Understanding this function gives us insight into when Ruby decides to yield, raise exceptions, and deliver signals:

```c
int
rb_threadptr_execute_interrupts(rb_thread_t *th, int blocking_timing)
{
  rb_atomic_t interrupt;
  int postponed_job_interrupt = 0;
  int ret = FALSE;
	
  if (th-&gt;ec-&gt;raised_flag) return ret;
	
  while ((interrupt = threadptr_get_interrupts(th)) != 0) {
    int sig;
    int timer_interrupt;
    int pending_interrupt;
    int trap_interrupt;
    int terminate_interrupt;
	
    timer_interrupt = interrupt &amp; TIMER_INTERRUPT_MASK;
    pending_interrupt = interrupt &amp; PENDING_INTERRUPT_MASK;
    postponed_job_interrupt = interrupt &amp; POSTPONED_JOB_INTERRUPT_MASK;
    trap_interrupt = interrupt &amp; TRAP_INTERRUPT_MASK;
    terminate_interrupt = interrupt &amp; TERMINATE_INTERRUPT_MASK; // request from other ractors
	
    if (interrupt &amp; VM_BARRIER_INTERRUPT_MASK) {
      RB_VM_LOCKING();
    }
	
    if (postponed_job_interrupt) {
      rb_postponed_job_flush(th-&gt;vm);
    }
	
    if (trap_interrupt) {
      /* signal handling */
      if (th == th-&gt;vm-&gt;ractor.main_thread) {
        enum rb_thread_status prev_status = th-&gt;status;

        th-&gt;status = THREAD_RUNNABLE;
        {
          while ((sig = rb_get_next_signal()) != 0) {
            ret |= rb_signal_exec(th, sig);
          }
        }
        th-&gt;status = prev_status;
      }
	
      if (!ccan_list_empty(&amp;th-&gt;interrupt_exec_tasks)) {
        enum rb_thread_status prev_status = th-&gt;status;

        th-&gt;status = THREAD_RUNNABLE;
        {
          threadptr_interrupt_exec_exec(th);
        }
        th-&gt;status = prev_status;
      }
    }
	
    /* exception from another thread */
    if (pending_interrupt &amp;&amp; threadptr_pending_interrupt_active_p(th)) {
      VALUE err = rb_threadptr_pending_interrupt_deque(th, blocking_timing ? INTERRUPT_ON_BLOCKING : INTERRUPT_NONE);
      ret = TRUE;
	
      if (UNDEF_P(err)) {
        /* no error */
      }
      else if (err == RUBY_FATAL_THREAD_KILLED        /* Thread#kill received */   ||
               err == RUBY_FATAL_THREAD_TERMINATED   /* Terminate thread */       ||
               err == INT2FIX(TAG_FATAL) /* Thread.exit etc. */         ) {
        terminate_interrupt = 1;
      }
      else {
        if (err == th-&gt;vm-&gt;special_exceptions[ruby_error_stream_closed]) {
          /* the only special exception to be queued across thread */
          err = ruby_vm_special_exception_copy(err);
        }
        /* set runnable if th was slept. */
        if (th-&gt;status == THREAD_STOPPED ||
            th-&gt;status == THREAD_STOPPED_FOREVER)
          th-&gt;status = THREAD_RUNNABLE;
        rb_exc_raise(err);
      }
    }
	
    if (terminate_interrupt) {
      rb_threadptr_to_kill(th);
    }
	
    if (timer_interrupt) {
      uint32_t limits_us = thread_default_quantum_ms * 1000;
	
      if (th-&gt;priority &gt; 0)
        limits_us &lt;&lt;= th-&gt;priority;
      else
        limits_us &gt;&gt;= -th-&gt;priority;
	
      if (th-&gt;status == THREAD_RUNNABLE)
        th-&gt;running_time_us += 10 * 1000; // 10ms = 10_000us
	
      EXEC_EVENT_HOOK(th-&gt;ec, RUBY_INTERNAL_EVENT_SWITCH, th-&gt;ec-&gt;cfp-&gt;self,
                      0, 0, 0, Qundef);
	
      rb_thread_schedule_limits(limits_us);
    }
  }
  return ret;
}
```
There it is. Guess we’re done here! See you next time!

What a joker I am 🙄.

Let’s start walking through the function. Most of it lives inside a `while` loop, which sets `interrupt` to the return value of `threadptr_get_interrupts`. That function returns the current `interrupt_flag &amp; ~interrupt_mask`, clearing out everything in `ec-&gt;interrupt_flag` in the process (except what was hidden by `ec-&gt;interrupt_mask`). We continue iterating as long as the interrupt flag doesn’t come back as `0`:

	int
	rb_threadptr_execute_interrupts(rb_thread_t *th, int blocking_timing)
	{
	  rb_atomic_t interrupt;
	  int postponed_job_interrupt = 0;
	  int ret = FALSE;
	
	  // ...
	
	  while ((interrupt = threadptr_get_interrupts(th)) != 0) {
	    // ...
	  }
	}

Why are we using `while` on a single int field, when we check every mask at once? Because while we’re handling the values we already pulled from `interrupt_flag`, it’s possible new bits have been set. If another mask gets set while we’re processing the current interrupts, we keep looping until `threadptr_get_interrupts` comes back with `0`.
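
Here’s a minimal Ruby sketch of that drain pattern, just to make the shape of the loop concrete. The names and the `Mutex` are mine - CRuby drains the real `interrupt_flag` with atomic operations in C:

```ruby
@flag_lock = Mutex.new
@interrupt_flag = 0b0101 # pretend two interrupts are currently set

def get_interrupts(mask = 0)
  @flag_lock.synchronize do
    pending = @interrupt_flag &amp; ~mask # take everything not hidden by the mask
    @interrupt_flag &amp;= mask           # clear what we took
    pending
  end
end

while (interrupt = get_interrupts) != 0
  # handle each set bit - new flags may arrive while we work,
  # so we keep draining until a pass comes back with 0
end
```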

Next up, we use a bitwise AND to check which masks are currently set. If a mask is set, the `int` will be a non-zero value (truthy), otherwise `0` (falsey). We’ll use those in the `if` statements later on:

```c
int
rb_threadptr_execute_interrupts(rb_thread_t *th, int blocking_timing)
{
  // ...
	
  while ((interrupt = threadptr_get_interrupts(th)) != 0) {
    int sig;
    int timer_interrupt;
    int pending_interrupt;
    int trap_interrupt;
    int terminate_interrupt;
	
    timer_interrupt = interrupt &amp; TIMER_INTERRUPT_MASK;
    pending_interrupt = interrupt &amp; PENDING_INTERRUPT_MASK;
    postponed_job_interrupt = interrupt &amp; POSTPONED_JOB_INTERRUPT_MASK;
    trap_interrupt = interrupt &amp; TRAP_INTERRUPT_MASK;
    terminate_interrupt = interrupt &amp; TERMINATE_INTERRUPT_MASK; // request from other ractors
    // ...
  }
}
```
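To make those truthy/falsey checks concrete, here’s the same masking in Ruby - the bit values are illustrative stand-ins, not necessarily CRuby’s exact constants:

```ruby
TIMER_INTERRUPT_MASK   = 0b00001 # illustrative values
PENDING_INTERRUPT_MASK = 0b00010
TRAP_INTERRUPT_MASK    = 0b01000

interrupt = TIMER_INTERRUPT_MASK | TRAP_INTERRUPT_MASK

interrupt &amp; TIMER_INTERRUPT_MASK   # =&gt; 1, non-zero (truthy in C)
interrupt &amp; TRAP_INTERRUPT_MASK    # =&gt; 8, non-zero (truthy in C)
interrupt &amp; PENDING_INTERRUPT_MASK # =&gt; 0 (falsey in C)
```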
&lt;h4 id=&#34;timer-interrupt-mask&#34;&gt;`TIMER_INTERRUPT_MASK`&lt;/h4&gt;

Now we start checking for work. Let’s start with time slice and priority handling with `TIMER_INTERRUPT_MASK`. In [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs) I discussed how context gets switched between threads in Ruby:

&gt; There are two common reasons context gets switched between threads in CRuby, which can result in operations only partially completing (ie, setting the proper result, then checking that result):
&gt; 
&gt; 1. ~100ms of Ruby processing have elapsed
&gt; 2. A blocking operation has been invoked


The `TIMER_INTERRUPT_MASK` condition is where we check for that processing time[^11]. On Linux/Unix, CRuby maintains a timer thread which typically checks for work every 10ms. As part of that, it calls `RUBY_VM_SET_TIMER_INTERRUPT`, which sets the `TIMER_INTERRUPT_MASK`. 

![](https://cdn.uploads.micro.blog/98548/2025/threads-timer-diagram.png)

The timer interrupt is fairly straightforward:

1. Get the current “quantum” (the CRuby name for the amount of time each thread can run before being context switched)
2. Use the thread’s priority to increase or decrease that time above or below the default
3. Assume the thread ran for 10ms before this code, and add 10ms to `running_time_us`
4. Fire an event hook notifying interested plugins that a thread context switch is happening
5. Call `rb_thread_schedule_limits` to yield if the thread is over its limit

```c
if (timer_interrupt) {
  uint32_t limits_us = thread_default_quantum_ms * 1000;
	
  if (th-&gt;priority &gt; 0)
    limits_us &lt;&lt;= th-&gt;priority;
  else
    limits_us &gt;&gt;= -th-&gt;priority;
	
  if (th-&gt;status == THREAD_RUNNABLE)
    th-&gt;running_time_us += 10 * 1000; // 10ms = 10_000us
	
  EXEC_EVENT_HOOK(th-&gt;ec,
    RUBY_INTERNAL_EVENT_SWITCH, th-&gt;ec-&gt;cfp-&gt;self,
    0, 0, 0, Qundef);
	
  rb_thread_schedule_limits(limits_us);
}
```
&gt; Since Ruby 3.4, you can set your own `thread_default_quantum_ms` using the env variable [`RUBY_THREAD_TIMESLICE`](https://bugs.ruby-lang.org/issues/20861). This means the long-held CRuby constant of 100ms time slices is now adjustable, and folks have been adjusting it to handle [different CPU saturated workloads](https://github.com/sidekiq/sidekiq/discussions/5039#discussioncomment-14064274).
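
&gt; If you want to try it, it’s just an environment variable read at startup (the script name here is made up):
&gt;
&gt;     # Ruby 3.4+: quantum in milliseconds
&gt;     RUBY_THREAD_TIMESLICE=10 ruby busy_workers.rb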

`rb_thread_schedule_limits` checks if the thread is over its allotted running time, and yields if so:

	static void
	rb_thread_schedule_limits(uint32_t limits_us)
	{
	  rb_thread_t *th = GET_THREAD();
	
	  if (th-&gt;running_time_us &gt;= limits_us) {
	    thread_sched_yield(TH_SCHED(th), th);
	    rb_ractor_thread_switch(th-&gt;ractor, th, true);
	  }
	}

We’ve discussed bit manipulation quite a bit - it feels negligent not to briefly cover the right and left bit shifts used for priority 🤷🏻‍♂️.

	if (th-&gt;priority &gt; 0)
	  limits_us &lt;&lt;= th-&gt;priority;
	else
	  limits_us &gt;&gt;= -th-&gt;priority;

If `th-&gt;priority` is greater than zero, we shift every bit left. If not, we negate the priority (so negative priorities turn positive) and shift every bit right. We can easily demonstrate how this works in Ruby, using milliseconds (`ms`) instead of microseconds (`us`) for simplicity:

```ruby
def calculate_priority(priority, limit)
  priority &gt; 0 ? limit &lt;&lt; priority : limit &gt;&gt; -priority
end
	
# helper to visualize the bits (not part of CRuby - just for this demo)
def to_b(num)
  num.to_s(2).rjust(8, &#34;0&#34;)
end
	
calculate_priority(0, 100)        # =&gt; 100
calculate_priority(2, 100)        # =&gt; 400
calculate_priority(-2, 100)       # =&gt; 25
to_b(100)                         # =&gt; 01100100  = 100
to_b(calculate_priority(0, 100))  # =&gt; 01100100  = 100
to_b(calculate_priority(2, 100))  # =&gt; 110010000 = 400
to_b(calculate_priority(-2, 100)) # =&gt; 00011001  = 25
	
#  01100100    01100100
#    &lt;&lt; 2        &gt;&gt; 2
# 110010000    00011001
```
That means that at the default quantum of 100ms, if you give a CRuby thread a priority of 2, it will be given 400ms of runtime before being forced to switch! And -2 means your thread will only run for 25ms at a time. When we shift right, we lose bits, which is why the value is lower.
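
You can play with this from Ruby via `Thread#priority=`. A quick sketch - the time slices assume the default 100ms quantum:

```ruby
cruncher = Thread.new { loop { } } # CPU-bound, never blocks

cruncher.priority      # =&gt; 0, the default: ~100ms time slices
cruncher.priority = 2  # ~400ms time slices
cruncher.priority = -2 # ~25ms time slices

cruncher.kill
```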

&lt;h4 id=&#34;trap-interrupt-mask&#34;&gt;`TRAP_INTERRUPT_MASK`&lt;/h4&gt;

Now we’re onto signal handling using `TRAP_INTERRUPT_MASK`. The first part is what you might expect from a “trap” interrupt - signal handling. According to this code - you’ll only _ever_ run trap handlers on the main thread. If there is a trap mask and we aren’t on the main thread, we ignore it:

	/* signal handling */
	if (th == th-&gt;vm-&gt;ractor.main_thread) {
	  // ...
	}

The thread in this Ruby example will always equal `Thread.main`:

	trap(&#34;INT&#34;) do
	  puts &#34;hello from #{Thread.current}: #{Thread.current == Thread.main}&#34;
	  # =&gt; hello from #&lt;Thread:0x000000010445b2a8 run&gt;: true
	end

Next we iterate through each available signal. If multiple signals are pending, we process them all here. `rb_signal_exec` internally calls `signal_exec`, which we looked at earlier:

	while ((sig = rb_get_next_signal()) != 0) {
	  ret |= rb_signal_exec(th, sig);
	}

![](https://cdn.uploads.micro.blog/98548/2025/threads-signal-handling.drawio.png)
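
We can poke at this from Ruby (a POSIX-only example). One caveat: pending signals of the same type can coalesce at the OS level, so you won’t always see one handler run per `kill`:

```ruby
trap(&#34;USR1&#34;) do
  puts &#34;USR1 handled on main thread? #{Thread.current == Thread.main}&#34;
end

3.times { Process.kill(&#34;USR1&#34;, Process.pid) }
sleep 0.1 # give a safe point a chance to run the handler(s)
# =&gt; USR1 handled on main thread? true
```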

Prior to Ruby 3.4, that was the exclusive purpose of `TRAP_INTERRUPT_MASK`. Ruby 3.4+ also uses it to alert other threads that there is work for them to execute. Work gets added to a thread’s `interrupt_exec_tasks` list, and when that thread next checks its interrupts it runs the tasks via `threadptr_interrupt_exec_exec`:

	if (!ccan_list_empty(&amp;th-&gt;interrupt_exec_tasks)) {
	  // ...
	  threadptr_interrupt_exec_exec(th);
	  // ...
	}

`threadptr_interrupt_exec_exec` runs the requested task (a function), either in a new thread, or inline:

	if (task-&gt;flags &amp; rb_interrupt_exec_flag_new_thread) {
	  rb_thread_create(task-&gt;func, task-&gt;data);
	}
	else {
	  (*task-&gt;func)(task-&gt;data);
	}

It seems generally handy, but it was introduced for a specific purpose: supporting `require` and `autoload` inside of Ractors:

	# Ruby &lt; 3.4
	Ractor.new { pp &#34;hey there!&#34; } # autoloads `pp`
	# =&gt; `require&#39;: can not access non-shareable objects in constant Kernel::RUBYGEMS_ACTIVATION_MONITOR by non-main ractor. (Ractor::IsolationError)
	Ractor.new {
	  require &#34;json&#34;
	  puts JSON.parse(&#39;&#34;hey there!&#34;&#39;)
	}
	# =&gt; `require&#39;: can not access non-shareable objects in constant Kernel::RUBYGEMS_ACTIVATION_MONITOR by non-main ractor. (Ractor::IsolationError)
	
	# Ruby &gt;= 3.4
	Ractor.new { pp &#34;hey there!&#34; }
	# =&gt; &#34;hey there!&#34;
	Ractor.new {
	  require &#34;json&#34;
	  puts JSON.parse(&#39;&#34;hey there!&#34;&#39;)
	}
	# =&gt; hey there!

Requiring a gem involves accessing non-shareable objects, and Ractors cannot access any state that is non-shareable. The only Ractor with access to these non-shareable objects is the main Ractor, `Ractor.main`. To get around this, non-main Ractors add a task to the `interrupt_exec_tasks` list on the main Ractor’s thread, and set `TRAP_INTERRUPT_MASK`:

	rb_ractor_t *main_r = GET_VM()-&gt;ractor.main_ractor;
	// for `require` calls
	rb_ractor_interrupt_exec(main_r, ractor_require_func...)
	// for autoloading, like when calling `pp`
	rb_ractor_interrupt_exec(main_r, ractor_autoload_load_func...)
	
	// ...
	// ractor_require_func/ractor_autoload_load_func are
	//   referenced in task-&gt;node
	ccan_list_add_tail(&amp;th-&gt;interrupt_exec_tasks, &amp;task-&gt;node);

![](https://cdn.uploads.micro.blog/98548/2025/threads-ractor-require.drawio.png)

&lt;h4 id=&#34;pending-interrupt-mask&#34;&gt;`PENDING_INTERRUPT_MASK`&lt;/h4&gt;

Now we’ve got the heavy-hitter of thread interrupts - `PENDING_INTERRUPT_MASK`. It’s not clear from the name, but this mask gets set by `Thread#raise` and `Thread#kill`. It doesn’t get much more interruptive than arbitrarily raising an error inside a thread, or killing it outright.

It’s called `PENDING_INTERRUPT_MASK` because it indicates there are errors waiting to be evaluated in the thread’s `pending_interrupt_queue`. Every thread has a `pending_interrupt_queue`, and it manages the interrupts that have been enqueued by calls like `Thread#raise` and `Thread#kill`. Sometimes those interrupts are actual error instances (`Thread#raise`), and sometimes they are integer flags (`Thread#kill`).

We start off by checking if there are any active pending interrupts in the queue. If there are, we dequeue the first available interrupt. The `blocking_timing` argument relates to the `#handle_interrupt` method, and we’ll dig into both next time in “When good threads go bad”. For now, just know it gives you the ability to defer when a thread gets interrupted:

	/* exception from another thread */
	if (pending_interrupt &amp;&amp; threadptr_pending_interrupt_active_p(th)) {
	  VALUE err = rb_threadptr_pending_interrupt_deque(th, blocking_timing ? INTERRUPT_ON_BLOCKING : INTERRUPT_NONE);
	  // ...
	}

![](https://cdn.uploads.micro.blog/98548/2025/threads-pending-errors.drawio.png)
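
As a tiny preview of that deferral, here’s a sketch using `RuntimeError` as an arbitrary example:

```ruby
t = Thread.new do
  Thread.handle_interrupt(RuntimeError =&gt; :never) do
    sleep 1 # a Thread#raise arriving here stays queued
  end
  # leaving the :never block, the pending interrupt gets delivered
end
t.report_on_exception = false

sleep 0.1
t.raise(RuntimeError, &#34;deferred&#34;)
t.join rescue puts &#34;raised only after the critical section&#34;
```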

We check if the dequeued interrupt is one of the flags set by `Thread#kill`/`Thread#terminate`/`Thread#exit`, all representing that the thread should be killed immediately. We set the `terminate_interrupt`, which later in the function triggers `rb_threadptr_to_kill`. This kills the thread and cannot be rescued:

	if (/* Thread#kill received */
	    err == RUBY_FATAL_THREAD_KILLED ||
	    /* Terminate thread */
	    err == RUBY_FATAL_THREAD_TERMINATED ||
	    /* Thread.exit etc. */
	    err == INT2FIX(TAG_FATAL)) {
	  terminate_interrupt = 1;
	}
	
	// ...
	// outside of the pending interrupt if statement
	if (terminate_interrupt) {
	  rb_threadptr_to_kill(th);
	}
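
That “cannot be rescued” part is easy to demonstrate from Ruby - not even `rescue Exception` sees a `Thread#kill`, though `ensure` blocks still run:

	t = Thread.new do
	  sleep
	rescue Exception =&gt; e
	  puts &#34;rescued #{e.class}&#34; # never prints for Thread#kill
	ensure
	  puts &#34;ensure still runs&#34;
	end
	
	sleep 0.1
	t.kill
	t.join
	t.alive? # =&gt; false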

If the error isn&#39;t one of the `Thread#kill` flags, it must be an actual Ruby exception. We make sure the thread is in a running state. Then we force it to raise an error at whatever point in the code it goes to execute next. This raises whatever error we set with `Thread#raise`:

	/* set runnable if th was slept. */
	if (th-&gt;status == THREAD_STOPPED ||
	    th-&gt;status == THREAD_STOPPED_FOREVER)
	  th-&gt;status = THREAD_RUNNABLE;
	rb_exc_raise(err);

Personally, I was surprised to find that these interrupts are stored in a queue! Can we prove it from Ruby code? Let’s try:

	CatchyError = Class.new(StandardError)
	
	class ErrorCatcher
	  def self.===(exception)
	    exception.message =~ /1|2|3/
	  end
	end
	
	t = Thread.new do
	  sleep
	rescue ErrorCatcher
	  redo
	rescue CatchyError
	  raise
	end
	
	sleep 0.1
	t.raise(CatchyError.new(&#39;1&#39;))
	t.raise(CatchyError.new(&#39;2&#39;))
	t.raise(CatchyError.new(&#39;3&#39;))
	t.raise(CatchyError.new(&#39;4&#39;))
	t.join
	# =&gt; #&lt;Thread:0x0000000120961420 (irb):84 run&gt; terminated with exception (report_on_exception is true):
	#   in &#39;Kernel#sleep&#39;: 4 (CatchyError)

In the above code:

- We set up a dynamic error matcher so we can raise the same error, but catch it differently depending on the message[^12]
- We `rescue` and `redo` if we get a `CatchyError` with `1`, `2`, or `3` as the message
- Even though we `t.raise` four times, only the fourth `CatchyError` is raised. If you change the regex to match `/1|2/`, it will fail on the third error instead
- It really is running through the queue of errors!

&lt;h4 id=&#34;terminate-interrupt-mask&#34;&gt;`TERMINATE_INTERRUPT_MASK`&lt;/h4&gt;

`TERMINATE_INTERRUPT_MASK` is pretty niche. You’ll remember this code from earlier - it’s triggered via the `pending_interrupt_queue` when `Thread#kill` is received:

	if (terminate_interrupt) {
	  rb_threadptr_to_kill(th);
	}

There are two ways to trigger that code:

1. Using `Thread#kill`, as we already learned
2. When a Ruby process is shutting down. As part of that shutdown, all Ractors are terminated, which sets `TERMINATE_INTERRUPT_MASK` on each of their threads (see the sketch below)
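
Here’s a small demonstration of the shutdown case - it’s part of why a sleeping thread can’t keep a process alive on its own:

	Thread.new { sleep } # never wakes up on its own
	
	puts &#34;main is done&#34;
	# When main exits, VM shutdown terminates the sleeping thread
	# through the same rb_threadptr_to_kill path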

&lt;h4 id=&#34;postponed-job-interrupt-mask&#34;&gt;`POSTPONED_JOB_INTERRUPT_MASK`&lt;/h4&gt;

Still in the niche-zone, we’ve got `POSTPONED_JOB_INTERRUPT_MASK`.

This mask is used when work needs to be performed, but can’t safely run in its current context. By making it an interrupt mask, the work can be inserted into a safe point for execution in the CRuby runtime:

	if (postponed_job_interrupt) {
	  rb_postponed_job_flush(th-&gt;vm);
	}

The `rb_postponed_job_flush` function iterates through work in the `postponed_job_queue`, calling each function in the queue. 

In CRuby, I can only find references to it in the `TracePoint` source code. In concept, it seems very similar to the `interrupt_exec_tasks` list used for `require` inside Ractors. I’m sure there is a CRuby committer who could explain this further - I’d be curious to understand it better!

&lt;h4 id=&#34;vm-barrier-interrupt-mask&#34;&gt;`VM_BARRIER_INTERRUPT_MASK`&lt;/h4&gt;

Not to be outdone by `TERMINATE_INTERRUPT_MASK` and `POSTPONED_JOB_INTERRUPT_MASK`, we’ve got king-niche: `VM_BARRIER_INTERRUPT_MASK`. When set, it runs `RB_VM_LOCKING()` on the thread:

	if (interrupt &amp; VM_BARRIER_INTERRUPT_MASK) {
	  RB_VM_LOCKING();
	}

It’s niche, but it plays an important role: giving a single operation exclusive access to the entire VM. This appears to have been introduced with Ractors in Ruby 3. That makes sense - Ractors are the first truly parallel unit of execution in Ruby.

Certain operations, like YJIT compiling bytecode, require exclusive access to the VM when running. For instance, when `rb_yjit_compile_iseq` is called, the first thing it does is call `rb_vm_barrier`:

	void
	rb_yjit_compile_iseq(const rb_iseq_t *iseq, rb_execution_context_t *ec, bool jit_exception)
	{
	    RB_VM_LOCKING() {
	        rb_vm_barrier();

`rb_vm_barrier` sets the `VM_BARRIER_INTERRUPT_MASK` on all running threads across Ractors, then waits for each to stop at the barrier:

	// interrupts all running threads
	rb_thread_t *ith;
	ccan_list_for_each(&amp;vm-&gt;ractor.sched.running_threads, ith, sched.node.running_threads) {
	  if (ith-&gt;ractor != cr) {
	    RUBY_VM_SET_VM_BARRIER_INTERRUPT(ith-&gt;ec);
	  }
	}
	
	// wait for other ractors
	while (!ractor_sched_barrier_completed_p(vm)) {
	  ractor_sched_set_unlocked(vm, cr);
	  rb_native_cond_wait(&amp;vm-&gt;ractor.sched.barrier_complete_cond, &amp;vm-&gt;ractor.sched.lock);
	  ractor_sched_set_locked(vm, cr);
	}

———

![](https://cdn.uploads.micro.blog/98548/2025/untitled-diagram-2025-10-22-051842.png)

😮‍💨 We dug _deep_ in this one. Bitmasks, CRuby internals, Thread management - what could be next? With all this knowledge, we’re primed and ready to dig into what to do when a thread goes rogue. See you next time in “When good threads go bad” 👋🏼

[^1]:	With [Aaron Patterson&#39;s PR](https://bugs.ruby-lang.org/issues/20861) for a configurable quantum, this can now be adjusted. But whatever it’s set to is still static during a program’s execution, and still defaults to 100ms

[^2]:	https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs

[^3]:	It can still happen, but it’s less likely. We’ll discuss ways it can happen later in the series

[^4]:	JRuby does as well! [https://github.com/jruby/jruby/blob/master/core/src/main/java/org/jruby/RubyThread.java#L822](https://github.com/jruby/jruby/blob/master/core/src/main/java/org/jruby/RubyThread.java#L822)

[^5]:	The different number syntaxes in Ruby (binary, octal, decimal, hex) are really just sugar on the Integer class. So when you run these code snippets you’ll actually see a decimal integer rather than the binary I show in the comment

[^6]:	If you don’t want to use a `while` loop, there’s a great alternative here: [https://bsky.app/profile/jpcamara.com/post/3m2ntcgzhe22c](https://bsky.app/profile/jpcamara.com/post/3m2ntcgzhe22c)

[^7]:	1,001,102, to be exact

    See [https://jpcamara.com/2024/11/28/counting-c-method.html](https://jpcamara.com/2024/11/28/counting-c-method.html) for how I got that number

[^8]:	If we simplified the loop to be `while(true); end`, we’d actually exclusively use the `jump` instruction: [https://redgetan.cc/understanding-timeouts-in-cruby/#6-working-examples](https://redgetan.cc/understanding-timeouts-in-cruby/#6-working-examples)

[^9]:	But interesting way to think about an if statement!

[^10]:	Fun fact - if you run this example with YJIT it goes back down to five hundred thousand. YJIT seems to inline the method call and that bypasses the ints check

[^11]:	This particular check is technically Linux/Unix-specific. On Windows, `thread_win32.c` is used, and it maintains its own timer thread and priority controls.

[^12]:	Thanks to the Honeybadger blog for the tip on dynamic exception matchers!

    [https://www.honeybadger.io/blog/level-up-ruby-rescue-with-dynamic-exception-matchers/](https://www.honeybadger.io/blog/level-up-ruby-rescue-with-dynamic-exception-matchers/)
</source:markdown>
    </item>
    
    <item>
      <title>The /o in Ruby regex stands for “oh the humanity!”</title>
      <link>https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html</link>
      <pubDate>Sat, 02 Aug 2025 00:16:06 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2025/08/02/the-o-in-ruby-regex.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2025/2ca9b9c306.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Your code using the &lt;code&gt;/o&lt;/code&gt; modifier&lt;/p&gt;
&lt;p&gt;Source: &lt;a href=&#34;https://upload.wikimedia.org/wikipedia/commons/1/1c/Hindenburg_disaster.jpg&#34;&gt;wikipedia&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Hi there! Do you like Regex? Do you like performance? Do you like creating confounding bugs for yourself rooted in the mechanics of the Ruby VM itself?&lt;/p&gt;
&lt;p&gt;If you said yes to all of the above, have I got a feature for you!&lt;/p&gt;
&lt;p&gt;But first, let’s start with a story.&lt;/p&gt;
&lt;h3 id=&#34;the-cliffs-of-insanity&#34;&gt;The cliffs of insanity&lt;/h3&gt;
&lt;p&gt;I was recently reviewing some code, and part of the functionality was about matching. A class took an array of strings, and you could call a method to see if an input matched part of any of the strings. Stripped down, it was effectively the following code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Matcher
  def initialize(matchables)
    @matchables = matchables
  end

  def matches_any?(input)
    @matchables.any? { |m| m.match?(/#{input}/io) }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;I &lt;em&gt;know&lt;/em&gt; there are some of you reading this code and thinking “does this really need a regex?”, “couldn’t it just use &lt;code&gt;include?&lt;/code&gt; and some downcasing?”, “does this even need to exist?”, etc, etc. I see you, I hear you, I’d probably think the same, and I &lt;em&gt;promise&lt;/em&gt; you the specifics of this method aren’t that important.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Functionally, the code looked &lt;em&gt;ok&lt;/em&gt; to me. I knew what &lt;code&gt;/i&lt;/code&gt; was (a regex in &lt;a href=&#34;https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Case-Insensitive+Mode&#34;&gt;case-insensitive mode&lt;/a&gt;), but I didn’t recognize &lt;code&gt;/o&lt;/code&gt;. It didn’t seem critically important to look up yet. Tests were not exhaustive but were green, and so I went to run the code in a console:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;ruby&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;something else&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;javascript&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Well, that seemed broken. Assuming it was a bug, I looked at the code to see what was wrong. But nothing stuck out. The code &lt;em&gt;seemed&lt;/em&gt; ok, and simple:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;@matchables.any? { |m| m.match?(/#{input}/io) }
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;So I deconstructed the code and ran it directly, outside the class:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;any? { &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;m&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt; m&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(&lt;span style=&#34;color:#e6db74&#34;&gt;/ruby/io&lt;/span&gt;) }
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;any? { &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;m&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt; m&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(&lt;span style=&#34;color:#e6db74&#34;&gt;/something else/io&lt;/span&gt;) }
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Even weirder. Run directly, it ran as expected. What if I started a new console, and ran the original class in a different order?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;something else&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;ruby&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Matcher&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;something else&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;matches_any?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Ruby!&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Well, that was interesting. It seems like whatever value I sent to &lt;code&gt;matches_any?&lt;/code&gt; was cached for every run after that point, even for &lt;em&gt;brand new objects&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;I looked through the class again. There was no weird memoization. No class-level variables. No thread locals. I was instantiating the class each time I ran &lt;code&gt;matches_any?&lt;/code&gt;. I was out of ideas for the predictable things that cause unpredictable results. What else was there?&lt;/p&gt;
&lt;h3 id=&#34;o-the-humanity&#34;&gt;/o the humanity!&lt;/h3&gt;
&lt;p&gt;With nothing else to investigate, I finally looked up the docs for what the &lt;code&gt;/o&lt;/code&gt; regex modifier does. &lt;code&gt;/o&lt;/code&gt; is referred to as “Interpolation mode”, which sounded pretty harmless. The Ruby docs have a succinct section on the expected behavior: &lt;a href=&#34;https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Interpolation+Mode&#34;&gt;https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Interpolation+Mode&lt;/a&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Modifier o means that the first time a literal regexp with interpolations is encountered, the generated Regexp object is saved and used for all future evaluations of that literal regexp&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Just reading that, I still wasn’t sure I’d expect what I was seeing. It almost sounded like internally Ruby would cache &lt;em&gt;each&lt;/em&gt; different interpolation that comes through. As if it would maybe reuse an internal regex if the same string value was interpolated. They provide a code example that makes it a little clearer:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def letters; sleep 5; /[A-Z][a-z]/; end
words = %w[abc def xyz]
start = Time.now
words.each {|word| word.match(/\A[#{letters}]+\z/) }
Time.now - start # =&amp;gt; 15.0174892

start = Time.now
words.each {|word| word.match(/\A[#{letters}]+\z/o) }
Time.now - start # =&amp;gt; 5.0010866
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The &lt;code&gt;letters&lt;/code&gt; method in the first example is called three times. One time for each iteration, as expected, taking 15 seconds total.&lt;/p&gt;
&lt;p&gt;In the second example, the code iterates three times, but the &lt;code&gt;letters&lt;/code&gt; method is called &lt;em&gt;once&lt;/em&gt;, taking 5 seconds total.&lt;/p&gt;
&lt;p&gt;Knowing that, I went back to the original code with new eyes:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Matcher&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(matchables)
    @matchables &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; matchables
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;matches_any?&lt;/span&gt;(input)
    @matchables&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;any? { &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;m&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
      m&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(
        &lt;span style=&#34;color:#e6db74&#34;&gt;/&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;input&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;/io&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# This is run once, _ever_&lt;/span&gt;
      )
    }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;What that meant for me is that the regular expression inside of &lt;code&gt;matches_any?&lt;/code&gt; was interpolating the &lt;em&gt;first value it &lt;strong&gt;ever&lt;/strong&gt; receives&lt;/em&gt;. Past that point the regex &lt;em&gt;never interpolated another value&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;As it turned out, the developer had read about it as a potential regex optimization. It seemed harmless enough, so they added it. Now that I knew why I was hitting the issue, I removed the &lt;code&gt;/o&lt;/code&gt; and everything started working properly.&lt;/p&gt;
&lt;p&gt;But it still was not clear to me &lt;em&gt;how&lt;/em&gt; this was possible. What in the world is Ruby doing internally? Let’s figure it out together.&lt;/p&gt;
&lt;h3 id=&#34;inside-the-vm&#34;&gt;Inside the VM&lt;/h3&gt;
&lt;p&gt;Sometimes the only way to understand a behavior in Ruby is to drop a bit lower. Let’s disassemble the code into Ruby VM byte code, to see if it gives us any clues. I&amp;rsquo;ll use the &lt;code&gt;DATA&lt;/code&gt; feature to be able to put it into a script directly (you can find more about &lt;a href=&#34;https://ruby-doc.org/3.4.1/Object.html#DATA&#34;&gt;that syntax here&lt;/a&gt;).&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;puts &lt;span style=&#34;color:#66d9ef&#34;&gt;RubyVM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;InstructionSequence&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;compile(&lt;span style=&#34;color:#66d9ef&#34;&gt;DATA&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;disassemble
	
&lt;span style=&#34;color:#75715e&#34;&gt;__END__
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;class Matcher
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  def initialize(matchables)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;    @matchables = matchables
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  end
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;	
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  def matches_any?(input)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;    @matchables.any? { |m| m.match?(/#{input}/io) }
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  end
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;end
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There are a bunch of other instructions you’ll see if you run that code, but we’ll focus on the instructions specific to the block in the &lt;code&gt;matches_any?&lt;/code&gt; method:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# { |m| m.match?(/#{input}/io) }

== disasm: #&amp;lt;ISeq:block in matches_any?@&amp;lt;compiled&amp;gt;:7 (7,21)-(7,51)&amp;gt;
local table (size: 1, argc: 1 [opts: 0, rest: -1, post: 0, block: -1, kw: -1@-1, kwrest: -1])
[ 1] m@0&amp;lt;AmbiguousArg&amp;gt;
0000 getlocal_WC_0                          m@0                       (   7)[LiBc]
0002 once                                   block (2 levels) in matches_any?, &amp;lt;is:0&amp;gt;
0005 opt_send_without_block                 &amp;lt;calldata!mid:match?, argc:1, ARGS_SIMPLE&amp;gt;
0007 leave                                  [Br]
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It describes:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Getting local variable &lt;code&gt;m&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Running the… &lt;code&gt;once&lt;/code&gt; instruction?&lt;/li&gt;
&lt;li&gt;Calling &lt;code&gt;m&lt;/code&gt; with &lt;code&gt;opt_send_without_block&lt;/code&gt; with a method id (&lt;code&gt;mid&lt;/code&gt;) of &lt;code&gt;match?&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Then &lt;code&gt;leave&lt;/code&gt;ing.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;All of these instructions are &lt;em&gt;very&lt;/em&gt; common in the Ruby VM. All except one - the &lt;code&gt;once&lt;/code&gt; instruction. I’ve never heard of that!&lt;/p&gt;
&lt;p&gt;What is &lt;code&gt;once&lt;/code&gt;? It can’t exist almost solely for the purposes of this extremely strange regex modifier, can it? Surely this modifier is not built into the very &lt;em&gt;bones&lt;/em&gt; of Ruby?&lt;/p&gt;
&lt;h3 id=&#34;once-upon-a-time&#34;&gt;&lt;code&gt;once&lt;/code&gt; upon a time&lt;/h3&gt;
&lt;p&gt;If you are ever curious to know how Ruby VM instructions get interpreted in CRuby, there is a central file to all of it called &lt;code&gt;insns.def&lt;/code&gt;. It contains all of the available YARV (Yet Another Ruby VM) instructions in a C-esque format which is compiled into an actual C file as part of building the language.&lt;/p&gt;
&lt;p&gt;In normal program execution, without JIT optimizations applied (like YJIT, ZJIT, MJIT), you can trace how each instruction is executed by reading &lt;code&gt;insns.def&lt;/code&gt;. Let’s look at the &lt;code&gt;once&lt;/code&gt; definition to see what kind of dark magic is being invoked.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;/* run iseq only once */
DEFINE_INSN
once
(ISEQ iseq, ISE ise)
()
(VALUE val)
{
    val = vm_once_dispatch(ec, iseq, ise);
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In essence it’s very simple - it takes the instruction and runs it using &lt;code&gt;vm_once_dispatch&lt;/code&gt;. &lt;code&gt;iseq&lt;/code&gt; is the instruction sequence, and &lt;code&gt;ise&lt;/code&gt; is an “Inline Storage Entry”, which is a place to cache an instruction result. What does &lt;code&gt;vm_once_dispatch&lt;/code&gt; do?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;vm_once_dispatch&lt;/span&gt;(rb_execution_context_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ec, ISEQ iseq, ISE is)
{
    rb_thread_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;th &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_ec_thread_ptr(ec);
    rb_thread_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; RUNNING_THREAD_ONCE_DONE &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_thread_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)(&lt;span style=&#34;color:#ae81ff&#34;&gt;0x1&lt;/span&gt;);
	
  again:
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.running_thread &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; RUNNING_THREAD_ONCE_DONE) {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.value;
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.running_thread &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL) {
        VALUE val;
        is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.running_thread &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; th;
        val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_ensure(vm_once_exec, (VALUE)iseq, vm_once_clear, (VALUE)is);
        &lt;span style=&#34;color:#75715e&#34;&gt;// ... skipped ...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.running_thread &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; RUNNING_THREAD_ONCE_DONE; &lt;span style=&#34;color:#75715e&#34;&gt;/* success */&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; val;
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (is&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;once.running_thread &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; th) {
        &lt;span style=&#34;color:#75715e&#34;&gt;/* recursive once */&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; vm_once_exec((VALUE)iseq);
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
        &lt;span style=&#34;color:#75715e&#34;&gt;/* waiting for finish */&lt;/span&gt;
        RUBY_VM_CHECK_INTS(ec);
        rb_thread_schedule();
        &lt;span style=&#34;color:#66d9ef&#34;&gt;goto&lt;/span&gt; again;
    }
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It’s a little daunting at first look, but broken down it’s just some simple if statements:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;is-&amp;gt;once.running_thread == RUNNING_THREAD_ONCE_DONE&lt;/code&gt;&lt;br&gt;
If the instruction cache is set to &lt;code&gt;RUNNING_THREAD_ONCE_DONE&lt;/code&gt; (a flag to identify being done vs pointing at a thread), it returns its cached value. Forever.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;is-&amp;gt;once.running_thread == NULL&lt;/code&gt;&lt;br&gt;
If there is no running thread, congratulations! Your thread is the winner! You get to decide what value gets cached! Every other thread trying to run this instruction will wait until you have produced a value (by calling &lt;code&gt;vm_once_exec&lt;/code&gt;), then use whatever value you produced. Forever.&lt;/li&gt;
&lt;li&gt;&lt;code&gt;is-&amp;gt;once.running_thread == th&lt;/code&gt;&lt;br&gt;
If there is a running thread set, and it’s the current thread, we’re in a recursive &lt;code&gt;once&lt;/code&gt; (terrifying…)&lt;/li&gt;
&lt;li&gt;Otherwise you are a different thread, and you have to wait. You’ll keep checking for a value immediately, or get rescheduled and check it a little later (once some other threads have been given a slice of time)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;We’ve broken down the logic. Now let’s break down the implications.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;If you use &lt;code&gt;/o&lt;/code&gt; in your regex, the content of the regex will be evaluated &lt;em&gt;once&lt;/em&gt;, &lt;strong&gt;ever&lt;/strong&gt;. Even if it’s inside of an instance method. Even if it’s inside of a loop with a thousand iterations. It will &lt;em&gt;never&lt;/em&gt; be evaluated again. That’s the content you’ve got for your regex. This is why the code I was reviewing was so confusing - even though it lived in an object, and was only created local to that method - it &lt;strong&gt;implicitly created a constant, immutable value&lt;/strong&gt;.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Now we can understand how the original code worked, by evaluating it relative to how the CRuby internals work. The first call we have no value set - past that point we will always use the value literally cached inside of the VM itself:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# We haven&#39;t run the code yet, so in our starting state:
#   is-&amp;gt;once.running_thread == NULL
#   m.match?(/#{input}/io)
&amp;gt; Matcher.new([&amp;quot;Ruby!&amp;quot;]).matches_any?(&amp;quot;something else&amp;quot;)
# Now we&#39;ve run the code once, we&#39;re done!
#   is-&amp;gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
#   m.match?(/something else/io)

&amp;gt; Matcher.new([&amp;quot;Ruby!&amp;quot;]).matches_any?(&amp;quot;ruby&amp;quot;)
#   is-&amp;gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
#   m.match?(/something else/io)

&amp;gt; Matcher.new([&amp;quot;something else&amp;quot;]).matches_any?(&amp;quot;Ruby!&amp;quot;)
#   is-&amp;gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
#   m.match?(/something else/io)
&lt;/code&gt;&lt;/pre&gt;
&lt;ol start=&#34;2&#34;&gt;
&lt;li&gt;If the code has not been run yet, you’re in the starting state of &lt;code&gt;is-&amp;gt;once.running_thread == NULL&lt;/code&gt;. This means two things. One is clear - the first code to run this gets to determine the value set. Two is a little less clear - it is non-deterministic what value will win!&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;If you’ve read any of &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;my series on Ruby concurrency&lt;/a&gt;, you’ll know I love a good thread non-determinism example. Here we create a method containing an “interpolation mode” regex. Then we create five threads, and call the method from each thread. To introduce some context switching, we sleep for random amounts (this could also be caused by IO, or long-running code, alternatively):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def once_and_for_all(input)
  /#{input}/o
end

5.times.map do |i|
  Thread.new { sleep(rand); p once_and_for_all(i) }
end.map(&amp;amp;:join)

# Run it once, it prints /3/ 5 times
# Run it again, it prints /1/ 5 times
# Run it again...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In my first run, I see the regex &lt;code&gt;/3/&lt;/code&gt; printed 5 times. Each run gave me a different result. Run this code several times and you will likely see a different value printed each time. It may repeat at times, but there will be no consistency.&lt;/p&gt;
&lt;p&gt;Quite the behavior! This is pretty close to being a &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs&#34;&gt;Heisenbug&lt;/a&gt;. Non-determinism at its finest.&lt;/p&gt;
&lt;ol start=&#34;3&#34;&gt;
&lt;li&gt;The code has been run already, but now it’s being run again before being set as &lt;code&gt;RUNNING_THREAD_ONCE_DONE&lt;/code&gt;: &lt;code&gt;is-&amp;gt;once.running_thread == th&lt;/code&gt;. This can only happen within the same thread, and it’s there to handle &lt;em&gt;recursion&lt;/em&gt;. It’s hard to imagine using the &lt;code&gt;/o&lt;/code&gt; at all, let alone &lt;em&gt;recursively&lt;/em&gt;. If you were to do that I’d only have one question for you: “Who hurt you?”&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;But let’s do the damn thing. Here’s a recursive case of the &lt;code&gt;/o&lt;/code&gt; modifier. Heaven help us.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def recursive_regex(n)
  puts &amp;quot;Evaluating regex for n=#{n}&amp;quot;
  if n &amp;gt; 0
    /#{recursive_regex(n - 1)}/o # This calls itself recursively
  else
    &amp;quot;base&amp;quot;
  end
end

recursive_regex(5)
# Evaluating regex for n=5
# Evaluating regex for n=4
# Evaluating regex for n=3
# Evaluating regex for n=2
# Evaluating regex for n=1
# Evaluating regex for n=0

recursive_regex(5)
# Evaluating regex for n=5

# Call it with whatever value you want, it&#39;s never running the method recursively again.
recursive_regex(500)
# Evaluating regex for n=500
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;I promise it hurt me more to write that than it did for you to read it.&lt;/p&gt;
&lt;p&gt;This example does bring up an interesting point though! Our examples so far have been simple values interpolated into the regex. In this case, we’re calling a method and it is still only being evaluated once. The operation you run in the interpolation is irrelevant - whatever is interpolated will never be run again with the &lt;code&gt;/o&lt;/code&gt; modifier.&lt;/p&gt;
&lt;ol start=&#34;4&#34;&gt;
&lt;li&gt;If none of the other conditions are met, it means the interpolation is being evaluated by a different thread. All we can do here is wait to see what the value is.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;This is the same concept as the threading example, but this is about the threads that lose the race. The most interesting part here is that it means this interpolation is guaranteed to be thread safe in CRuby (at least in the sense that other threads are locked out of it, not that you get a deterministic result). That will actually matter a little later, if you can believe it.&lt;/p&gt;
&lt;h3 id=&#34;why-does-it-exist&#34;&gt;Why does it exist?&lt;/h3&gt;
&lt;p&gt;I didn’t find a clear origin of this modifier. But it’s been around for 20+ years. It’s likely been around almost as long as Ruby itself. Matz, you scoundrel, you.&lt;/p&gt;
&lt;p&gt;Shockingly, every other forum or blog post I found in relation to the &lt;code&gt;/o&lt;/code&gt; modifier exclusively spoke about the performance benefits of it. I didn’t find any warnings at all. But mostly I found it used in a scripting context. In a single, simple script run, &lt;em&gt;maybe&lt;/em&gt; you’re safe. But the downside seems way worse than any potential upside. If you really need a speed boost - just cache the regex yourself?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;interpolated_once &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;/&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;ARGV&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;first&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;/&lt;/span&gt;
interpolated_once&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;a string&amp;#34;&lt;/span&gt;)
interpolated_once&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;a string 2&amp;#34;&lt;/span&gt;)
interpolated_once&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;match?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;a string 3&amp;#34;&lt;/span&gt;)
&lt;span style=&#34;color:#75715e&#34;&gt;# Just as performant as /o?&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The earliest reference I could find to the existence of a &lt;code&gt;/o&lt;/code&gt; modifier in regex was in a &lt;a href=&#34;https://www.perlmonks.org/?node_id=256053&#34;&gt;2003 post on a &lt;strong&gt;Perl&lt;/strong&gt; forum&lt;/a&gt;. Based on this, I’m guessing Ruby borrowed it from Perl. Even in 2003, the prevailing wisdom was already clear:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The best heuristic is: Never use &lt;code&gt;/o&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I agree with that 2003 Perl person. You don’t need it. You should not use it. Step away from the modifier, and no one has to get hurt.&lt;/p&gt;
&lt;h3 id=&#34;ill-take-your-once-and-raise-you&#34;&gt;I’ll take your &lt;code&gt;once&lt;/code&gt;, and raise you…&lt;/h3&gt;
&lt;p&gt;We’re all having fun here, right? Should we take this confounding modifier, and muddy things a bit more?&lt;/p&gt;
&lt;p&gt;There &lt;em&gt;is&lt;/em&gt; actually a way to force the &lt;code&gt;/o&lt;/code&gt; modifier to re-evaluate.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;hushed whispers!&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;someone in the back of the room faints!&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;I stumbled upon it because I was testing this code in a Rails console. I found that every time I called &lt;code&gt;reload!&lt;/code&gt;, I was able to test my method from scratch again. Why would that be?&lt;/p&gt;
&lt;p&gt;It’s because when you &lt;code&gt;reload!&lt;/code&gt;, all of your code is re-evaluated by Ruby. And when it gets re-evaluated, you get new bytecode, and a new cache! Take this for instance:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def one_time(input)
  /#{input}/o
end

p one_time(&amp;quot;hi there!&amp;quot;)

def one_time(input)
  /#{input}/o
end

p one_time(&amp;quot;how are you?&amp;quot;)
# /hi there!/
# /how are you?/
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Even though we used the &lt;code&gt;/o&lt;/code&gt; modifier, we were able to re-evaluate it. That’s because we overwrote our &lt;code&gt;one_time&lt;/code&gt; implementation, which gave us a new &lt;code&gt;once&lt;/code&gt; cache.&lt;/p&gt;
&lt;p&gt;This matters because it means you can’t even &lt;em&gt;truly&lt;/em&gt; guarantee your &lt;code&gt;/o&lt;/code&gt; regex will only be run once. If a piece of monkey patch code gets loaded and overwrites your method, you will get even &lt;em&gt;more&lt;/em&gt; non-deterministic behavior.&lt;/p&gt;
&lt;h3 id=&#34;the-inmates-are-running-the-asylum&#34;&gt;The inmates are running the asylum&lt;/h3&gt;
&lt;p&gt;In a moment of pure serendipity, the day after I started writing this post I happened to read Jared Norman’s &lt;a href=&#34;https://jardo.dev/code-reloading-for-rack-apps&#34;&gt;Code Reloading for Rack Apps&lt;/a&gt;. The article teaches you how to build the same kind of code reloading Rails has, but in a standalone Rack application. You should give it a read.&lt;/p&gt;
&lt;p&gt;In it, he creates a class called &lt;code&gt;Once&lt;/code&gt;.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;This class let’s you create an object that encapsulates a bit of code and only ever lets it run once, even if it’s called across multiple threads.&lt;/p&gt;
&lt;p&gt;-Jared Norman&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Once&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;block)
    @block &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; block
    @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;

  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;call&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;unless&lt;/span&gt; @mutex

      @block&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call

      @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
o &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Once&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;I should only happen once&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map { &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { o&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call } }&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:join&lt;/span&gt;)
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; I should only happen once&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Any thread attempting to run this code will block on the &lt;code&gt;@mutex.synchronize&lt;/code&gt;. It runs &lt;code&gt;@block.call&lt;/code&gt;, then at the end of the &lt;code&gt;synchronize&lt;/code&gt; block, the &lt;code&gt;@mutex&lt;/code&gt; is set to &lt;code&gt;nil&lt;/code&gt;. This means that after the first thread, every other thread either exits early because &lt;code&gt;@mutex&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;, or never runs the &lt;code&gt;synchronize&lt;/code&gt; block at all because of the safe-nav. Makes sense!&lt;/p&gt;
&lt;p&gt;As if I was destined to read this by some kind of trickster regex god, what did I see at the end of the article? A reimplementation of his &lt;code&gt;Once&lt;/code&gt; class using the &lt;code&gt;/o&lt;/code&gt; modifier!&lt;/p&gt;
&lt;p&gt;A veritable devil on Jared’s shoulder, &lt;a href=&#34;https://www.johnhawthorn.com/&#34;&gt;John Hawthorn&lt;/a&gt; gave him the idea. It would make sense that John, a Ruby internals wizard, would suggest such a thing. Then Jared, author of &lt;a href=&#34;https://jardo.dev/advent-of-criminally-bad-ruby-code&#34;&gt;Advent of Criminally Bad Ruby Code&lt;/a&gt;, would actually decide to include it.&lt;/p&gt;
&lt;p&gt;Here is their ingenious abomination.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Once
  def initialize(&amp;amp;block)
    @block = block
  end

  def call
    /#{@block.call}/o # THE HORROR
  end
end

o = Once.new do
  puts &amp;quot;I should only happen once&amp;quot;
end

100.times.map { Thread.new { o.call } }.each(&amp;amp;:join)
# =&amp;gt; I should only happen once
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Using &lt;code&gt;/o&lt;/code&gt;, the code example still behaves correctly: the block only runs once despite the many threads attempting to run it.&lt;/p&gt;
&lt;p&gt;It does, however, have a fatal flaw in comparison to the original code. Can you guess it?&lt;/p&gt;
&lt;p&gt;Let’s demonstrate by creating multiple instances:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;o1 = Once.new { puts &amp;quot;I should only happen once 1&amp;quot; }
o2 = Once.new { puts &amp;quot;I should only happen once 2&amp;quot; }
o3 = Once.new { puts &amp;quot;I should only happen once 3&amp;quot; }

100.times.map { Thread.new { o1.call } }.each(&amp;amp;:join)
100.times.map { Thread.new { o2.call } }.each(&amp;amp;:join)
100.times.map { Thread.new { o3.call } }.each(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In the original, mutex-based implementation, we’d see the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# =&amp;gt; I should only happen once 1
# =&amp;gt; I should only happen once 2
# =&amp;gt; I should only happen once 3
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In the &lt;code&gt;/o&lt;/code&gt; based implementation, we see this:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# =&amp;gt; I should only happen once 1
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Because of the instruction-level caching, we can only have &lt;em&gt;one&lt;/em&gt; result from &lt;code&gt;Once#call&lt;/code&gt;, &lt;strong&gt;ever&lt;/strong&gt;. A class that literally can only be run reliably a &lt;em&gt;single&lt;/em&gt; time. “Once”, indeed.&lt;/p&gt;
&lt;p&gt;It’s probably more efficient than the mutex approach and &lt;strong&gt;maybe&lt;/strong&gt; in some bizarro world where you truly needed a piece of code to be thread safe and lazy evaluated and as efficient up front as possible and only ever run once… maybe…&lt;/p&gt;
&lt;p&gt;No… I cannot condone it - &lt;strong&gt;&lt;em&gt;don’t do it!&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Side note: Jared also has an excellent podcast called &lt;a href=&#34;https://shows.acast.com/dead-code&#34;&gt;Dead Code&lt;/a&gt;, with loads of episodes with fantastic guests. I went on to &lt;a href=&#34;https://shows.acast.com/dead-code/episodes/violent-sleep-of-concurrency&#34;&gt;speak about concurrency - give it a listen!&lt;/a&gt;&lt;/p&gt;
&lt;h3 id=&#34;this-is-the-end&#34;&gt;This is the &lt;code&gt;END&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;Earlier, I said:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;What is &lt;code&gt;once&lt;/code&gt;? It can’t exist almost solely for the purposes of this extremely strange regex modifier, can it? Surely this modifier is not built into the very &lt;em&gt;bones&lt;/em&gt; of Ruby?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And the truth is, &lt;code&gt;/o&lt;/code&gt; is not the &lt;em&gt;only&lt;/em&gt; reason the &lt;code&gt;once&lt;/code&gt; instruction exists. It took a little digging, but I found another piece of Ruby code that uses &lt;code&gt;once&lt;/code&gt; - the &lt;a href=&#34;https://docs.ruby-lang.org/en/3.4/syntax/miscellaneous_rdoc.html#label-BEGIN+and+END&#34;&gt;&lt;code&gt;END&lt;/code&gt; language syntax&lt;/a&gt;. I was surprised to find that &lt;code&gt;END&lt;/code&gt; existed! I had never heard of it or used it before.&lt;/p&gt;
&lt;p&gt;Not to be confused with the &lt;code&gt;end&lt;/code&gt; keyword that closes out blocks, &lt;code&gt;END&lt;/code&gt; defines a block that is run at the end of the Ruby program. For instance:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;END&lt;/span&gt; { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;do this last please&amp;#34;&lt;/span&gt; }
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;1&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;2&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; do this last please&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Note that if you run this code in &lt;code&gt;irb&lt;/code&gt;, the &lt;code&gt;END&lt;/code&gt; block will not run until you exit. &lt;code&gt;irb&lt;/code&gt; is just a Ruby program, and it doesn’t &lt;code&gt;END&lt;/code&gt; until &lt;code&gt;irb&lt;/code&gt; is exited.&lt;/p&gt;
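&lt;p&gt;As an aside: if you genuinely want run-this-at-exit behavior, &lt;code&gt;at_exit&lt;/code&gt; is the conventional tool. Unlike &lt;code&gt;END&lt;/code&gt;, it’s an ordinary method, so it can be registered anywhere (Ruby itself will point you toward &lt;code&gt;at_exit&lt;/code&gt; if you try &lt;code&gt;END&lt;/code&gt; inside a method):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;at_exit { puts &amp;quot;do this last please&amp;quot; }
puts &amp;quot;1&amp;quot;
puts &amp;quot;2&amp;quot;

# =&amp;gt; 1
# =&amp;gt; 2
# =&amp;gt; do this last please
&lt;/code&gt;&lt;/pre&gt;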
&lt;p&gt;If we disassemble this code, we’ll see a familiar instruction:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;puts &lt;span style=&#34;color:#66d9ef&#34;&gt;RubyVM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;InstructionSequence&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;compile(&lt;span style=&#34;color:#66d9ef&#34;&gt;DATA&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;disassemble
	
&lt;span style=&#34;color:#75715e&#34;&gt;__END__
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;END { puts &amp;#34;do this last please&amp;#34; }
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;puts &amp;#34;1&amp;#34;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;puts &amp;#34;2&amp;#34;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;	
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# == disasm: #&amp;lt;ISeq:&amp;lt;compiled&amp;gt;@&amp;lt;compiled&amp;gt;:1 (1,0)-(4,8)&amp;gt;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# 0000 once          block in &amp;lt;compiled&amp;gt;, &amp;lt;is:0&amp;gt;(   2)
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;So the &lt;code&gt;END&lt;/code&gt; block, but not the &lt;code&gt;end&lt;/code&gt; of a block, uses the &lt;code&gt;once&lt;/code&gt; instruction, an instruction that is used twice in Ruby. An appropriately confusing close to an article about a fascinating feature I think you should never use - the &lt;code&gt;/o&lt;/code&gt; modifier.&lt;/p&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2025/2ca9b9c306.png)

&gt; Your code using the `/o` modifier
&gt; 
&gt; Source: [wikipedia](https://upload.wikimedia.org/wikipedia/commons/1/1c/Hindenburg_disaster.jpg)

Hi there! Do you like Regex? Do you like performance? Do you like creating confounding bugs for yourself rooted in the mechanics of the Ruby VM itself?

If you said yes to all of the above, have I got a feature for you!

But first, let’s start with a story.

### The cliffs of insanity

I was recently reviewing some code, and part of the functionality was about matching. A class took an array of strings, and you could call a method to see if an input matched part of any of the strings. Stripped down, it was effectively the following code:

	class Matcher
	  def initialize(matchables)
	    @matchables = matchables
	  end
	
	  def matches_any?(input)
	    @matchables.any? { |m| m.match?(/#{input}/io) }
	  end
	end

&gt; I _know_ there are some of you reading this code and thinking “does this really need a regex?”, “couldn’t it just use `include?` and some downcasing?”, “does this even need to exist?”, etc, etc. I see you, I hear you, I’d probably think the same, and I _promise_ you the specifics of this method aren’t that important.

Functionally, the code looked _ok_ to me. I knew what `/i` was (a regex in [case-insensitive mode](https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Case-Insensitive+Mode)), but I didn’t recognize `/o`. It didn’t seem critically important to look up yet. Tests were not exhaustive but were green, and so I went to run the code in a console:

```ruby
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;ruby&#34;)
=&gt; true
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;something else&#34;)
=&gt; true
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;javascript&#34;)
=&gt; true
```
Well, that seemed broken. Assuming it was a bug, I looked at the code to see what was wrong. But nothing stuck out. The code _seemed_ ok, and simple:

	@matchables.any? { |m| m.match?(/#{input}/io) }

So I deconstructed the code and ran it directly, outside the class:

```ruby
[&#34;Ruby!&#34;].any? { |m| m.match?(/ruby/io) }
=&gt; true
[&#34;Ruby!&#34;].any? { |m| m.match?(/something else/io) }
=&gt; false
```
Even weirder. Run directly, it ran as expected. What if I started a new console, and ran the original class in a different order?

```ruby
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;something else&#34;)
=&gt; false
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;ruby&#34;)
=&gt; false
Matcher.new([&#34;something else&#34;]).matches_any?(&#34;Ruby!&#34;)
=&gt; true
```
Well, that was interesting. It seems like whatever value I sent to `matches_any?` was cached for every run after that point, even for _brand new objects_. 

I looked through the class again. There was no weird memoization. No class-level variables. No thread locals. I was instantiating the class each time I ran `matches_any?`. I was out of ideas for the predictable things that cause unpredictable results. What else was there?

### /o the humanity!

With nothing else to investigate, I finally looked up the docs for what the `/o` regex modifier does. `/o` is referred to as “Interpolation mode”, which sounded pretty harmless. The Ruby docs have a succinct section on the expected behavior: [https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Interpolation+Mode](https://ruby-doc.org/3.4.1/Regexp.html#class-Regexp-label-Interpolation+Mode)

&gt; Modifier o means that the first time a literal regexp with interpolations is encountered, the generated Regexp object is saved and used for all future evaluations of that literal regexp

Just reading that, I still wouldn’t have predicted what I was seeing. It almost sounded like Ruby would internally cache _each_ different interpolation that comes through - as if it would reuse an internal regex whenever the same string value was interpolated. The docs provide a code example that makes it a little clearer:

	def letters; sleep 5; /[A-Z][a-z]/; end
	words = %w[abc def xyz]
	start = Time.now
	words.each {|word| word.match(/\A[#{letters}]+\z/) }
	Time.now - start # =&gt; 15.0174892
	
	start = Time.now
	words.each {|word| word.match(/\A[#{letters}]+\z/o) }
	Time.now - start # =&gt; 5.0010866

The `letters` method in the first example is called three times. One time for each iteration, as expected, taking 15 seconds total.

In the second example, the code iterates three times, but the `letters` method is called _once_, taking 5 seconds total.

Knowing that, I went back to the original code with new eyes:

```ruby
class Matcher
  def initialize(matchables)
    @matchables = matchables
  end
	
  def matches_any?(input)
    @matchables.any? { |m|
      m.match?(
        /#{input}/io # This is run once, _ever_
      )
    }
  end
end
```
What that meant for me is that the regular expression inside of `matches_any?` interpolated the _first value it **ever** received_. Past that point, the regex _never interpolated another value_.

As it turned out, the developer had read about it as a potential regex optimization. It seemed harmless enough, so they added it. Now that I knew why I was hitting the issue, I removed the `/o` and everything started working properly. 
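
For reference, here’s the corrected version - the same class with the `/o` dropped, so the interpolation happens on every call (a minimal sketch of that fix):

```ruby
class Matcher
  def initialize(matchables)
    @matchables = matchables
  end

  def matches_any?(input)
    # No /o: the regex is rebuilt, and input re-interpolated, on every call
    @matchables.any? { |m| m.match?(/#{input}/i) }
  end
end

Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;something else&#34;) # =&gt; false
Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;ruby&#34;)           # =&gt; true
```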

But it still was not clear to me _how_ this was possible. What in the world is Ruby doing internally? Let’s figure it out together.

### Inside the VM

Sometimes the only way to understand a behavior in Ruby is to drop a bit lower. Let’s disassemble the code into Ruby VM byte code, to see if it gives us any clues. I&#39;ll use the `DATA` feature to be able to put it into a script directly (you can find more about [that syntax here](https://ruby-doc.org/3.4.1/Object.html#DATA)).

```ruby
puts RubyVM::InstructionSequence.compile(DATA.read).disassemble
	
__END__
class Matcher
  def initialize(matchables)
    @matchables = matchables
  end
	
  def matches_any?(input)
    @matchables.any? { |m| m.match?(/#{input}/io) }
  end
end
```
There are a bunch of other instructions you’ll see if you run that code, but we’ll focus on the instructions specific to the block in the `matches_any?` method:

	# { |m| m.match?(/#{input}/io) }
	
	== disasm: #&lt;ISeq:block in matches_any?@&lt;compiled&gt;:7 (7,21)-(7,51)&gt;
	local table (size: 1, argc: 1 [opts: 0, rest: -1, post: 0, block: -1, kw: -1@-1, kwrest: -1])
	[ 1] m@0&lt;AmbiguousArg&gt;
	0000 getlocal_WC_0                          m@0                       (   7)[LiBc]
	0002 once                                   block (2 levels) in matches_any?, &lt;is:0&gt;
	0005 opt_send_without_block                 &lt;calldata!mid:match?, argc:1, ARGS_SIMPLE&gt;
	0007 leave                                  [Br]
	

It describes:

- Getting local variable `m`
- Running the… `once` instruction?
- Calling `m` with `opt_send_without_block` with a method id (`mid`) of `match?`
- Then `leave`ing. 

All of these instructions are _very_ common in the Ruby VM. All except one - the `once` instruction. I’ve never heard of that!

What is `once`? It can’t exist almost solely for the purposes of this extremely strange regex modifier, can it? Surely this modifier is not built into the very _bones_ of Ruby?

### `once` upon a time

If you are ever curious to know how Ruby VM instructions get interpreted in CRuby, there is a central file to all of it called `insns.def`. It contains all of the available YARV (Yet Another Ruby VM) instructions in a C-esque format which is compiled into an actual C file as part of building the language.

In normal program execution, without JIT optimizations applied (like YJIT, ZJIT, MJIT), you can trace how each instruction is executed by reading `insns.def`. Let’s look at the `once` definition to see what kind of dark magic is being invoked.

	/* run iseq only once */
	DEFINE_INSN
	once
	(ISEQ iseq, ISE ise)
	()
	(VALUE val)
	{
	    val = vm_once_dispatch(ec, iseq, ise);
	}

In essence it’s very simple - it takes the instruction and runs it using `vm_once_dispatch`. `iseq` is the instruction sequence, and `ise` is an “Inline Storage Entry”, which is a place to cache an instruction result. What does `vm_once_dispatch` do?

```c
static VALUE
vm_once_dispatch(rb_execution_context_t *ec, ISEQ iseq, ISE is)
{
    rb_thread_t *th = rb_ec_thread_ptr(ec);
    rb_thread_t *const RUNNING_THREAD_ONCE_DONE = (rb_thread_t *)(0x1);
	
  again:
    if (is-&gt;once.running_thread == RUNNING_THREAD_ONCE_DONE) {
        return is-&gt;once.value;
    }
    else if (is-&gt;once.running_thread == NULL) {
        VALUE val;
        is-&gt;once.running_thread = th;
        val = rb_ensure(vm_once_exec, (VALUE)iseq, vm_once_clear, (VALUE)is);
        // ... skipped ...
        is-&gt;once.running_thread = RUNNING_THREAD_ONCE_DONE; /* success */
        return val;
    }
    else if (is-&gt;once.running_thread == th) {
        /* recursive once */
        return vm_once_exec((VALUE)iseq);
    }
    else {
        /* waiting for finish */
        RUBY_VM_CHECK_INTS(ec);
        rb_thread_schedule();
        goto again;
    }
}
```
It’s a little daunting at first look, but broken down it’s just some simple if statements:

1. `is-&gt;once.running_thread == RUNNING_THREAD_ONCE_DONE`  
	If the instruction cache is set to `RUNNING_THREAD_ONCE_DONE` (a flag to identify being done vs pointing at a thread), it returns its cached value. Forever. 
2. `is-&gt;once.running_thread == NULL`  
	If there is no running thread, congratulations! Your thread is the winner! You get to decide what value gets cached! Every other thread trying to run this instruction will wait until you have produced a value (by calling `vm_once_exec`), then use whatever value you produced. Forever.
3. `is-&gt;once.running_thread == th`  
	If there is a running thread set, and it’s the current thread, we’re in a recursive `once` (terrifying…)
4. Otherwise you are a different thread, and you have to wait. You’ll keep checking for a value immediately, or get rescheduled and check it a little later (once some other threads have been given a slice of time)
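
If it helps to see those four branches in Ruby terms, here’s a rough analogy I sketched. To be clear, this is _not_ the real implementation - the real logic is C inside the VM, leaning on the GVL rather than a `Mutex`, with one cache per compiled `once` instruction:

```ruby
class OnceSite
  DONE = Object.new # stands in for RUNNING_THREAD_ONCE_DONE

  def initialize(&amp;block)
    @block = block
    @lock = Mutex.new # the VM leans on the GVL instead
    @running_thread = nil
    @value = nil
  end

  def dispatch
    loop do
      # Branch 3: a recursive call from the thread currently computing
      return @block.call if @running_thread == Thread.current

      claimed = false
      @lock.synchronize do
        if @running_thread.equal?(DONE)
          return @value # Branch 1: the cached value. Forever.
        elsif @running_thread.nil?
          @running_thread = Thread.current # Branch 2: we won the race
          claimed = true
        end
      end

      if claimed
        begin
          value = @block.call
        rescue Exception
          @lock.synchronize { @running_thread = nil } # like vm_once_clear
          raise
        end
        @lock.synchronize do
          @value = value
          @running_thread = DONE # success
        end
        return value
      end

      Thread.pass # Branch 4: another thread is computing - wait and retry
    end
  end
end
```
The `rescue` roughly mirrors the `rb_ensure(vm_once_exec, ..., vm_once_clear, ...)` call in the C version: if the block raises, the claim is released so another thread can try again.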

We’ve broken down the logic. Now let’s break down the implications.

1. If you use `/o` in your regex, the content of the regex will be evaluated _once_, **ever**. Even if it’s inside of an instance method. Even if it’s inside of a loop with a thousand iterations. It will _never_ be evaluated again. That’s the content you’ve got for your regex. This is why the code I was reviewing was so confusing - even though it lived in an object, and was only created local to that method - it **implicitly created a constant, immutable value**. 

Now we can understand how the original code worked, by evaluating it relative to how the CRuby internals work. On the first call, no value is set - past that point, we always use the value literally cached inside the VM itself:

	# We haven&#39;t run the code yet, so in our starting state:
	#   is-&gt;once.running_thread == NULL
	#   m.match?(/#{input}/io)
	&gt; Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;something else&#34;)
	# Now we&#39;ve run the code once, we&#39;re done!
	#   is-&gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
	#   m.match?(/something else/io)
	
	&gt; Matcher.new([&#34;Ruby!&#34;]).matches_any?(&#34;ruby&#34;)
	#   is-&gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
	#   m.match?(/something else/io)
	
	&gt; Matcher.new([&#34;something else&#34;]).matches_any?(&#34;Ruby!&#34;)
	#   is-&gt;once.running_thread == RUNNING_THREAD_ONCE_DONE
	#   m.match?(/something else/io)

2. If the code has not been run yet, you’re in the starting state of `is-&gt;once.running_thread == NULL`. This means two things. One is clear - the first code to run this gets to determine the value set. Two is a little less clear - it is non-deterministic what value will win!

If you’ve read any of [my series on Ruby concurrency](https://jpcamara.com/2024/06/04/your-ruby-programs.html), you’ll know I love a good thread non-determinism example. Here we create a method containing an “interpolation mode” regex. Then we create five threads, and call the method from each thread. To introduce some context switching, we sleep for random amounts (this could also be caused by IO, or long-running code, alternatively):

	def once_and_for_all(input)
	  /#{input}/o
	end
	
	5.times.map do |i|
	  Thread.new { sleep(rand); p once_and_for_all(i) }
	end.map(&amp;:join)
	
	# Run it once, it prints /3/ 5 times
	# Run it again, it prints /1/ 5 times
	# Run it again...
	

In my first run, I saw the regex `/3/` printed 5 times. Each subsequent run gave me a different result. Run this code several times yourself and you will likely see a different value printed each time - it may repeat occasionally, but there will be no consistency.

Quite the behavior! This is pretty close to being a [Heisenbug](https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs). Non-determinism at its finest.

3. The code has been run already, but now it’s being run again before being set as `RUNNING_THREAD_ONCE_DONE`: `is-&gt;once.running_thread == th`. This can only happen within the same thread, and it’s there to handle _recursion_. It’s hard to imagine using the `/o` at all, let alone _recursively_. If you were to do that I’d only have one question for you: “Who hurt you?”

But let’s do the damn thing. Here’s a recursive case of the `/o` modifier. Heaven help us.

	def recursive_regex(n)
	  puts &#34;Evaluating regex for n=#{n}&#34;
	  if n &gt; 0
	    /#{recursive_regex(n - 1)}/o # This calls itself recursively
	  else
	    &#34;base&#34;
	  end
	end
	
	recursive_regex(5)
	# Evaluating regex for n=5
	# Evaluating regex for n=4
	# Evaluating regex for n=3
	# Evaluating regex for n=2
	# Evaluating regex for n=1
	# Evaluating regex for n=0
	
	recursive_regex(5)
	# Evaluating regex for n=5
	
	# Call it with whatever value you want, it&#39;s never running the method recursively again.
	recursive_regex(500)
	# Evaluating regex for n=500

I promise it hurt me more to write that than it did for you to read it.

This example does bring up an interesting point though! Our examples so far have been simple values interpolated into the regex. In this case, we’re calling a method and it is still only being evaluated once. The operation you run in the interpolation is irrelevant - whatever is interpolated will never be run again with the `/o` modifier.

4. If none of the other conditions are met, it means the interpolation is being evaluated by a different thread. All we can do here is wait to see what the value is.

This is the same concept as the threading example, but this is about the threads that lose the race. The most interesting part here is that it means this interpolation is guaranteed to be thread safe in CRuby (at least in the sense that other threads are locked out of it, not that you get a deterministic result). That will actually matter a little later, if you can believe it.
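
To see that lock-out in action, here’s a quick experiment (my own sketch - `expensive_input` is just a stand-in for slow work):

```ruby
def expensive_input
  sleep(1) # simulate something slow
  &#34;expensive&#34;
end

def slow_once
  /#{expensive_input}/o
end

start = Time.now
regexes = 10.times.map { Thread.new { slow_once } }.map(&amp;:value)
regexes.map(&amp;:object_id).uniq.size # =&gt; 1, every thread got the same cached Regexp
Time.now - start # =&gt; ~1.0, the interpolation ran once while the other threads waited
```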

### Why does it exist?

I didn’t find a clear origin of this modifier. But it’s been around for 20+ years. It’s likely been around almost as long as Ruby itself. Matz, you scoundrel, you.

Shockingly, every forum or blog post I found about the `/o` modifier spoke exclusively about its performance benefits. I didn’t find any warnings at all. Mostly I found it used in a scripting context - in a single, simple script run, _maybe_ you’re safe. But the downside seems way worse than any potential upside. If you really need a speed boost, just cache the regex yourself:

```ruby
interpolated_once = /#{ARGV.first}/
interpolated_once.match?(&#34;a string&#34;)
interpolated_once.match?(&#34;a string 2&#34;)
interpolated_once.match?(&#34;a string 3&#34;)
# Just as performant as /o?
```
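And if the input genuinely varies at runtime - like the `Matcher` from earlier - you can memoize per value instead. Here’s a sketch, with a hypothetical `@regexes` cache (wrap access in a `Mutex` if instances are shared across threads):

```ruby
class Matcher
  def initialize(matchables)
    @matchables = matchables
    @regexes = {} # hypothetical per-instance cache, keyed by input
  end

  def matches_any?(input)
    # Each distinct input builds its regex once, then reuses it
    regex = (@regexes[input] ||= /#{input}/i)
    @matchables.any? { |m| m.match?(regex) }
  end
end
```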
The earliest reference I could find to the existence of a `/o` modifier in regex was in a [2003 post on a **Perl** forum](https://www.perlmonks.org/?node_id=256053). Based on this, I’m guessing Ruby borrowed it from Perl. Even in 2003, the prevailing wisdom was already clear:

&gt; The best heuristic is: Never use `/o`

I agree with that 2003 Perl person. You don’t need it. You should not use it. Step away from the modifier, and no one has to get hurt.

### I’ll take your `once`, and raise you…
We’re all having fun here, right? Should we take this confounding modifier, and muddy things a bit more?

There _is_ actually a way to force the `/o` modifier to re-evaluate. 

*hushed whispers!*

*someone in the back of the room faints!*

I stumbled upon it because I was testing this code in a Rails console. I found that every time I called `reload!`, I was able to test my method from scratch again. Why would that be?

It’s because when you `reload!`, all of your code is re-evaluated by Ruby. And when it gets re-evaluated, you get new bytecode, and a new cache! Take this for instance:

	def one_time(input)
	  /#{input}/o
	end
	
	p one_time(&#34;hi there!&#34;)
	
	def one_time(input)
	  /#{input}/o
	end
	
	p one_time(&#34;how are you?&#34;)
	# /hi there!/
	# /how are you?/

Even though we used the `/o` modifier, we were able to re-evaluate it. That’s because we overwrote our `one_time` implementation, which gave us a new `once` cache.

This matters because it means you can’t even _truly_ guarantee your `/o` regex will only be run once. If a piece of monkey patch code gets loaded and overwrites your method, you will get even _more_ non-deterministic behavior. 

### The inmates are running the asylum

In a moment of pure serendipity, the day after I started writing this post I happened to read Jared Norman’s [Code Reloading for Rack Apps](https://jardo.dev/code-reloading-for-rack-apps). The article teaches you how to build the same kind of code reloading Rails has, but in a standalone Rack application. You should give it a read.

In it, he creates a class called `Once`.

&gt; This class lets you create an object that encapsulates a bit of code and only ever lets it run once, even if it’s called across multiple threads.
&gt; 
&gt; -Jared Norman

```ruby
class Once
  def initialize(&amp;block)
    @block = block
    @mutex = Mutex.new
  end

  def call
    @mutex&amp;.synchronize do
      return unless @mutex

      @block.call

      @mutex = nil
    end
  end
end
	
o = Once.new do
  puts &#34;I should only happen once&#34;
end
	
100.times.map { Thread.new { o.call } }.each(&amp;:join)
# =&gt; I should only happen once
```
Any thread attempting to run this code will block on the `@mutex.synchronize`. It runs `@block.call`, then at the end of the `synchronize` block, the `@mutex` is set to `nil`. This means that after the first thread, every other thread either exits early because `@mutex` is `nil`, or never runs the `synchronize` block at all because of the safe navigation operator (`&amp;.`). Makes sense!

As if I was destined to read this by some kind of trickster regex god, what did I see at the end of the article? A reimplementation of his `Once` class using the `/o` modifier!

A veritable devil on Jared’s shoulder, [John Hawthorn](https://www.johnhawthorn.com/) gave him the idea. It would make sense that John, a Ruby internals wizard, would suggest such a thing. Then Jared, author of [Advent of Criminally Bad Ruby Code](https://jardo.dev/advent-of-criminally-bad-ruby-code), would actually decide to include it.

Here is their ingenious abomination.

	class Once
	  def initialize(&amp;block)
	    @block = block
	  end
	
	  def call
	    /#{@block.call}/o # THE HORROR
	  end
	end
	
	o = Once.new do
	  puts &#34;I should only happen once&#34;
	end
	
	100.times.map { Thread.new { o.call } }.each(&amp;:join)
	# =&gt; I should only happen once

Using `/o`, the code example still behaves correctly: the block only runs once despite the many threads attempting to run it.

It does, however, have a fatal flaw in comparison to the original code. Can you guess it?

Let’s demonstrate by creating multiple instances:

	o1 = Once.new { puts &#34;I should only happen once 1&#34; }
	o2 = Once.new { puts &#34;I should only happen once 2&#34; }
	o3 = Once.new { puts &#34;I should only happen once 3&#34; }
	
	100.times.map { Thread.new { o1.call } }.each(&amp;:join)
	100.times.map { Thread.new { o2.call } }.each(&amp;:join)
	100.times.map { Thread.new { o3.call } }.each(&amp;:join)

In the original, mutex-based implementation, we’d see the following:

	# =&gt; I should only happen once 1
	# =&gt; I should only happen once 2
	# =&gt; I should only happen once 3

In the `/o` based implementation, we see this:

	# =&gt; I should only happen once 1

Because of the instruction-level caching, we can only have _one_ result from `Once#call`, **ever**. A class that literally can only be run reliably a _single_ time. “Once”, indeed.

It’s probably more efficient than the mutex approach and **maybe** in some bizarro world where you truly needed a piece of code to be thread safe and lazy evaluated and as efficient up front as possible and only ever run once… maybe…

No… I cannot condone it - **_don’t do it!_**

Side note: Jared also has an excellent podcast called [Dead Code](https://shows.acast.com/dead-code), with loads of episodes with fantastic guests. I went on to [speak about concurrency - give it a listen!](https://shows.acast.com/dead-code/episodes/violent-sleep-of-concurrency)

### This is the `END`

Earlier, I said:

&gt; What is `once`? It can’t exist almost solely for the purposes of this extremely strange regex modifier, can it? Surely this modifier is not built into the very _bones_ of Ruby?

And the truth is, `/o` is not the _only_ reason the `once` instruction exists. It took a little digging, but I found another piece of Ruby code that uses `once` - the [`END` language syntax](https://docs.ruby-lang.org/en/3.4/syntax/miscellaneous_rdoc.html#label-BEGIN+and+END). I was surprised to find that `END` existed! I had never heard of it or used it before.

Not to be confused with the `end` keyword that closes out blocks, `END` defines a block that is run at the end of the Ruby program. For instance:

```ruby
END { puts &#34;do this last please&#34; }
puts &#34;1&#34;
puts &#34;2&#34;
	
# =&gt; 1
# =&gt; 2
# =&gt; do this last please
```
Note that if you run this code in `irb`, the `END` block will not run until you exit. `irb` is just a Ruby program, and it doesn’t `END` until `irb` is exited.
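
As an aside: if you genuinely want run-this-at-exit behavior, `at_exit` is the conventional tool. Unlike `END`, it’s an ordinary method, so it can be registered anywhere (Ruby itself will point you toward `at_exit` if you try `END` inside a method):

```ruby
at_exit { puts &#34;do this last please&#34; }
puts &#34;1&#34;
puts &#34;2&#34;

# =&gt; 1
# =&gt; 2
# =&gt; do this last please
```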

If we disassemble this code, we’ll see a familiar instruction:

```ruby
puts RubyVM::InstructionSequence.compile(DATA.read).disassemble
	
__END__
END { puts &#34;do this last please&#34; }
puts &#34;1&#34;
puts &#34;2&#34;
	
# == disasm: #&lt;ISeq:&lt;compiled&gt;@&lt;compiled&gt;:1 (1,0)-(4,8)&gt;
# 0000 once          block in &lt;compiled&gt;, &lt;is:0&gt;(   2)
```
So the `END` block, but not the `end` of a block, uses the `once` instruction - an instruction used in exactly two places in Ruby. An appropriately confusing close to an article about a fascinating feature I think you should never use - the `/o` modifier.
</source:markdown>
    </item>
    
    <item>
      <title>A silly optimization: adding opt_respond_to to the Ruby VM, part 6</title>
      <link>https://jpcamara.com/2025/01/04/a-silly-optimization-adding-optrespondto.html</link>
      <pubDate>Mon, 06 Jan 2025 23:34:49 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2025/01/04/a-silly-optimization-adding-optrespondto.html</guid>
      <description>&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/12/30/defining-an-instruction-adding-optrespondto.html&#34;&gt;part 5&lt;/a&gt;, we finally got our new instruction defined and outputting as part of our bytecode. If you didn’t run it yourself, you just had to trust me that it really did run.&lt;/p&gt;
&lt;p&gt;But, I just dropped most of the implementation code in without explaining it. Let’s start off by walking through the basic version, then start planning for the true optimization.&lt;/p&gt;
&lt;h3 id=&#34;the-progress-so-far&#34;&gt;The progress so far&lt;/h3&gt;
&lt;p&gt;Here’s our sample Ruby program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;First, we’ll disassemble the code using &lt;code&gt;make run&lt;/code&gt;, and run it using our C changes (you can pull the &lt;a href=&#34;https://github.com/ruby/ruby/compare/master...jpcamara:ruby:opt-respond-to&#34;&gt;work in progress here&lt;/a&gt;):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;RUNOPT0=--dump=insns make run
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This gives us a new set of instructions. Most of it is the same as Ruby master, but &lt;code&gt;opt_send_without_block&lt;/code&gt; is changed to &lt;code&gt;opt_respond_to&lt;/code&gt;. The &lt;code&gt;calldata&lt;/code&gt; containing &lt;code&gt;respond_to?&lt;/code&gt; is still there, and I &lt;em&gt;think&lt;/em&gt; it’ll stay even once we finish the whole implementation:&lt;/p&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-x86&#34; data-lang=&#34;x86&#34;&gt;# == disasm: #&amp;lt;ISeq:&amp;lt;main&amp;gt;./test.rb:1 (1,0)-(1,76)&amp;gt;
0000 getglobal                :$stdout                  (   1)[Li]
0002 putobject                :write
# our new instruction!
0004 opt_respond_to           &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
0006 branchunless             14
0008 putself
0009 putchilledstring         &amp;quot;Did you know you can write to $stdout?&amp;quot;
0011 opt_send_without_block   &amp;lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&amp;gt;
0013 leave
0014 putnil
0015 leave
&lt;/code&gt;&lt;/pre&gt;&lt;p&gt;Our current implementation is mostly just a pass through to the normal &lt;code&gt;respond_to?&lt;/code&gt; method, with some debug information printed. Running it without the &lt;code&gt;dump=insns&lt;/code&gt; option, this is the output we get:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;&amp;gt; make run

symbol:File
Did you know you can write to $stdout?
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;File&lt;/code&gt; is the type of the receiver, &lt;code&gt;$stdout&lt;/code&gt;, and &lt;code&gt;symbol&lt;/code&gt; is the type of the method argument, &lt;code&gt;:write&lt;/code&gt;.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 In previous posts, we used &lt;code&gt;make runruby&lt;/code&gt; and &lt;code&gt;make lldb-ruby&lt;/code&gt;/&lt;code&gt;make gdb-ruby&lt;/code&gt;. Based on feedback from Ruby maintainers in the know (like byroot), it seems like &lt;code&gt;make run&lt;/code&gt; and &lt;code&gt;make lldb&lt;/code&gt;/&lt;code&gt;make gdb&lt;/code&gt; are the better options in 99% of cases. These commands use “miniruby”, which is all the Ruby syntax without loading stdlib and gems, so it should run faster. If you &lt;em&gt;do&lt;/em&gt; need the stdlib and standard gems, you’ll want to continue using &lt;code&gt;make runruby&lt;/code&gt; and friends&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&#34;breaking-down-the-changes&#34;&gt;Breaking down the changes&lt;/h3&gt;
&lt;p&gt;The last post was running pretty long, so I dumped all the code at the end without explanation. Let&amp;rsquo;s break each section down, starting with our &lt;code&gt;insns.def&lt;/code&gt; change to the virtual machine DSL:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//insns.def
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;DEFINE_INSN
&lt;span style=&#34;color:#a6e22e&#34;&gt;opt_respond_to&lt;/span&gt;
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
    val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; vm_opt_respond_to(recv, mid);
    CALL_SIMPLE_METHOD();
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We have some context for how a virtual machine instruction is defined from the previous post, so let’s break this down:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;opt_respond_to&lt;/code&gt; is the name of the instruction&lt;/li&gt;
&lt;li&gt;&lt;code&gt;(CALL_DATA cd)&lt;/code&gt; is the one “operand”, the call data of the method. I don’t think we’ll need this for our optimized version, but I think if we use a fallback it would still be required&lt;/li&gt;
&lt;li&gt;&lt;code&gt;(VALUE recv, VALUE mid)&lt;/code&gt; are the values this instruction is expecting to be popped off the stack so they can be used in the call. In our sample program instructions this should correspond to &lt;code&gt;getglobal :$stdout&lt;/code&gt; and &lt;code&gt;putobject :write&lt;/code&gt;. &lt;code&gt;$stdout&lt;/code&gt; is &lt;code&gt;recv&lt;/code&gt;, or the “receiver”. &lt;code&gt;:write&lt;/code&gt; is &lt;code&gt;mid&lt;/code&gt;, or the “method id”&lt;/li&gt;
&lt;li&gt;&lt;code&gt;(VALUE val)&lt;/code&gt; is the return value. Whatever gets set to &lt;code&gt;val&lt;/code&gt; gets pushed onto the stack at the end of the instruction. The next instruction in our example is &lt;code&gt;branchunless&lt;/code&gt;, which pops our &lt;code&gt;val&lt;/code&gt; off the stack and tests it&lt;/li&gt;
&lt;li&gt;Next is the body of the instruction:
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;val = vm_opt_respond_to(recv, mid);&lt;/code&gt; here I followed the convention of other instructions which need some custom logic - they put their code inside of a &lt;code&gt;vm_&lt;/code&gt; prefixed function named after their instruction, and define it in &lt;code&gt;vm_insnhelper.c&lt;/code&gt;. My function takes the receiver and the method id, and we’ll dive into that in a bit&lt;/li&gt;
&lt;li&gt;I think &lt;code&gt;CALL_SIMPLE_METHOD();&lt;/code&gt; will use the &lt;code&gt;calldata&lt;/code&gt; to call the original method. Normally you would check the return value of the &lt;code&gt;vm_&lt;/code&gt; function to determine whether you want to pass through to the original implementation. In my case, my function is just printing some debug information so I let it always call the original&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We’ve dug into most of the pattern matching logic in &lt;code&gt;compile.c&lt;/code&gt; in previous posts, so I’ll skip that part and focus on the instruction override:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// compile.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ci &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;);
&lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_respond_to);
iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;;
iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; compile_data_calloc2(
  iseq, 
  iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size, 
  &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(VALUE)
);
iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (VALUE)ci;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Once it&amp;rsquo;s found an instruction that matches a &lt;code&gt;send&lt;/code&gt; to &lt;code&gt;respond_to?&lt;/code&gt;, we override the current information. First we set &lt;code&gt;insn_id&lt;/code&gt; to &lt;code&gt;BIN(opt_respond_to)&lt;/code&gt;, which we know expands to the enum value &lt;code&gt;YARVINSN_opt_respond_to&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;The rest seems… redundant? It already had &lt;code&gt;ci&lt;/code&gt; at the first operand position, and it already had an &lt;code&gt;operand_size&lt;/code&gt; of 1. It’s possible I don’t need to recompile this, but I’ll need some guidance around that. It’s probably not harmful, but possibly unnecessary.&lt;/p&gt;
&lt;p&gt;Last we’ve got our &lt;code&gt;vm_opt_respond_to&lt;/code&gt; function:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// vm_insnhelper.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;vm_opt_respond_to&lt;/span&gt;(VALUE recv, VALUE mid)
{
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (SYMBOL_P(mid)) {
    printf(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;symbol:&amp;#34;&lt;/span&gt;);
  } &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (STRING_P(mid)) {
    printf(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;string:&amp;#34;&lt;/span&gt;);
  }
  printf(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;%s&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;\n&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;, rb_builtin_type_name(TYPE(recv)));
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qundef;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It’s purely a debug function right now. It prints “symbol:” if &lt;code&gt;mid&lt;/code&gt; is a symbol (&lt;code&gt;SYMBOL_P&lt;/code&gt; and &lt;code&gt;STRING_P&lt;/code&gt; are each “predicate” functions, hence the &lt;code&gt;_P&lt;/code&gt;), “string:” if we have a string. Then it prints the type of the receiver and a new line. This is how we end up with &lt;code&gt;symbol:File&lt;/code&gt; when we run our program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
&lt;span style=&#34;color:#75715e&#34;&gt;# symbol:File&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Did you know you can write to $stdout?&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;whats-next&#34;&gt;What’s next?&lt;/h3&gt;
&lt;p&gt;I’m missing some things at the moment:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Tests&lt;/li&gt;
&lt;li&gt;Logic for handling the private/protected param&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Actual&lt;/em&gt; optimization code 😅&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id=&#34;1-tests&#34;&gt;1. Tests&lt;/h3&gt;
&lt;p&gt;There should already be tests for &lt;code&gt;respond_to?&lt;/code&gt;, so I’ll start running those and rely on them for the moment.&lt;/p&gt;
&lt;p&gt;As might be expected for an entire language, there are &lt;em&gt;tons&lt;/em&gt; of tests. There is also RubySpec, which is the standard spec suite for every Ruby language implementation. It’s automatically included in the repository as well.&lt;/p&gt;
&lt;p&gt;I’ll rely on those specs for now:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;&amp;gt; make test-spec SPECOPTS=&amp;#34;../spec/ruby/core/kernel/respond_to_spec.rb&amp;#34;

ruby 3.5.0dev (2025-01-04T14:32:13Z opt-respond-to 5688434f63) +PRISM [arm64-darwin24]
[\ | ==================100%================== | 00:00:00]      0F      0E 

Finished in 0.007758 seconds

1 file, 13 examples, 24 expectations, 0 failures, 0 errors, 0 tagged
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;As expected, it still works so far since my version is basically a pass-through. We’ll see if we need more specs later on or if the base set is enough.&lt;/p&gt;
&lt;h3 id=&#34;2-logic-for-handling-the-privateprotected-param&#34;&gt;2. Logic for handling the private/protected param&lt;/h3&gt;
&lt;p&gt;&lt;code&gt;respond_to?&lt;/code&gt; takes a &lt;a href=&#34;https://docs.ruby-lang.org/en/master/Object.html#method-i-respond_to-3F&#34;&gt;second parameter&lt;/a&gt; - &lt;code&gt;include_all&lt;/code&gt; - which determines whether to include &lt;code&gt;private&lt;/code&gt; and &lt;code&gt;protected&lt;/code&gt; methods.&lt;/p&gt;
&lt;p&gt;I’ve &lt;em&gt;never&lt;/em&gt; seen someone use this second parameter, but I’m sure it’s out there somewhere 🤷‍♂️. &lt;a href=&#34;https://bsky.app/profile/chastell.net/post/3le7dx5wm2s2h&#34;&gt;Piotr Szotkowski recently told me he’s a fan&lt;/a&gt; of the &lt;a href=&#34;https://docs.ruby-lang.org/en/3.4/syntax/control_expressions_rdoc.html#label-Flip-Flop&#34;&gt;flip-flop operator&lt;/a&gt; - so the world is full of surprises 😉! Part of me wants to ignore it for optimizing and just pass through in that case, but that’s a total cop out.&lt;/p&gt;
&lt;p&gt;I think there is some VM magic I need to utilize to handle an optional argument, applying special attributes for dynamic stack pointer adjustment. For instance, &lt;code&gt;opt_send_without_block&lt;/code&gt; is defined like this:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;DEFINE_INSN
&lt;span style=&#34;color:#a6e22e&#34;&gt;opt_send_without_block&lt;/span&gt;
(CALL_DATA cd)
(...)
(VALUE val)
&lt;span style=&#34;color:#f92672&#34;&gt;//&lt;/span&gt; attr &lt;span style=&#34;color:#66d9ef&#34;&gt;bool&lt;/span&gt; handles_sp &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; true;
&lt;span style=&#34;color:#75715e&#34;&gt;// attr rb_snum_t sp_inc = sp_inc_of_sendish(cd-&amp;gt;ci);
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// attr rb_snum_t comptime_sp_inc = sp_inc_of_sendish(ci);
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;{
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It doesn’t specify the pop values, but instead uses the syntax &lt;code&gt;(...)&lt;/code&gt; similar to argument forwarding in Ruby. It then specifies some stack pointer (“sp”) counts (those comments are actual code!), which I think allows it to handle a dynamic number of values to pop off the stack.&lt;/p&gt;
&lt;p&gt;This seems complex for my case, where I have one required and one optional argument. I’ll defer this one for the moment.&lt;/p&gt;
&lt;h3 id=&#34;3-_actual_-optimization-code&#34;&gt;3. &lt;em&gt;Actual&lt;/em&gt; optimization code&lt;/h3&gt;
&lt;p&gt;I actually don’t know if this is optimizable in a meaningful way. I’d be lying if I said I didn’t care if there’s an optimization win here - that’s the most satisfying/impactful outcome.&lt;/p&gt;
&lt;p&gt;This entire series is inspired by &lt;a href=&#34;https://byroot.github.io/ruby/json/2024/12/18/optimizing-ruby-json-part-2.html&#34;&gt;Optimizing Ruby’s JSON, Part 2&lt;/a&gt;, and one of the goals of that work was to reduce setup costs. Here’s some of the &lt;code&gt;JSON.dump&lt;/code&gt; method in its original form:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;dump&lt;/span&gt;(obj, anIO &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, limit &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, kwargs &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;)
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; anIO&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:to_io&lt;/span&gt;)
    anIO &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; anIO&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;to_io
  &lt;span style=&#34;color:#66d9ef&#34;&gt;elsif&lt;/span&gt; limit&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;nil? &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;anIO&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
    anIO, limit &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, anIO
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The majority of the time, &lt;code&gt;anIO&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;, so it won’t have a &lt;code&gt;to_io&lt;/code&gt; or &lt;code&gt;write&lt;/code&gt; method. That means in a micro-benchmark running millions of times the call to &lt;code&gt;respond_to?&lt;/code&gt; is pure overhead. The solution in the post was to avoid the call when &lt;code&gt;nil&lt;/code&gt;, but how fast can we make it if we did a silly, &lt;code&gt;nil&lt;/code&gt;-specific optimization?&lt;/p&gt;
&lt;h3 id=&#34;setting-up-a-performance-baseline&#34;&gt;Setting up a performance baseline&lt;/h3&gt;
&lt;p&gt;Let’s set up a benchmark to see what our current performance is, as a baseline. In CRuby there are built-in benchmarking scripts we can use. We’ll define a new benchmark for &lt;code&gt;respond_to?&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-yml&#34; data-lang=&#34;yml&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# benchmark/object_respond_to.yml&lt;/span&gt;
&lt;span style=&#34;color:#f92672&#34;&gt;prelude&lt;/span&gt;: |&lt;span style=&#34;color:#e6db74&#34;&gt;
&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;  class Base; def foo; end end
&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;  class OneTwentyEight &amp;lt; Base
&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;    128.times { include(Module.new) }
&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;  end
&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;  obj = OneTwentyEight.new&lt;/span&gt;  
&lt;span style=&#34;color:#f92672&#34;&gt;benchmark&lt;/span&gt;:
  &lt;span style=&#34;color:#f92672&#34;&gt;respond_to_false&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;obj.respond_to?(:bar)&lt;/span&gt;
  &lt;span style=&#34;color:#f92672&#34;&gt;respond_to_true&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;obj.respond_to?(:foo)&lt;/span&gt;
  &lt;span style=&#34;color:#f92672&#34;&gt;respond_to_nil_false&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;nil.respond_to?(:bar)&lt;/span&gt;
&lt;span style=&#34;color:#f92672&#34;&gt;loop_count&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;1_000_000&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This YAML first sets up a &lt;code&gt;prelude&lt;/code&gt;, which is Ruby code to set up our benchmark:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;It defines a &lt;code&gt;Base&lt;/code&gt; class with a &lt;code&gt;foo&lt;/code&gt; method&lt;/li&gt;
&lt;li&gt;Creates a child class called &lt;code&gt;OneTwentyEight&lt;/code&gt;, which extends the &lt;code&gt;Base&lt;/code&gt; class&lt;/li&gt;
&lt;li&gt;Includes &lt;code&gt;Module.new&lt;/code&gt; 128 times, to create a lot of ancestors to search for methods&lt;/li&gt;
&lt;li&gt;Instantiates &lt;code&gt;OneTwentyEight&lt;/code&gt; to call from the benchmark&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The benchmark keys specify what operations to run. &lt;code&gt;respond_to_false&lt;/code&gt; checks &lt;code&gt;respond_to?&lt;/code&gt; for a method that doesn&amp;rsquo;t exist, and &lt;code&gt;respond_to_true&lt;/code&gt; checks for a method that does exist. &lt;code&gt;respond_to_nil_false&lt;/code&gt; is unrelated to the prelude, but lets me test how fast looking for a method on &lt;code&gt;nil&lt;/code&gt; is.&lt;/p&gt;
&lt;p&gt;The &lt;code&gt;loop_count&lt;/code&gt; is how many iterations the code will run. I believe it runs several times, and then calculates how many times per second it should be able to run. Aaron Patterson created this benchmark in a &lt;a href=&#34;https://github.com/ruby/ruby/pull/3873&#34;&gt;PR that never merged&lt;/a&gt;, so thanks to him for that!&lt;/p&gt;
&lt;p&gt;We can run the benchmark using &lt;code&gt;make benchmark ITEM=&#39;respond_to&#39;&lt;/code&gt;. I get the following output on a clean &lt;code&gt;master&lt;/code&gt; branch:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_nil_false  |     29.029M|   28.259M|
|                      |       1.03x|         -|
|respond_to_false      |     29.177M|   29.121M|
|                      |       1.00x|         -|
|respond_to_true       |     33.503M|   32.481M|
|                      |       1.03x|         -|
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;compare-ruby&lt;/code&gt; is the version of Ruby the project was built with (yes, building Ruby requires Ruby 🫨). For me, that&amp;rsquo;s Ruby 3.4. &lt;code&gt;built-ruby&lt;/code&gt; is my local, built version. The differences in performance are pretty negligible - probably differences in compile flags used to build Rubies. The performance of each stays pretty close, and can flip-flop a bit between iterations.&lt;/p&gt;
&lt;p&gt;You can run a lot of &lt;code&gt;respond_to?&lt;/code&gt;s in a second! The found method cases are the fastest, and the miss cases are consistently slower.&lt;/p&gt;
&lt;h3 id=&#34;a-first-silly-optimization&#34;&gt;A first silly optimization&lt;/h3&gt;
&lt;p&gt;Now that we have a baseline, let&amp;rsquo;s try two optimizations to see what our upper-limit might be:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;A &lt;code&gt;nil&lt;/code&gt; specific check that &lt;em&gt;always&lt;/em&gt; returns false&lt;/li&gt;
&lt;li&gt;A &lt;code&gt;nil&lt;/code&gt; specific check that has a hard-coded set of possible methods&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;First, we&amp;rsquo;ll change &lt;code&gt;opt_respond_to&lt;/code&gt; into a common pattern. Many instructions will call a method, and if the method returns &lt;code&gt;Qundef&lt;/code&gt;, they&amp;rsquo;ll revert to a base-case path. In our case right now, that&amp;rsquo;s &lt;code&gt;CALL_SIMPLE_METHOD()&lt;/code&gt;. I assume &lt;code&gt;Qundef&lt;/code&gt; exists to specify &amp;ldquo;undefined&amp;rdquo; behavior, to differentiate from &lt;code&gt;Qnil&lt;/code&gt; which could be a valid return value:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// insns.def
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;DEFINE_INSN
&lt;span style=&#34;color:#a6e22e&#34;&gt;opt_respond_to&lt;/span&gt;
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
  val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; vm_opt_respond_to(recv, mid);
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (UNDEF_P(val)) {
    CALL_SIMPLE_METHOD();
  }
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And here is our silliest optimization. If &lt;code&gt;recv&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;, always return false. Otherwise, return &lt;code&gt;Qundef&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// vm_insnhelper.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;vm_opt_respond_to&lt;/span&gt;(VALUE recv, VALUE mid)
{
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (NIL_P(recv)) {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qfalse;
  }

  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qundef;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Let&amp;rsquo;s rerun our benchmark, and see what we get:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;&amp;gt; make benchmark ITEM=&amp;#39;respond_to&amp;#39;

# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_false      |     29.121M|   27.795M|
|                      |       1.05x|         -|
|respond_to_true       |     32.241M|   31.544M|
|                      |       1.02x|         -|
|respond_to_nil_false  |     26.872M|   57.894M|
|                      |           -|     2.15x|
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Oh, not bad! Around a 2x improvement. But ya know, it&amp;rsquo;s totally incorrect. We can add a spec to the &lt;code&gt;respond_to_spec&lt;/code&gt; to check. It fails, as expected:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;it &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;returns true for checking for `==` on nil&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:==&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;should &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;

&lt;span style=&#34;color:#75715e&#34;&gt;# make test-spec SPECOPTS=&amp;#34;../spec/ruby/core/kernel/respond_to_spec.rb&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 1)&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Kernel#respond_to? returns true for checking for `==` on nil FAILED&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Expected false == true&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# to be truthy but was false&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# [/ | ==================100%================== | 00:00:00]      1F      0E &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Finished in 0.017146 seconds&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 1 file, 14 examples, 25 expectations, 1 failure, 0 errors, 0 tagged&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;a-second-slightly-less-silly-optimization&#34;&gt;A second, slightly less silly optimization&lt;/h3&gt;
&lt;p&gt;What if I added some overhead, but not a &lt;em&gt;ton&lt;/em&gt; of overhead? First, I got every method available on &lt;code&gt;nil&lt;/code&gt; from an &lt;code&gt;irb&lt;/code&gt; session:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;methods
&lt;span style=&#34;color:#75715e&#34;&gt;# &amp;#34;rationalize&amp;#34;, &amp;#34;&amp;amp;&amp;#34;, &amp;#34;===&amp;#34;, &amp;#34;inspect&amp;#34;, &amp;#34;=~&amp;#34;, &amp;#34;to_a&amp;#34;,...&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Then I took that and put it into an array of C strings (&lt;code&gt;char *&lt;/code&gt;s) in C. The first time we call our &lt;code&gt;vm_opt_respond_to&lt;/code&gt; function, it populates a &lt;code&gt;rb_id_table&lt;/code&gt; with each of the available method names using &lt;code&gt;rb_id_table_insert&lt;/code&gt;. &lt;code&gt;rb_id_table&lt;/code&gt; is an internal CRuby hashtable structure which revolves around &lt;code&gt;ID&lt;/code&gt;s, which I believe typically correspond to method names.&lt;/p&gt;
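&lt;p&gt;As a Ruby-side analogy (hedged - &lt;code&gt;ID&lt;/code&gt; is a C-level concept, and this is only a loose correspondence), symbols are roughly the Ruby face of those interned IDs:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;# Method names surface in Ruby as Symbols - the Ruby-level cousin of C IDs.
# rb_intern(&#34;to_s&#34;) in C corresponds (roughly) to writing :to_s here.
p nil.methods.first.class # =&gt; Symbol
p &#34;to_s&#34;.to_sym == :to_s  # =&gt; true - same interned symbol either way
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;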
&lt;p&gt;If the &lt;code&gt;recv&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;, we use &lt;code&gt;method_id_table&lt;/code&gt; to check if one of our hard-coded method names is being checked by &lt;code&gt;respond_to?&lt;/code&gt;, using &lt;code&gt;rb_id_table_lookup&lt;/code&gt;. If it returns true, we return &lt;code&gt;Qtrue&lt;/code&gt;, otherwise &lt;code&gt;Qfalse&lt;/code&gt;.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_id_table &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;method_id_table &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; NULL;

&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;vm_opt_respond_to&lt;/span&gt;(VALUE recv, VALUE mid)
{
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (method_id_table &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL) {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;method_names[] &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;rationalize&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&amp;amp;&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;===&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;inspect&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;=~&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_a&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_s&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_i&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_f&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_r&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_c&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;nil?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;pretty_print_cycle&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;|&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_h&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;^&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_json&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_yaml&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;pretty_print&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;pretty_print_instance_variables&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;pretty_print_inspect&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;singleton_class&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;dup&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;itself&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;methods&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;singleton_methods&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;protected_methods&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;private_methods&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;public_methods&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_variables&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_variable_get&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_variable_set&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_variable_defined?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;remove_instance_variable&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_of?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;kind_of?&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;is_a?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;display&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;frozen?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;class&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;then&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;yield_self&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;tap&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;TypeName&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;public_send&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;extend&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;clone&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&amp;lt;=&amp;gt;&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;pretty_inspect&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;!~&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;method&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;eql?&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;respond_to?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;public_method&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;singleton_method&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;define_singleton_method&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;hash&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;freeze&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;object_id&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Namespace&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;send&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;to_enum&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;enum_for&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;equal?&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;!&amp;#34;&lt;/span&gt;,
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;__send__&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;==&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;!=&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;__id__&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_eval&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;instance_exec&amp;#34;&lt;/span&gt;
    };

    size_t method_names_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(method_names) &lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(method_names[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;]);
    method_id_table &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_id_table_create(method_names_size);

    &lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; (size_t i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;; i &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; method_names_size; i&lt;span style=&#34;color:#f92672&#34;&gt;++&lt;/span&gt;) {
      ID id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_intern(method_names[i]);
      rb_id_table_insert(method_id_table, id, Qtrue);
    }
  }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (NIL_P(recv)) {
    ID id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_check_id(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;mid);
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;id) &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qfalse;

    VALUE val;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (rb_id_table_lookup(method_id_table, id, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;val)) {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qtrue;
    } &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qfalse;
    }
  }

  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qundef;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;How fast is this version, now that we&amp;rsquo;re doing some &lt;em&gt;actual&lt;/em&gt; work?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_false      |     29.668M|   28.738M|
|                      |       1.03x|         -|
|respond_to_true       |     33.320M|   29.829M|
|                      |       1.12x|         -|
|respond_to_nil_false  |     28.610M|   53.084M|
|                      |           -|     1.86x|
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Still pretty fast! It even passes our spec now:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;[| | ==================100%================== | 00:00:00]      0F      0E 
Finished in 0.008847 seconds
1 file, 14 examples, 25 expectations, 0 failures, 0 errors, 0 tagged
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;back-to-reality&#34;&gt;Back to reality&lt;/h3&gt;
&lt;p&gt;Ok - we described our base code. We walked through next steps. We ran some specs and got a feel for some benchmarks. It seems like our upper limit on performance may be about 2x how fast it currently runs - and even that is probably unattainable. But it&amp;rsquo;s nice to know the potential ceiling on performance from where things currently are.&lt;/p&gt;
&lt;p&gt;Next time we&amp;rsquo;ll dig into some previous optimization improvements to &lt;code&gt;respond_to?&lt;/code&gt; in older PRs, how &lt;code&gt;respond_to?&lt;/code&gt; works currently, and hopefully make our first &lt;em&gt;real&lt;/em&gt; optimization improvement. See you next time!&lt;/p&gt;
&lt;p&gt;PS - You can find the code changes made in the branch &lt;a href=&#34;https://github.com/ruby/ruby/commit/e6820ada262582d99b95dbcbcea38a935706da5e&#34;&gt;here&lt;/a&gt;.&lt;/p&gt;
</description>
      <source:markdown>In [part 5](https://jpcamara.com/2024/12/30/defining-an-instruction-adding-optrespondto.html), we finally got our new instruction defined and outputting as part of our bytecode. if you didn’t run it yourself, you just had to trust me that it really did run.


But, I just dropped most of the implementation code in without explaining it. Let’s start off by walking through the basic version, then start planning for the true optimization.


### The progress so far


Here’s our sample Ruby program:


```rb
puts &#34;Did you know you can write to $stdout?&#34; if $stdout.respond_to?(:write)
```
First, we’ll disassemble the code using `make run`, and run it using our C changes (you can pull the [work in progress here](https://github.com/ruby/ruby/compare/master...jpcamara:ruby:opt-respond-to)):


```text
RUNOPT0=--dump=insns make run
```
This gives us a new set of instructions. Most of it is the same as Ruby master, but `opt_send_without_block` is changed to `opt_respond_to`. The `calldata` containing `respond_to?` is still there, and I _think_ it’ll stay even once we finish the whole implementation:


```x86
# == disasm: #&lt;ISeq:&lt;main&gt;./test.rb:1 (1,0)-(1,76)&gt;
0000 getglobal                :$stdout                  (   1)[Li]
0002 putobject                :write
# our new instruction!
0004 opt_respond_to           &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
0006 branchunless             14
0008 putself
0009 putchilledstring         &#34;Did you know you can write to $stdout?&#34;
0011 opt_send_without_block   &lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&gt;
0013 leave
0014 putnil
0015 leave
```
Our current implementation is mostly just a pass-through to the normal `respond_to?` method, with some debug information printed. Running it without the `--dump=insns` option gives us this output:


```text
&gt; make run

symbol:File
Did you know you can write to $stdout?
```
`File` is the type of the receiver, `$stdout`, and `symbol` is the type of the method argument, `:write`.


&gt; 📝 In previous posts, we used `make runruby` and `make lldb-ruby`/`make gdb-ruby`. Based on feedback from Ruby maintainers in the know (like byroot), it seems like `make run` and `make lldb`/`make gdb` are the better options in 99% of cases. These commands use “miniruby”, which is all the Ruby syntax without loading stdlib and gems, so it should run faster. If you _do_ need the stdlib and standard gems, you’ll want to continue using `make runruby` and friends.


### Breaking down the changes


The last post was running pretty long, so I dumped all the code at the end without explanation. Let&#39;s break each section down, starting with our `insns.def` change to the virtual machine DSL:


```c
//insns.def
DEFINE_INSN
opt_respond_to
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
    val = vm_opt_respond_to(recv, mid);
    CALL_SIMPLE_METHOD();
}
```
We have some context for how a virtual machine instruction is defined from the previous post, so let’s break this down:


- `opt_respond_to` is the name of the instruction
- `(CALL_DATA cd)` is the one “operand”, the call data of the method. I don’t think we’ll need this for our optimized version, but I think if we use a fallback it would still be required
- `(VALUE recv, VALUE mid)` are the values this instruction is expecting to be popped off the stack so they can be used in the call. In our sample program instructions this should correspond to `getglobal :$stdout` and `putobject :write`. `$stdout` is `recv`, or the “receiver”. `:write` is `mid`, or the “method id”
- `(VALUE val)` is the return value. Whatever gets set to `val` gets pushed onto the stack at the end of the instruction. The next instruction in our example is `branchunless`, which pops our `val` off the stack and tests it
- Next is the body of the instruction:
	- `val = vm_opt_respond_to(recv, mid);` here I followed the convention of other instructions which need some custom logic - they put their code inside of a `vm_` prefixed function named after their instruction, and define it in `vm_insnhelper.c`. My function takes the receiver and the method id, and we’ll dive into that in a bit
	- I think `CALL_SIMPLE_METHOD();` will use the `calldata` to call the original method. Normally you would check the return value of the `vm_` function to determine whether you want to pass through to the original implementation. In my case, my function is just printing some debug information so I let it always call the original 


We’ve dug into most of the pattern matching logic in `compile.c` in previous posts, so I’ll skip that part and focus on the instruction override:


```c
// compile.c
const struct rb_callinfo *ci = (struct rb_callinfo *)OPERAND_AT(iobj, 0);
//...
iobj-&gt;insn_id = BIN(opt_respond_to);
iobj-&gt;operand_size = 1;
iobj-&gt;operands = compile_data_calloc2(
  iseq, 
  iobj-&gt;operand_size, 
  sizeof(VALUE)
);
iobj-&gt;operands[0] = (VALUE)ci;
```
Once it&#39;s found an instruction that matches a `send` to `respond_to?`, we override the current information. First we set `insn_id` to `BIN(opt_respond_to)`, which we know expands to the enum value `YARVINSN_opt_respond_to`.


The rest seems… redundant? It already had `ci` at the first operand position, and the `operand_size` was already 1. It’s possible I don’t need to recompile this, but I’ll need some guidance around that. It’s probably not harmful, but possibly unnecessary.


Last we’ve got our `vm_opt_respond_to` function:


```c
// vm_insnhelper.c
static VALUE
vm_opt_respond_to(VALUE recv, VALUE mid)
{
  if (SYMBOL_P(mid)) {
    printf(&#34;symbol:&#34;);
  } else if (STRING_P(mid)) {
    printf(&#34;string:&#34;);
  }
  printf(&#34;%s\n&#34;, rb_builtin_type_name(TYPE(recv)));
  return Qundef;
}
```
It’s purely a debug function right now. It prints “symbol:” if `mid` is a symbol (`SYMBOL_P` and `STRING_P` are each “predicate” functions, hence the `_P`), and “string:” if it’s a string. Then it prints the type of the receiver and a newline. This is how we end up with `symbol:File` when we run our program:


```rb
puts &#34;Did you know you can write to $stdout?&#34; if $stdout.respond_to?(:write)
# symbol:File
# Did you know you can write to $stdout?
```
### What’s next?


I’m missing some things at the moment:


1. Tests
1. Logic for handling the private/protected param
1. _Actual_ optimization code 😅


### 1. Tests


There should already be tests for `respond_to?`, so I’ll start running those and rely on them for the moment.


As might be expected for an entire language, there are _tons_ of tests. There is also RubySpec, which is the standard spec suite for every Ruby language implementation. It’s automatically included in the repository as well.


I’ll rely on those specs for now:


```text
&gt; make test-spec SPECOPTS=&#34;../spec/ruby/core/kernel/respond_to_spec.rb&#34;

ruby 3.5.0dev (2025-01-04T14:32:13Z opt-respond-to 5688434f63) +PRISM [arm64-darwin24]
[\ | ==================100%================== | 00:00:00]      0F      0E 

Finished in 0.007758 seconds

1 file, 13 examples, 24 expectations, 0 failures, 0 errors, 0 tagged
```
As expected, it still works so far since my version is basically a pass-through. We’ll see if we need more specs later on or if the base set is enough.


### 2. Logic for handling the private/protected param


`respond_to?` takes a [second parameter](https://docs.ruby-lang.org/en/master/Object.html#method-i-respond_to-3F) - `include_all` - which determines whether to include `private` and `protected` methods.


I’ve _never_ seen someone use this second parameter, but I’m sure it’s out there somewhere 🤷‍♂️. [Piotr Szotkowski recently told me he’s a fan](https://bsky.app/profile/chastell.net/post/3le7dx5wm2s2h) of the [flip-flop operator](https://docs.ruby-lang.org/en/3.4/syntax/control_expressions_rdoc.html#label-Flip-Flop) - so the world is full of surprises 😉! Part of me wants to ignore it when optimizing and just pass through in that case, but that’s a total cop-out.
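

To make the behavior concrete, here’s a quick irb-style illustration (a hedged sketch - the `Secretive` class and `hidden` method are made up for this example):


```rb
class Secretive
  private def hidden; end
end

obj = Secretive.new
p obj.respond_to?(:hidden)       # =&gt; false - private methods excluded by default
p obj.respond_to?(:hidden, true) # =&gt; true - include_all covers private/protected
```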


I think there is some VM magic I need to utilize to handle an optional argument, applying special attributes for dynamic stack pointer adjustment. For instance, `opt_send_without_block` is defined like this:


```c
DEFINE_INSN
opt_send_without_block
(CALL_DATA cd)
(...)
(VALUE val)
// attr bool handles_sp = true;
// attr rb_snum_t sp_inc = sp_inc_of_sendish(cd-&gt;ci);
// attr rb_snum_t comptime_sp_inc = sp_inc_of_sendish(ci);
{
  //...
}
```
It doesn’t specify the pop values, but instead uses the syntax `(...)` similar to argument forwarding in Ruby. It then specifies some stack pointer (“sp”) counts (those comments are actual code!), which I think allows it to handle a dynamic number of values to pop off the stack.


This seems complex for my case, where I have one required and one optional argument. I’ll defer this one for the moment.


### 3. _Actual_ optimization code


I actually don’t know if this is optimizable in a meaningful way. I’d be lying if I said I didn’t care whether there’s an optimization win here - that’s the most satisfying/impactful outcome.


This entire series is inspired by [Optimizing Ruby’s JSON, Part 2](https://byroot.github.io/ruby/json/2024/12/18/optimizing-ruby-json-part-2.html), and one of the goals of that work was to reduce setup costs. Here’s some of the `JSON.dump` method in its original form:


```rb
def dump(obj, anIO = nil, limit = nil, kwargs = nil)
  #...
  if anIO.respond_to?(:to_io)
    anIO = anIO.to_io
  elsif limit.nil? &amp;&amp; !anIO.respond_to?(:write)
    anIO, limit = nil, anIO
  end
  #...
end
```
The majority of the time, `anIO` is `nil`, so it won’t have a `to_io` or `write` method. That means that in a micro-benchmark running millions of times, the call to `respond_to?` is pure overhead. The solution in the post was to avoid the call when `nil`, but how fast can we make it if we did a silly, `nil`-specific optimization?
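

For reference, here’s roughly the shape that fix takes at the Ruby level (a hedged sketch - not the exact change that shipped in the json gem). With `anIO` guarded first, the `nil` path never pays for a method lookup at all:


```rb
def dump(obj, anIO = nil, limit = nil, kwargs = nil)
  #...
  # Guard on nil first, so the common nil case never calls respond_to? at all
  if anIO &amp;&amp; anIO.respond_to?(:to_io)
    anIO = anIO.to_io
  elsif limit.nil? &amp;&amp; anIO &amp;&amp; !anIO.respond_to?(:write)
    anIO, limit = nil, anIO
  end
  #...
end
```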


### Setting up a performance baseline


Let’s set up a benchmark to see what our current performance is, as a baseline. In CRuby there are built-in benchmarking scripts we can use. We’ll define a new benchmark for `respond_to?`:


```yml
# benchmark/object_respond_to.yml
prelude: |
  class Base; def foo; end end
  class OneTwentyEight &lt; Base
    128.times { include(Module.new) }
  end
  obj = OneTwentyEight.new
benchmark:
  respond_to_false: obj.respond_to?(:bar)
  respond_to_true: obj.respond_to?(:foo)
  respond_to_nil_false: nil.respond_to?(:bar)
loop_count: 1_000_000
```
This YAML first sets up a `prelude`, which is Ruby code to set up our benchmark:


- It defines a `Base` class with a `foo` method
- Creates a child class called `OneTwentyEight`, which inherits from the `Base` class
- Includes `Module.new` 128 times, to create a lot of ancestors to search through for methods
- Instantiates `OneTwentyEight` to call from the benchmark


The benchmark keys specify what operations to run. `respond_to_false` checks `respond_to?` for a method that doesn&#39;t exist, and `respond_to_true` checks for a method that does exist. `respond_to_nil_false` is unrelated to the prelude, but lets me test how fast looking up a method on `nil` is.


The `loop_count` is how many iterations of each benchmark will run. I believe the harness runs the loop several times, then calculates how many iterations per second it can sustain. Aaron Patterson created this benchmark in a [PR that never merged](https://github.com/ruby/ruby/pull/3873), so thanks to him for that!
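

As a rough mental model of what that means (a hedged sketch - not the actual harness internals), iterations per second falls out of timing the loop. The real harness presumably also handles warmup and loop overhead, which this sketch ignores:


```rb
# Hedged sketch of deriving i/s from loop_count - the real harness is fancier
loop_count = 1_000_000
started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
loop_count.times { nil.respond_to?(:bar) }
elapsed = Process.clock_gettime(Process::CLOCK_MONOTONIC) - started
puts &#34;#{(loop_count / elapsed / 1_000_000.0).round(3)}M i/s&#34;
```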


We can run the benchmark using `make benchmark ITEM=&#39;respond_to&#39;`. I get the following output on a clean `master` branch:


```text
# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_nil_false  |     29.029M|   28.259M|
|                      |       1.03x|         -|
|respond_to_false      |     29.177M|   29.121M|
|                      |       1.00x|         -|
|respond_to_true       |     33.503M|   32.481M|
|                      |       1.03x|         -|
```
`compare-ruby` is the version of Ruby the project was built with (yes, building Ruby requires Ruby 🫨). For me, that&#39;s Ruby 3.4. `built-ruby` is my local, built version. The differences in performance are pretty negligible - probably down to differences in the compile flags used to build each Ruby. The performance of each stays pretty close, and can flip-flop a bit between iterations.


You can run a lot of `respond_to?`s in a second! The found-method cases are the fastest, and the miss cases are consistently slower.


### A first silly optimization


Now that we have a baseline, let&#39;s try two optimizations to see what our upper limit might be:


1. A `nil` specific check that _always_ returns false
1. A `nil` specific check that has a hard-coded set of possible methods


First, we&#39;ll change `opt_respond_to` into a common pattern. Many instructions will call a method, and if the method returns `Qundef`, they&#39;ll revert to a base-case path. In our case right now, that&#39;s `CALL_SIMPLE_METHOD()`. I assume `Qundef` exists to specify &#34;undefined&#34; behavior, to differentiate from `Qnil` which could be a valid return value:


```c
// insns.def
DEFINE_INSN
opt_respond_to
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
  val = vm_opt_respond_to(recv, mid);
  if (UNDEF_P(val)) {
    CALL_SIMPLE_METHOD();
  }
}
```
And here is our silliest optimization. If `recv` is `nil`, always return false. Otherwise, return `Qundef`:


```c
// vm_insnhelper.c
static VALUE
vm_opt_respond_to(VALUE recv, VALUE mid)
{
  if (NIL_P(recv)) {
    return Qfalse;
  }

  return Qundef;
}
```
Let&#39;s rerun our benchmark, and see what we get:


```text
&gt; make benchmark ITEM=&#39;respond_to&#39;

# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_false      |     29.121M|   27.795M|
|                      |       1.05x|         -|
|respond_to_true       |     32.241M|   31.544M|
|                      |       1.02x|         -|
|respond_to_nil_false  |     26.872M|   57.894M|
|                      |           -|     2.15x|
```
Oh, not bad! Around a 2x improvement. But ya know, it&#39;s totally incorrect. We can add a spec to the `respond_to_spec` to check. It fails, as expected:


```rb
it &#34;returns true for checking for `==` on nil&#34; do
  nil.respond_to?(:==).should == true
end

# make test-spec SPECOPTS=&#34;../spec/ruby/core/kernel/respond_to_spec.rb&#34;
# 1)
# Kernel#respond_to? returns true for checking for `==` on nil FAILED
# Expected false == true
# to be truthy but was false
# [/ | ==================100%================== | 00:00:00]      1F      0E 
# Finished in 0.017146 seconds
# 1 file, 14 examples, 25 expectations, 1 failure, 0 errors, 0 tagged
```
### A second, slightly less silly optimization


What if I added some overhead, but not a _ton_ of overhead? First, I got every method available on `nil` from an `irb` session:


```rb
nil.methods
# &#34;rationalize&#34;, &#34;&amp;&#34;, &#34;===&#34;, &#34;inspect&#34;, &#34;=~&#34;, &#34;to_a&#34;,...
```
Then I took that and put it into an array of C strings (`char *`s) in C. The first time we call our `vm_opt_respond_to` function, it populates a `rb_id_table` with each of the available method names using `rb_id_table_insert`. `rb_id_table` is an internal CRuby hashtable structure which revolves around `ID`s, which I believe typically correspond to method names.
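

As a Ruby-side analogy (hedged - `ID` is a C-level concept, and this is only a loose correspondence), symbols are roughly the Ruby face of those interned IDs:


```rb
# Method names surface in Ruby as Symbols - the Ruby-level cousin of C IDs.
# rb_intern(&#34;to_s&#34;) in C corresponds (roughly) to writing :to_s here.
p nil.methods.first.class # =&gt; Symbol
p &#34;to_s&#34;.to_sym == :to_s  # =&gt; true - same interned symbol either way
```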


If the `recv` is `nil`, we use `method_id_table` to check if one of our hard-coded method names is being checked by `respond_to?`, using `rb_id_table_lookup`. If it returns true, we return `Qtrue`, otherwise `Qfalse`.


```c
static struct rb_id_table *method_id_table = NULL;

static VALUE
vm_opt_respond_to(VALUE recv, VALUE mid)
{
  if (method_id_table == NULL) {
    const char *method_names[] = {
      &#34;rationalize&#34;, &#34;&amp;&#34;, &#34;===&#34;, &#34;inspect&#34;, &#34;=~&#34;, &#34;to_a&#34;, &#34;to_s&#34;, &#34;to_i&#34;, &#34;to_f&#34;, &#34;to_r&#34;,
      &#34;to_c&#34;, &#34;nil?&#34;, &#34;pretty_print_cycle&#34;, &#34;|&#34;, &#34;to_h&#34;, &#34;^&#34;, &#34;to_json&#34;, &#34;to_yaml&#34;,
      &#34;pretty_print&#34;, &#34;pretty_print_instance_variables&#34;, &#34;pretty_print_inspect&#34;, &#34;singleton_class&#34;,
      &#34;dup&#34;, &#34;itself&#34;, &#34;methods&#34;, &#34;singleton_methods&#34;, &#34;protected_methods&#34;, &#34;private_methods&#34;,
      &#34;public_methods&#34;, &#34;instance_variables&#34;, &#34;instance_variable_get&#34;, &#34;instance_variable_set&#34;,
      &#34;instance_variable_defined?&#34;, &#34;remove_instance_variable&#34;, &#34;instance_of?&#34;, &#34;kind_of?&#34;,
      &#34;is_a?&#34;, &#34;display&#34;, &#34;frozen?&#34;, &#34;class&#34;, &#34;then&#34;, &#34;yield_self&#34;, &#34;tap&#34;, &#34;TypeName&#34;,
      &#34;public_send&#34;, &#34;extend&#34;, &#34;clone&#34;, &#34;&lt;=&gt;&#34;, &#34;pretty_inspect&#34;, &#34;!~&#34;, &#34;method&#34;, &#34;eql?&#34;,
      &#34;respond_to?&#34;, &#34;public_method&#34;, &#34;singleton_method&#34;, &#34;define_singleton_method&#34;, &#34;hash&#34;,
      &#34;freeze&#34;, &#34;object_id&#34;, &#34;Namespace&#34;, &#34;send&#34;, &#34;to_enum&#34;, &#34;enum_for&#34;, &#34;equal?&#34;, &#34;!&#34;,
      &#34;__send__&#34;, &#34;==&#34;, &#34;!=&#34;, &#34;__id__&#34;, &#34;instance_eval&#34;, &#34;instance_exec&#34;
    };

    size_t method_names_size = sizeof(method_names) / sizeof(method_names[0]);
    method_id_table = rb_id_table_create(method_names_size);

    for (size_t i = 0; i &lt; method_names_size; i++) {
      ID id = rb_intern(method_names[i]);
      rb_id_table_insert(method_id_table, id, Qtrue);
    }
  }
  if (NIL_P(recv)) {
    ID id = rb_check_id(&amp;mid);
    if (!id) return Qfalse;

    VALUE val;
    if (rb_id_table_lookup(method_id_table, id, &amp;val)) {
      return Qtrue;
    } else {
      return Qfalse;
    }
  }

  return Qundef;
}
```
How fast is this version, now that we&#39;re doing some _actual_ work?


```text
# Iteration per second (i/s)
|                      |compare-ruby|built-ruby|
|:---------------------|-----------:|---------:|
|respond_to_false      |     29.668M|   28.738M|
|                      |       1.03x|         -|
|respond_to_true       |     33.320M|   29.829M|
|                      |       1.12x|         -|
|respond_to_nil_false  |     28.610M|   53.084M|
|                      |           -|     1.86x|
```
Still pretty fast! It even passes our spec now:


```text
[| | ==================100%================== | 00:00:00]      0F      0E 
Finished in 0.008847 seconds
1 file, 14 examples, 25 expectations, 0 failures, 0 errors, 0 tagged
```
### Back to reality


Ok - we described our base code. We walked through next steps. We ran some specs and got a feel for some benchmarks. It seems like our upper limit on performance may be about 2x how fast it currently runs - and even that is probably unattainable. But it&#39;s nice to know the potential ceiling on performance from where things currently are.


Next time we&#39;ll dig into some previous optimization improvements to `respond_to?` in older PRs, how `respond_to?` works currently, and hopefully make our first _real_ optimization improvement. See you next time!


PS - You can find the code changes made in the branch [here](https://github.com/ruby/ruby/commit/e6820ada262582d99b95dbcbcea38a935706da5e).
</source:markdown>
    </item>
    
    <item>
      <title>Defining an instruction: adding opt_respond_to to the Ruby VM, part 5</title>
      <link>https://jpcamara.com/2024/12/30/defining-an-instruction-adding-optrespondto.html</link>
      <pubDate>Tue, 31 Dec 2024 22:43:30 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/30/defining-an-instruction-adding-optrespondto.html</guid>
      <description>&lt;p&gt;In &lt;a href=&#34;#&#34;&gt;Peephole optimizations: adding &lt;code&gt;opt_respond_to&lt;/code&gt; to the Ruby VM, part 4&lt;/a&gt;, we dug deep. We found the connection between prism compilation and the specialization we need for our new bytecode, called “peephole optimization”. We learned how to debug and step through C code in the Ruby runtime, and we added some logic for matching the “pattern” of the instruction we want to change.&lt;/p&gt;
&lt;p&gt;Now that we know where the specialization needs to go and how to match what needs to be specialized - what do we actually replace it with? How do we get the virtual machine to recognize &lt;code&gt;opt_respond_to&lt;/code&gt;?&lt;/p&gt;
&lt;h3 id=&#34;pattern-matching-bytecode-instructions&#34;&gt;Pattern matching bytecode instructions&lt;/h3&gt;
&lt;p&gt;&lt;code&gt;opt_ary_freeze&lt;/code&gt; has been a great learning tool - let’s see what it teaches us about adding a new instruction name.&lt;/p&gt;
&lt;p&gt;Here’s a refresher on how &lt;code&gt;iseq_peephole_optimize&lt;/code&gt; matches on &lt;code&gt;newarray&lt;/code&gt;, and then replaces it with &lt;code&gt;opt_ary_freeze&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (
  IS_INSN_ID(iobj, newarray)
) {
  LINK_ELEMENT &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;next &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;link.next;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (
    IS_INSN(next) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt;
    IS_INSN_ID(next, send)
  ) {
    &lt;span style=&#34;color:#75715e&#34;&gt;//... more if statements
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;it first checks if the instruction id of &lt;code&gt;iobj&lt;/code&gt; is &lt;code&gt;newarray&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;it grabs the next element and checks if it’s an instruction, then checks if the instruction is &lt;code&gt;send&lt;/code&gt; (it also checks if the method id is “freeze”, not shown above)&lt;/li&gt;
&lt;li&gt;if those checks match, it replaces the instruction id with &lt;code&gt;opt_ary_freeze&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That’s pretty reasonable to follow. But how do &lt;code&gt;IS_INSN&lt;/code&gt; and &lt;code&gt;IS_INSN_ID&lt;/code&gt; work? What is &lt;code&gt;BIN&lt;/code&gt;? What type is &lt;code&gt;opt_ary_freeze&lt;/code&gt; - where is it defined? How do we add new instructions ourselves?&lt;/p&gt;
&lt;h3 id=&#34;macros-and-enums&#34;&gt;Macros and enums&lt;/h3&gt;
&lt;p&gt;&lt;code&gt;BIN&lt;/code&gt;, &lt;code&gt;IS_INSN&lt;/code&gt; and &lt;code&gt;IS_INSN_ID&lt;/code&gt; are all C macros that revolve around interacting with virtual machine instructions.&lt;/p&gt;
&lt;p&gt;Macros in C get embedded directly into your code in a preprocessing step before being compiled, so you can write things that look pretty odd compared to a typical C-like syntax. Here’s the definition for &lt;code&gt;BIN&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define BIN(n) YARVINSN_##n
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 &lt;code&gt;BIN&lt;/code&gt; probably stands for “Binary INstruction”&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;That ## is kind of like string interpolation, but the result is a static part of your actual code. This means that anywhere &lt;code&gt;BIN&lt;/code&gt; is called, it’s kind of like saying &lt;code&gt;YARVINSN_#{n}&lt;/code&gt; in Ruby. So this code:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Gets transformed into this, right before the program is compiled:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; YARVINSN_opt_ary_freeze;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Here’s the definition for &lt;code&gt;IS_INSN_ID&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define IS_INSN_ID(iobj, insn) (INSN_OF(iobj) == BIN(insn))
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Based on our understanding of macros and &lt;code&gt;BIN&lt;/code&gt;, so far, it gets transformed into:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define IS_INSN_ID(iobj, insn) (INSN_OF(iobj) == YARVINSN_##insn)
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Here’s the definition for &lt;code&gt;INSN_OF&lt;/code&gt;, it just casts &lt;code&gt;insn&lt;/code&gt; to an &lt;code&gt;INSN&lt;/code&gt; type, and accesses its instruction id:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define INSN_OF(insn) \
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  (((INSN*)(insn))-&amp;gt;insn_id)
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;That means the expanded version of &lt;code&gt;IS_INSN_ID&lt;/code&gt; is:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define IS_INSN_ID(iobj, insn) \
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  (((INSN*)(insn))-&amp;gt;insn_id == YARVINSN_##insn)
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Here’s the definition for &lt;code&gt;IS_INSN&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define IS_INSN(link) ((link)-&amp;gt;type == ISEQ_ELEMENT_INSN)
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;If we combined all of it together, and manually inline it, here’s what our original pattern matching code looks like:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (
  (((INSN&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)(iobj))&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; YARVINSN_newarray)
) {
  LINK_ELEMENT &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;next &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;link.next;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (
    (next)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;type &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; ISEQ_ELEMENT_INSN &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt;
    (((INSN&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)(next))&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; YARVINSN_send)
  ) {
    &lt;span style=&#34;color:#75715e&#34;&gt;//... more if statements
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; YARVINSN_opt_ary_freeze;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I’m glad CRuby adds those macros… this expanded code is a lot less readable.&lt;/p&gt;
&lt;p&gt;Why did I expand it and make that original code less clear? I wanted to think through what the instructions &lt;em&gt;really&lt;/em&gt; look like at runtime, and why. The reason, as far as I can infer, is that all of these macros let you focus on a syntax that looks just like our VM instructions, while making sure there are no name collisions behind the scenes.&lt;/p&gt;
&lt;p&gt;Ok, I still haven’t shown where the instruction comes from. Here’s the file you can actually find an &lt;code&gt;enum&lt;/code&gt; containing the entire list of vm instructions:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// insns.inc
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; ruby_vminsn_type {
  BIN(nop),
  BIN(getlocal),
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  BIN(opt_ary_freeze),
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}

&lt;span style=&#34;color:#75715e&#34;&gt;// or, expanded by the preprocessor!
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; ruby_vminsn_type {
  YARVINSN_nop,
  YARVINSN_getlocal,
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  YARVINSN_opt_ary_freeze,
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;insns.inc&lt;/code&gt; gets included anywhere we need instruction checks, like in &lt;code&gt;compile.c&lt;/code&gt;. These &lt;code&gt;enum&lt;/code&gt; values are globally available anywhere this file is included. Thanks to &lt;code&gt;BIN&lt;/code&gt; prepending all of their names with &lt;code&gt;YARVINSN_&lt;/code&gt;, we can use them in a convenient syntax without having any collisions.&lt;/p&gt;
&lt;p&gt;So if I search the CRuby repo for &lt;code&gt;insns.inc&lt;/code&gt;, where can I find it? Hmmm, I can’t 🤔. &lt;code&gt;insns.inc&lt;/code&gt; is a generated file! I can only see it locally, after compiling the entire project. Where does that file get generated from?&lt;/p&gt;
&lt;h3 id=&#34;a-virtual-machine-dsl&#34;&gt;A virtual machine DSL&lt;/h3&gt;
&lt;p&gt;While &lt;code&gt;insns.inc&lt;/code&gt; tells us the name of each instruction available, the file it is generated from defines every instruction available in the Ruby virtual machine, &lt;em&gt;and&lt;/em&gt; how it should respond to that instruction. It’s called &lt;code&gt;insns.def&lt;/code&gt;. The file looks a lot like C, but it’s actually a kind of &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Domain-specific_language&#34;&gt;DSL&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;It lets you define a simplified set of information for the instruction. That simplified format is then compiled into a more comprehensive, C-compatible version.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It’s compilers all the way down… 😵‍💫&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The top of the file defines the format. I don’t &lt;em&gt;fully&lt;/em&gt; understand it, but let’s walk through it:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;/*
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;DEFINE_INSN
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;instruction_name
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;(type operand, type operand, ..)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;(pop_values, ..)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;(return values ..)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// attr type name contents..
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;{
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  .. // insn body
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;}
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;*/&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;An instruction consists of:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A name ✅&lt;/li&gt;
&lt;li&gt;Operands - like our “call data”. I’m not sure what other operand types are typically used for, but I know &lt;code&gt;opt_ary_freeze&lt;/code&gt; puts the frozen array here as well&lt;/li&gt;
&lt;li&gt;Values to pop off the virtual machine stack so we can operate on them. This should be values that we’ve seen pushed onto the stack in previous instructions&lt;/li&gt;
&lt;li&gt;A return value&lt;/li&gt;
&lt;li&gt;(I don’t fully understand the value of &lt;code&gt;attr type&lt;/code&gt; but it seems to influence what code gets generated by the instruction definition)&lt;/li&gt;
&lt;li&gt;A C-compatible body&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That’s a lot, baked into a small interface. Let’s look at a very simple example. Here’s one of the simplest instructions available, &lt;code&gt;putnil&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;DEFINE_INSN
&lt;span style=&#34;color:#a6e22e&#34;&gt;putnil&lt;/span&gt;
()
()
(VALUE val)
{
    val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Qnil;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Looks… pointless? The &lt;code&gt;putnil&lt;/code&gt; instruction takes no arguments, and has a return value of &lt;code&gt;val&lt;/code&gt;. The only thing the code block does is set &lt;code&gt;val&lt;/code&gt; equal to &lt;code&gt;Qnil&lt;/code&gt;, which is a special value in CRuby representing Ruby’s &lt;code&gt;nil&lt;/code&gt;. What does that accomplish?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 &lt;code&gt;VALUE&lt;/code&gt; is a special type in CRuby that points at a Ruby object, usually located on the &lt;a href=&#34;https://byroot.github.io/ruby/json/2024/12/29/optimizing-ruby-json-part-4.html#stack-and-heap&#34;&gt;heap&lt;/a&gt;. When you see &lt;code&gt;VALUE&lt;/code&gt;, this often means we’re looking at a value you’d use in a Ruby program.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This file is compiled into regular C code, and the context of this simple instruction becomes clearer:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// vm.inc
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;/* insn putnil()()(val) */&lt;/span&gt;
INSN_ENTRY(putnil)
{
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  VALUE val;
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#   define NAME_OF_CURRENT_INSN putnil
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#   line 331 &amp;#34;../insns.def&amp;#34;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;{
  val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Qnil;
}
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  INC_SP(INSN_ATTR(sp_inc));
  TOPN(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; val;
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;The return value &lt;code&gt;VALUE val&lt;/code&gt; is declared&lt;/li&gt;
&lt;li&gt;It&amp;rsquo;s set to &lt;code&gt;val = Qnil&lt;/code&gt;, the instruction we saw in &lt;code&gt;insns.def&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;INC_SP&lt;/code&gt; is called, which I believe &amp;ldquo;increments&amp;rdquo; the &amp;ldquo;stack pointer&amp;rdquo;, giving us extra space on the stack to push onto?&lt;/li&gt;
&lt;li&gt;&lt;code&gt;TOPN(0) = val&lt;/code&gt; sets &lt;code&gt;val&lt;/code&gt; to the top of the stack (see the toy sketch below)&lt;/li&gt;
&lt;/ul&gt;
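&lt;p&gt;As a toy mental model (hedged - the real VM pushes onto a C value stack via the stack pointer, not a Ruby array), the effect is something like:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;# Toy Ruby model of the compiled putnil - illustrative only
stack = []      # the VM value stack
val = nil       # val = Qnil;
stack.push(val) # INC_SP makes room, TOPN(0) = val writes the slot
p stack.last    # =&gt; nil, ready for the next instruction to pop
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;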
&lt;p&gt;I think I’ll dig more into that next time. But let&amp;rsquo;s get back to the task at hand - it’s time to try and get our &lt;code&gt;respond_to?&lt;/code&gt; call replaced with &lt;code&gt;opt_respond_to&lt;/code&gt;!&lt;/p&gt;
&lt;h3 id=&#34;adding-to-the-dsl&#34;&gt;Adding to the DSL&lt;/h3&gt;
&lt;p&gt;It took me a bit of banging my head against a wall, but here is the working instruction and specialization, in a basic form:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// insns.def
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;DEFINE_INSN
&lt;span style=&#34;color:#a6e22e&#34;&gt;opt_respond_to&lt;/span&gt;
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
    val &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; vm_opt_respond_to(recv, mid);
    CALL_SIMPLE_METHOD();
}

&lt;span style=&#34;color:#75715e&#34;&gt;// compile.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (IS_INSN_ID(iobj, send)) {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ci &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;);
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);

  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idRespond_to) {
      iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_respond_to);
      iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;;
      iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; compile_data_calloc2(iseq, iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size, &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(VALUE));
      iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (VALUE)ci;
  }
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;If we dump the instructions, we finally see our new instruction &lt;code&gt;opt_respond_to&lt;/code&gt;. It&amp;rsquo;s not really doing anything yet, but it&amp;rsquo;s there!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)

&lt;span style=&#34;color:#75715e&#34;&gt;# &amp;gt; RUNOPT0=--dump=insns make run&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# == disasm: #&amp;lt;ISeq:&amp;lt;main&amp;gt;./test.rb:1 (1,0)-(1,76)&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0000 getglobal                :$stdout                  (   1)[Li]&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0002 putobject                :write&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0004 opt_respond_to           &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0006 branchunless             14&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0008 putself&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0009 putchilledstring         &amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0011 opt_send_without_block   &amp;lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0013 leave&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0014 putnil&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 0015 leave&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Sorry to just dump the code here and split. We&amp;rsquo;ll dig into it more and expand on it next time! There&amp;rsquo;s lots more to do and explain, but I&amp;rsquo;m excited about this milestone! See you next time! 👋🏼&lt;/p&gt;
&lt;p&gt;PS - since something is working now, I&amp;rsquo;ve pushed up my basic code so far, &lt;a href=&#34;https://github.com/jpcamara/ruby/commit/5688434f63b0456e0250796b58efbfb00ade916c&#34;&gt;here&lt;/a&gt;.&lt;/p&gt;
</description>
<source:markdown>In [Peephole optimizations: adding `opt_respond_to` to the Ruby VM, part 4](https://jpcamara.com/2024/12/27/peephole-optimizations-adding-optrespondto-to.html), we dug deep. We found the connection between prism compilation and the specialization we need for our new bytecode, called “peephole optimization”. We learned how to debug and step through C code in the Ruby runtime, and we added some logic for matching the “pattern” of the instruction we want to change.


Now that we know where the specialization needs to go and how to match what needs to be specialized - what do we actually replace it with? How do we get the virtual machine to recognize `opt_respond_to`?


### Pattern matching bytecode instructions
`opt_ary_freeze` has been a great learning tool - let’s see what it teaches us about adding a new instruction name.


Here’s a refresher on how `iseq_peephole_optimize` matches on `newarray`, and then replaces it with `opt_ary_freeze`:


```c
if (
  IS_INSN_ID(iobj, newarray)
) {
  LINK_ELEMENT *next = iobj-&gt;link.next;
  if (
    IS_INSN(next) &amp;&amp;
    IS_INSN_ID(next, send)
  ) {
    //... more if statements
    iobj-&gt;insn_id = BIN(opt_ary_freeze);
```
- It first checks if the instruction id of `iobj` is `newarray`
- It grabs the next element and checks if it’s an instruction, then checks if the instruction is `send` (it also checks if the method id is “freeze”, not shown above)
- If those checks match, it replaces the instruction id with `opt_ary_freeze`


That’s pretty reasonable to follow. But how do `IS_INSN` and `IS_INSN_ID` work? What is `BIN`? What type is `opt_ary_freeze` - where is it defined? How do we add new instructions ourselves?


### Macros and enums
`BIN`, `IS_INSN` and `IS_INSN_ID` are all C macros that revolve around interacting with virtual machine instructions.


Macros in C get embedded directly into your code in a preprocessing step before being compiled, so you can write things that look pretty odd compared to a typical C-like syntax. Here’s the definition for `BIN`:


```c
#define BIN(n) YARVINSN_##n
```
&gt; 📝 `BIN` probably stands for “Binary INstruction”


That `##` is kind of like string interpolation, but the result is a static part of your actual code. This means that anywhere `BIN` is called, it’s kind of like saying `YARVINSN_#{n}` in Ruby. So this code:


```c
iobj-&gt;insn_id = BIN(opt_ary_freeze);
```
Gets transformed into this, right before the program is compiled:


```c
iobj-&gt;insn_id = YARVINSN_opt_ary_freeze;
```
Here’s the definition for `IS_INSN_ID`:


```c
#define IS_INSN_ID(iobj, insn) (INSN_OF(iobj) == BIN(insn))
```
Based on our understanding of macros and `BIN` so far, it gets transformed into:


```c
#define IS_INSN_ID(iobj, insn) (INSN_OF(iobj) == YARVINSN_##insn)
```
Here’s the definition for `INSN_OF` - it just casts `insn` to an `INSN` pointer and accesses its instruction id:


```c
#define INSN_OF(insn) \
  (((INSN*)(insn))-&gt;insn_id)
```
That means the expanded version of `IS_INSN_ID` is:


```c
#define IS_INSN_ID(iobj, insn) \
  (((INSN*)(insn))-&gt;insn_id == YARVINSN_##insn)
```
Here’s the definition for `IS_INSN`:


```c
#define IS_INSN(link) ((link)-&gt;type == ISEQ_ELEMENT_INSN)
```
If we combine it all together and manually inline it, here’s what our original pattern matching code looks like:


```c
if (
  (((INSN*)(iobj))-&gt;insn_id == YARVINSN_newarray)
) {
  LINK_ELEMENT *next = iobj-&gt;link.next;
  if (
    (next)-&gt;type == ISEQ_ELEMENT_INSN &amp;&amp;
    (((INSN*)(next))-&gt;insn_id == YARVINSN_send)
  ) {
    //... more if statements
    iobj-&gt;insn_id = YARVINSN_opt_ary_freeze;
```
I’m glad CRuby adds those macros… this expanded code is a lot less readable.


Why did I expand it and make that original code less clear? I wanted to think through what the instructions _really_ look like at runtime, and why. The reason I can infer is that all of these macros let you focus on a syntax that looks just like our VM instructions, while making sure there are no name collisions behind the scenes.


Ok, I still haven’t shown where the instruction comes from. Here’s the file where you can actually find an `enum` containing the entire list of VM instructions:


```c
// insns.inc
enum ruby_vminsn_type {
  BIN(nop),
  BIN(getlocal),
  //...
  BIN(opt_ary_freeze),
  //...
}

// or, expanded by the preprocessor!
enum ruby_vminsn_type {
  YARVINSN_nop,
  YARVINSN_getlocal,
  //...
  YARVINSN_opt_ary_freeze,
  //...
}
```
`insns.inc` gets included anywhere we need instruction checks, like in `compile.c`, making these `enum` values available throughout. Thanks to `BIN` prepending all of their names with `YARVINSN_`, we can use them in a convenient syntax without having any collisions.
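

To convince myself the token pasting really works like that, here’s a tiny standalone C program (my own sketch - not CRuby code) that mimics `BIN` and the enum:


```c
#include &lt;stdio.h&gt;

// the same trick as CRuby: paste a prefix onto the name at preprocessing time
#define BIN(n) YARVINSN_##n

enum ruby_vminsn_type {
    BIN(nop),     // expands to YARVINSN_nop (0)
    BIN(putnil),  // expands to YARVINSN_putnil (1)
};

int main(void) {
    enum ruby_vminsn_type insn = BIN(putnil);
    printf(&#34;putnil is instruction #%d\n&#34;, insn); // =&gt; putnil is instruction #1
    return 0;
}
```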


So if I search the CRuby repo for `insns.inc`, where can I find it? Hmmm, I can’t 🤔. `insns.inc` is a generated file! I can only see it locally, after compiling the entire project. Where does that file get generated from?


### A virtual machine DSL
While `insns.inc` tells us the name of each instruction available, the file it is generated from defines every instruction available in the Ruby virtual machine, _and_ how it should respond to that instruction. It’s called `insns.def`. The file looks a lot like C, but it’s actually a kind of [DSL](https://en.m.wikipedia.org/wiki/Domain-specific_language).


It lets you define a simplified set of information for the instruction. That simplified format is then compiled into a more comprehensive, C-compatible version.


&gt; It’s compilers all the way down… 😵‍💫


The top of the file defines the format. I don’t _fully_ understand it, but let’s walk through it:


```c
/*
DEFINE_INSN
instruction_name
(type operand, type operand, ..)
(pop_values, ..)
(return values ..)
// attr type name contents..
{
  .. // insn body
}
*/
```
An instruction consists of:


- A name ✅
- Operands - like our “call data”. I’m not sure what other operand types are typically used for, but I know `opt_ary_freeze` puts the frozen array here as well
- Values to pop off the virtual machine stack so we can operate on them. These should be values that we’ve seen pushed onto the stack in previous instructions
- A return value
- (I don’t fully understand the value of `attr type` but it seems to influence what code gets generated by the instruction definition)
- A C-compatible body


That’s a lot, baked into a small interface. Let’s look at a very simple example. Here’s one of the simplest instructions available, `putnil`:


```c
DEFINE_INSN
putnil
()
()
(VALUE val)
{
    val = Qnil;
}
```
Looks… pointless? The `putnil` instruction takes no arguments, and has a return value of `val`. The only thing the code block does is set `val` equal to `Qnil`, which is a special value in CRuby representing Ruby’s `nil`. What does that accomplish?


&gt; 📝 `VALUE` is a special type in CRuby that points at a Ruby object, usually located on the [heap](https://byroot.github.io/ruby/json/2024/12/29/optimizing-ruby-json-part-4.html#stack-and-heap). When you see `VALUE`, this often means we’re looking at a value you’d use in a Ruby program.
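

You can actually watch `putnil` show up by dumping the instructions for the most trivial program possible - the output looks roughly like this (it can differ slightly between Ruby versions):


```rb
# &gt; ruby --dump=insns -e &#34;nil&#34;
# == disasm: #&lt;ISeq:&lt;main&gt;@-e:1 (1,0)-(1,3)&gt;
# 0000 putnil                   (   1)[Li]
# 0001 leave
```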


This file is compiled into regular C code, and the context of this simple instruction becomes clearer:


```c
// vm.inc
/* insn putnil()()(val) */
INSN_ENTRY(putnil)
{
  //...
  VALUE val;
  //...
#   define NAME_OF_CURRENT_INSN putnil
#   line 331 &#34;../insns.def&#34;
{
  val = Qnil;
}
  //...
  INC_SP(INSN_ATTR(sp_inc));
  TOPN(0) = val;
  //...
}
```
- The return value `VALUE val` is declared
- It&#39;s set to `val = Qnil`, the instruction we saw in `insns.def`
- `INC_SP` is called, which I believe &#34;increments&#34; the &#34;stack pointer&#34;, giving us extra space on the stack to push onto?
- `TOPN(0) = val` places `val` at the top of the stack (see the sketch below)
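

Here’s my mental model of those last two steps, as a standalone sketch (my own toy code - not CRuby’s actual definitions):


```c
#include &lt;stdio.h&gt;

typedef unsigned long VALUE;
#define Qnil ((VALUE)8) // CRuby also defines the real Qnil as a small tagged constant

static VALUE stack[64];
static VALUE *sp = stack;           // the VM stack pointer

#define INC_SP(n) (sp += (n))       // reserve n more slots on the stack
#define TOPN(n)   (*(sp - (n) - 1)) // the slot n places down from the top

int main(void) {
    VALUE val = Qnil;  // what the body of putnil does
    INC_SP(1);         // make room for one value...
    TOPN(0) = val;     // ...and write it into the new top slot
    printf(&#34;top of stack: %lu\n&#34;, TOPN(0)); // =&gt; top of stack: 8
    return 0;
}
```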


I think I’ll dig more into that next time. But let&#39;s get back to the task at hand - it’s time to try and get our `respond_to?` call replaced with `opt_respond_to`!


### Adding to the DSL


It took me a bit of banging my head against a wall, but here is the working instruction and specialization, in a basic form:


```c
// insns.def
DEFINE_INSN
opt_respond_to
(CALL_DATA cd)
(VALUE recv, VALUE mid)
(VALUE val)
{
    val = vm_opt_respond_to(recv, mid);
    CALL_SIMPLE_METHOD();
}

// compile.c
if (IS_INSN_ID(iobj, send)) {
  const struct rb_callinfo *ci = (struct rb_callinfo *)OPERAND_AT(iobj, 0);
  const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(iobj, 1);

  if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 1 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idRespond_to) {
      iobj-&gt;insn_id = BIN(opt_respond_to);
      iobj-&gt;operand_size = 1;
      iobj-&gt;operands = compile_data_calloc2(iseq, iobj-&gt;operand_size, sizeof(VALUE));
      iobj-&gt;operands[0] = (VALUE)ci;
  }
}
```
If we dump the instructions, we finally see our new instruction `opt_respond_to`. It&#39;s not really doing anything yet, but it&#39;s there!


```rb
puts &#34;Did you know you can write to $stdout?&#34; if $stdout.respond_to?(:write)

# &gt; RUNOPT0=--dump=insns make run
# == disasm: #&lt;ISeq:&lt;main&gt;./test.rb:1 (1,0)-(1,76)&gt;
# 0000 getglobal                :$stdout                  (   1)[Li]
# 0002 putobject                :write
# 0004 opt_respond_to           &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
# 0006 branchunless             14
# 0008 putself
# 0009 putchilledstring         &#34;Did you know you can write to $stdout?&#34;
# 0011 opt_send_without_block   &lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&gt;
# 0013 leave
# 0014 putnil
# 0015 leave
```
Sorry to just dump the code here and split. We&#39;ll dig into it more and expand on it next time! There&#39;s lots more to do and explain, but I&#39;m excited about this milestone! See you next time! 👋🏼


PS - since something is working now, I&#39;ve pushed up my basic code so far, [here](https://github.com/jpcamara/ruby/commit/5688434f63b0456e0250796b58efbfb00ade916c).

</source:markdown>
    </item>
    
    <item>
      <title>Peephole optimizations: adding `opt_respond_to` to the Ruby VM, part 4</title>
      <link>https://jpcamara.com/2024/12/27/peephole-optimizations-adding-optrespondto-to.html</link>
      <pubDate>Fri, 27 Dec 2024 23:36:19 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/27/peephole-optimizations-adding-optrespondto-to.html</guid>
      <description>&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html&#34;&gt;The Ruby Syntax Holy Grail: adding &lt;code&gt;opt_respond_to&lt;/code&gt; to the Ruby VM, part 3&lt;/a&gt;, I found what I referred to as the &amp;ldquo;Holy Grail&amp;rdquo; of Ruby syntax. I&amp;rsquo;m way overstating it, but it&amp;rsquo;s a readable, sequential way of viewing how a large portion of the Ruby syntax is compiled. Here&amp;rsquo;s a snippet of it as a reminder:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// prism_compile.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_compile_node&lt;/span&gt;(rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;node, LINK_ANCHOR &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; ret, &lt;span style=&#34;color:#66d9ef&#34;&gt;bool&lt;/span&gt; popped, pm_scope_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;scope_node)
{
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_parser_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;parser &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; scope_node&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;parser;
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;switch&lt;/span&gt; (PM_NODE_TYPE(node)) {
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_ARRAY_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// [foo, bar, baz]
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^^^^^^^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_array_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;cast &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_array_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node;
        pm_compile_array_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) cast, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;cast&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;elements, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;location, ret, popped, scope_node);
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt;;
      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_MODULE_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// module Foo; end
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The file that code lives in, &lt;code&gt;prism_compile.c&lt;/code&gt;, is &lt;em&gt;enormous&lt;/em&gt;. &lt;code&gt;pm_compile_node&lt;/code&gt; itself is 1800+ lines, and the overall file is 11 &lt;em&gt;thousand&lt;/em&gt; lines. It&amp;rsquo;s daunting to say the least, but there are some obvious directions I can ignore - I&amp;rsquo;m trying to optimize a method call to &lt;code&gt;respond_to?&lt;/code&gt;, so I can sidestep a majority of the Ruby syntax.&lt;/p&gt;
&lt;p&gt;Still, where do I go, specifically?&lt;/p&gt;
&lt;h3 id=&#34;sage-wisdom&#34;&gt;Sage wisdom&lt;/h3&gt;
&lt;p&gt;Helpfully, I got two identical sets of direction based on &lt;a href=&#34;https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html&#34;&gt;part 3&lt;/a&gt;. One from &lt;a href=&#34;https://x.com/kddnewton&#34;&gt;Kevin Newton&lt;/a&gt;, creator of &lt;a href=&#34;https://github.com/ruby/prism&#34;&gt;Prism&lt;/a&gt;:&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-26-at-12.15.03pm.png&#34; width=&#34;600&#34; height=&#34;143&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://x.com/kddnewton/status/1872280281409105925?s=46&#34;&gt;https://x.com/kddnewton/status/1872280281409105925?s=46&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And one from &lt;a href=&#34;https://bsky.app/profile/byroot.bsky.social&#34;&gt;byroot&lt;/a&gt;, who inspired this whole series:&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-26-at-12.24.24pm.png&#34; width=&#34;600&#34; height=&#34;211&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://bsky.app/profile/byroot.bsky.social/post/3le6xypzykc2x&#34;&gt;https://bsky.app/profile/byroot.bsky.social/post/3le6xypzykc2x&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I don&amp;rsquo;t want to jump to conclusions, but I think I need to look at the peephole optimizer 😆.&lt;/p&gt;
&lt;p&gt;And exactly what &lt;em&gt;is&lt;/em&gt; a &amp;ldquo;peephole optimizer&amp;rdquo;? Kevin described the process as &amp;ldquo;specialization comes after compilation&amp;rdquo;. From &lt;a href=&#34;https://en.wikipedia.org/wiki/Peephole_optimization&#34;&gt;Wikipedia&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Peephole optimization is an optimization technique performed on a small set of compiler-generated instructions, known as a peephole or window, that involves replacing the instructions with a logically equivalent set that has better performance.
&lt;a href=&#34;https://en.wikipedia.org/wiki/Peephole_optimization&#34;&gt;https://en.wikipedia.org/wiki/Peephole_optimization&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This seems to fit my goal pretty well. I want to replace the current &lt;code&gt;opt_send_without_block&lt;/code&gt; instruction with a specialized &lt;code&gt;opt_respond_to&lt;/code&gt; instruction, optimized for &lt;code&gt;respond_to?&lt;/code&gt; method calls.&lt;/p&gt;
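&lt;p&gt;For reference, here&amp;rsquo;s roughly what an unoptimized &lt;code&gt;respond_to?&lt;/code&gt; call compiles to today - this is a sketch from memory, so your exact output may differ:&lt;/p&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code&gt;# &amp;gt; ruby --dump=insns -e &amp;#39;$stdout.respond_to?(:write)&amp;#39;
0000 getglobal                    :$stdout                  (   1)[Li]
0002 putobject                    :write
0004 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
0006 leave
&lt;/code&gt;&lt;/pre&gt;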
&lt;h3 id=&#34;finding-the-optimizer&#34;&gt;Finding the optimizer&lt;/h3&gt;
&lt;p&gt;So where are peephole optimizations happening in CRuby today? In &lt;a href=&#34;https://github.com/etiennebarrie&#34;&gt;Étienne&lt;/a&gt;&amp;rsquo;s &lt;a href=&#34;https://github.com/ruby/ruby/pull/11406&#34;&gt;PR&lt;/a&gt;, he added optimization code to a function called&amp;hellip; &lt;code&gt;iseq_peephole_optimize&lt;/code&gt;. A little on the nose, don&amp;rsquo;t you think? Kevin&amp;rsquo;s comment &lt;em&gt;also&lt;/em&gt; mentioned &lt;code&gt;iseq_peephole_optimize&lt;/code&gt; - seems like the winner.&lt;/p&gt;
&lt;p&gt;I want to make the link between &lt;code&gt;iseq_peephole_optimize&lt;/code&gt; and where we left off at &lt;code&gt;pm_compile_node&lt;/code&gt;. Let&amp;rsquo;s dig into some code!&lt;/p&gt;
&lt;h3 id=&#34;disassembling-an-existing-optimization&#34;&gt;Disassembling an existing optimization&lt;/h3&gt;
&lt;p&gt;I&amp;rsquo;m going to use Étienne&amp;rsquo;s frozen array optimization to get to the optimizer and see how it relates. If you want to follow along, start with the setup instructions from &lt;a href=&#34;https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html#getting-your-own-environment-setup&#34;&gt;part 3&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;His optimization only applies to array and hash literals being frozen. So we&amp;rsquo;ll write a teensy Ruby program to demonstrate, and put it in &lt;code&gt;test.rb&lt;/code&gt; at the root of our CRuby project:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# test.rb&lt;/span&gt;
pp &lt;span style=&#34;color:#f92672&#34;&gt;[].&lt;/span&gt;freeze
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The best way to run &lt;code&gt;test.rb&lt;/code&gt; here is to use &lt;code&gt;make&lt;/code&gt;. It will not only run the file, but also make sure things like C files get recompiled as necessary when you make changes. Let&amp;rsquo;s run our file, but dump the instructions it would generate for the Ruby VM:&lt;/p&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code&gt;RUNOPT0=--dump=insns make runruby
&lt;/code&gt;&lt;/pre&gt;&lt;p&gt;&lt;code&gt;RUNOPT0&lt;/code&gt; lets us add an option to the &lt;code&gt;ruby&lt;/code&gt; call, so it&amp;rsquo;s &lt;em&gt;effectively&lt;/em&gt; &lt;code&gt;ruby --dump=insns test.rb&lt;/code&gt;. Here are the instructions we see - we can confirm that we are getting the optimized &lt;code&gt;opt_ary_freeze&lt;/code&gt; instruction from Étienne&amp;rsquo;s PR:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;== disasm: #&amp;lt;ISeq:&amp;lt;main&amp;gt;./test.rb:3 (3,0)-(3,12)&amp;gt;
0000 putself                      (   3)[Li]
0001 opt_ary_freeze               [], &amp;lt;calldata!mid:freeze, argc:0, ARGS_SIMPLE&amp;gt;
0004 opt_send_without_block       &amp;lt;calldata!mid:pp, argc:1, FCALL|ARGS_SIMPLE&amp;gt;
0006 leave
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You never know what code is truly doing until you run it. So far, I&amp;rsquo;ve just been reading and navigating the CRuby source. &lt;code&gt;iseq_peephole_optimize&lt;/code&gt; lives in &lt;code&gt;compile.c&lt;/code&gt; - let&amp;rsquo;s set a breakpoint and take a look 🕵🏼‍♂️.&lt;/p&gt;
&lt;h3 id=&#34;using-the-debugger&#34;&gt;Using the debugger&lt;/h3&gt;
&lt;p&gt;We can debug C code in CRuby &lt;em&gt;almost&lt;/em&gt; as easily as we can use a &lt;code&gt;debugger&lt;/code&gt;/&lt;code&gt;binding.pry&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;For MacOS, you can use &lt;a href=&#34;https://lldb.llvm.org/&#34;&gt;&lt;code&gt;lldb&lt;/code&gt;&lt;/a&gt;, and for Docker/Linux, you can use &lt;a href=&#34;https://sourceware.org/gdb/&#34;&gt;&lt;code&gt;gdb&lt;/code&gt;&lt;/a&gt;. I&amp;rsquo;m going to do everything in &lt;code&gt;lldb&lt;/code&gt; to start, but I&amp;rsquo;ll show some equivalent commands for &lt;code&gt;gdb&lt;/code&gt; after.&lt;/p&gt;
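&lt;p&gt;If you&amp;rsquo;re on Linux and want a head start, the rough &lt;code&gt;gdb&lt;/code&gt; equivalents of the &lt;code&gt;lldb&lt;/code&gt; session below look something like this (a quick sketch - double-check against your own setup):&lt;/p&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code&gt;&amp;gt; make gdb-ruby
(gdb) break compile.c:3476
(gdb) run
(gdb) bt
&lt;/code&gt;&lt;/pre&gt;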
&lt;p&gt;Let&amp;rsquo;s start by looking at the peephole optimization code for &lt;code&gt;[].freeze&lt;/code&gt;, inside of &lt;code&gt;iseq_peephole_optimize&lt;/code&gt;. I&amp;rsquo;ll add comments above each line to explain what I think it&amp;rsquo;s doing:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// compile.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;iseq_peephole_optimize&lt;/span&gt;(rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq, LINK_ELEMENT &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;list, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; do_tailcallopt)
{
         &lt;span style=&#34;color:#75715e&#34;&gt;// ...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;         &lt;span style=&#34;color:#75715e&#34;&gt;// if the instruction is a `newarray` of zero length
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3469&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (IS_INSN_ID(iobj, newarray) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; INT2FIX(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)) {
             &lt;span style=&#34;color:#75715e&#34;&gt;// grab the next element after the current instruction
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3470&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;        LINK_ELEMENT &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;next &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;link.next;
             &lt;span style=&#34;color:#75715e&#34;&gt;// if `next` is an instruction, and the instruction is `send`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3471&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;        &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (IS_INSN(next) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; (IS_INSN_ID(next, send))) {
&lt;span style=&#34;color:#ae81ff&#34;&gt;3472&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;            &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ci &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(next, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;);
&lt;span style=&#34;color:#ae81ff&#34;&gt;3473&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;            &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(next, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);
&lt;span style=&#34;color:#ae81ff&#34;&gt;3474&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;
                 &lt;span style=&#34;color:#75715e&#34;&gt;// if the callinfo is &amp;#34;simple&amp;#34;, with zero arguments,
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;                 &lt;span style=&#34;color:#75715e&#34;&gt;// and there isn&amp;#39;t a block provided(?), and the method id (mid) is `freeze`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;                 &lt;span style=&#34;color:#75715e&#34;&gt;// which is represented by `idFreeze`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3475&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;            &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idFreeze) {
                     &lt;span style=&#34;color:#75715e&#34;&gt;// change the instruction to `opt_ary_freeze`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;                iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
                     &lt;span style=&#34;color:#75715e&#34;&gt;// remove the `send` instruction, we don&amp;#39;t need it anymore
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3481&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;                ELEM_REMOVE(next);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Now I&amp;rsquo;ll use &lt;code&gt;lldb&lt;/code&gt; to see where this code runs in relation to our prism compilation. In CRuby, to debug you run &lt;code&gt;make lldb-ruby&lt;/code&gt; instead of &lt;code&gt;make runruby&lt;/code&gt;. You&amp;rsquo;ll see some setup code run, and then you&amp;rsquo;ll be left at a prompt, prefixed by &lt;code&gt;(lldb)&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;&amp;gt; make lldb-ruby
lldb  -o &amp;#39;command script import -r ../misc/lldb_cruby.py&amp;#39; ruby --  ../test.rb
(lldb) target create &amp;#34;ruby&amp;#34;
Current executable set to &amp;#39;/Users/johncamara/Projects/ruby/build/ruby&amp;#39; (arm64).
(lldb) settings set -- target.run-args  &amp;#34;../test.rb&amp;#34;
(lldb) command script import -r ../misc/lldb_cruby.py
lldb scripts for ruby has been installed.
(lldb)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;At this point, we haven&amp;rsquo;t actually run anything. We can now set our breakpoint, then run the program. I&amp;rsquo;ll add a breakpoint right after all &lt;code&gt;if&lt;/code&gt; statements have succeeded:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;(lldb) break set --file compile.c --line 3476
Breakpoint 1: where = ruby`iseq_peephole_optimize + 2276 at compile.c:3476:17
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;With our breakpoint set, we call &lt;code&gt;run&lt;/code&gt; to run the program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;(lldb) run
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You&amp;rsquo;ll see something like the following. It ran the program until it hit our breakpoint, right after identifying a frozen array literal:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) run
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;50923&lt;/span&gt; launched: &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;ruby&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;build&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt; (arm64)
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;50923&lt;/span&gt; stopped
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;1.1&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3473&lt;/span&gt;             &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(next, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3474&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3475&lt;/span&gt;             &lt;span style=&#34;color:#a6e22e&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idFreeze) {
&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3477&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3478&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; compile_data_calloc2(iseq, iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size, &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(VALUE));
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3479&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_cArray_empty_frozen;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I want to see where we are in relation to all our prism compilation code. We can use &lt;code&gt;bt&lt;/code&gt; to get the backtrace:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) bt
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;1.1&lt;/span&gt;
  &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;29&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;4352&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_setup_insn(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1619&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;10139&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt_try(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1029&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_protect(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1033&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;18&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;6&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1082&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;7&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_new_child_iseq(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1271&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;27&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;8&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;9458&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;40&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;9&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;9911&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_compile_scope_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;6598&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;13&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;11&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;9784&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;9&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;10122&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;9&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;13&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt_try(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1029&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;14&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_protect(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1033&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;18&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;15&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1082&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_top(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;906&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;load_iseq_eval(...) at load.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;756&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;24&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;18&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;require_internal(...) at load.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1296&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;21&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;19&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_require_string_internal(...) at load.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1402&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;22&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;20&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_require_string(...) at load.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1388&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;21&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_f_require(...) at load.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1029&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;22&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ractor_safe_call_cfunc_1(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3624&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;23&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_cfunc_with_frame_(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3801&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;11&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;24&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_cfunc_with_frame(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3847&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;25&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_cfunc_other(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3873&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;26&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_cfunc(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3955&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;27&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_method_each_type(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;4779&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;28&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_method(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;4916&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;20&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;29&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_call_general(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;4949&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;30&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_sendish(...) at vm_insnhelper.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;5968&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;15&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;31&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;vm_exec_core(...) at insns.def:&lt;span style=&#34;color:#ae81ff&#34;&gt;898&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;11&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;32&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_vm_exec(...) at vm.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2595&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;22&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;33&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_iseq_eval(...) at vm.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2850&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;11&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;34&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_load_with_builtin_functions(...) at builtin.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;54&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;35&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;Init_builtin_features at builtin.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;74&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;36&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_init_prelude at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1750&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;37&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_opt_init(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1811&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;38&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;prism_script(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2215&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;13&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;39&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2538&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;9&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;40&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3169&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;41&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_options(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;117&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;42&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;43&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;26&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;43&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;68&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Whoa. That thing is huge! This is not the backtrace I was expecting! Seems like I missed a codepath in my earlier explorations. I got it right, up until &lt;code&gt;prism_script&lt;/code&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;main&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;which calls &lt;code&gt;rb_main&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;which calls &lt;code&gt;ruby_options&lt;/code&gt;, then &lt;code&gt;ruby_process_options&lt;/code&gt;, then &lt;code&gt;process_options&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;which calls &lt;code&gt;prism_script&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;The next call I expected was &lt;code&gt;pm_iseq_new_main&lt;/code&gt;, but instead we head into &lt;code&gt;ruby_opt_init&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;which calls &lt;code&gt;Init_builtin_features&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This path seems to go through some gem preloading logic, which is why we see the &lt;code&gt;rb_require&lt;/code&gt; calls:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;Init_builtin_features&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;)
{
    rb_load_with_builtin_functions(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;gem_prelude&amp;#34;&lt;/span&gt;, NULL);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;By default CRuby loads &lt;code&gt;gem_prelude&lt;/code&gt;, which lives in &lt;code&gt;ruby/gem_prelude.rb&lt;/code&gt;. Here&amp;rsquo;s that file, shortened for brevity:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;rubygems&amp;#39;&lt;/span&gt;
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;error_highlight&amp;#39;&lt;/span&gt;
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;did_you_mean&amp;#39;&lt;/span&gt;
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;syntax_suggest/core_ext&amp;#39;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;compiling-on-the-fly&#34;&gt;Compiling on-the-fly&lt;/h3&gt;
&lt;p&gt;There&amp;rsquo;s something I&amp;rsquo;ve learned here that seems obvious in hindsight, but I hadn&amp;rsquo;t considered. Ruby will only compile what is actually &lt;em&gt;loaded&lt;/em&gt;, and only at the point it gets loaded. If I never load a particular piece of code, it never gets compiled. Or if I defer loading it until later, it does not get compiled until later.&lt;/p&gt;
&lt;p&gt;We can actually demonstrate this by deferring a require:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;

require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;net/http&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;If we run this using &lt;code&gt;make lldb-ruby&lt;/code&gt;, we can see the delayed compilation in action:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;(lldb) break set --file ruby.c --line 2616
(lldb) run
// hits our prism compile code
(lldb) next
(lldb) break set --file compile.c --line 3476
(lldb) continue
// waits 10 seconds, then compiles the contents of &amp;#34;net/http&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;getting-to-our-testrb-file&#34;&gt;Getting to our test.rb file&lt;/h3&gt;
&lt;p&gt;I&amp;rsquo;d rather see just my code in &lt;code&gt;test.rb&lt;/code&gt; get compiled, so I&amp;rsquo;m going to set a breakpoint directly on &lt;code&gt;pm_iseq_new_main&lt;/code&gt;, which for me is in &lt;code&gt;ruby.c&lt;/code&gt; on line &lt;code&gt;2616&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;break&lt;/span&gt; set &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;file ruby.c &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;line &lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;
(lldb) run
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;32534&lt;/span&gt; launched: &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;ruby&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;build&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt; (arm64)
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;32534&lt;/span&gt; stopped
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;1.1&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;38&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2613&lt;/span&gt;         &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;result.ast) {
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2614&lt;/span&gt;             pm_parse_result_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;pm &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;result.prism;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2615&lt;/span&gt;             &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; error_state;
&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;             iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; pm_iseq_new_main(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;pm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;node, opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;script_name, path, parent, optimize, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;error_state);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2617&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2618&lt;/span&gt;             &lt;span style=&#34;color:#a6e22e&#34;&gt;pm_parse_result_free&lt;/span&gt;(pm);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;2619&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Now when we run the backtrace, I see what I expected, because we&amp;rsquo;ve skipped the &lt;code&gt;gem_prelude&lt;/code&gt; compilation. This is the exact flow I walked through in &lt;a href=&#34;https://jpcamara.com/2024/12/22/finding-the-compiler.html&#34;&gt;part 2&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) bt
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;1.1&lt;/span&gt;
  &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;38&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3169&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_options(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;117&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;43&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;26&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;68&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;From here, we can set our &lt;code&gt;iseq_peephole_optimize&lt;/code&gt; breakpoint and see only our specific code get compiled. Since we&amp;rsquo;re already in the running program, we call &lt;code&gt;continue&lt;/code&gt; to keep executing:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;break&lt;/span&gt; set &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;file compile.c &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;line &lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;
Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; where &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2276&lt;/span&gt; at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;continue&lt;/span&gt;
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;55336&lt;/span&gt; resuming
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;55336&lt;/span&gt; stopped
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;2.1&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize() at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3473&lt;/span&gt;             &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(next, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3474&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3475&lt;/span&gt;             &lt;span style=&#34;color:#a6e22e&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idFreeze) {
&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3477&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3478&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; compile_data_calloc2(iseq, iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operand_size, &lt;span style=&#34;color:#66d9ef&#34;&gt;sizeof&lt;/span&gt;(VALUE));
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3479&lt;/span&gt;                 iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;operands[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_cArray_empty_frozen;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;If we call &lt;code&gt;bt&lt;/code&gt; from here to get the backtrace, we finally see the connection between &lt;code&gt;prism_compile.c&lt;/code&gt; and &lt;code&gt;compile.c&lt;/code&gt;. &lt;code&gt;pm_iseq_compile_node&lt;/code&gt; calls &lt;code&gt;iseq_setup_insn&lt;/code&gt;, which runs the optimization logic. In the previous post, I saw &lt;code&gt;iseq_setup_insn&lt;/code&gt;, but I didn&amp;rsquo;t know what it meant or what it did. Now we know. This is what Kevin Newton referred to earlier: specialization comes after compilation. Prism compiles the node in the standard way, then the peephole optimization layer - the specialization - is applied after:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) bt
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;2.1&lt;/span&gt;
  &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;4352&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_setup_insn(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1619&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_compile_node(...) at prism_compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;10139&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt_try(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1029&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_protect(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1033&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;18&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;6&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_with_opt(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;1082&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;7&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;pm_iseq_new_main(...) at iseq.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;930&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;8&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;20&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;9&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_process_options(...) at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3169&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;ruby_options(...) at eval.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;117&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;16&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;11&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;rb_main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;43&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;26&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;main(...) at main.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;68&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;12&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;From here, we can inspect and see the current instruction using &lt;code&gt;expr&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) expr &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;(iobj)
(INSN) &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;$&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {
  link &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {
    type &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ISEQ_ELEMENT_INSN
    next &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x000000011f6568d0&lt;/span&gt;
    prev &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x000000011f656850&lt;/span&gt;
  }
  insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; YARVINSN_newarray
  operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
  sc_state &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0x000000011f640118&lt;/span&gt;
  insn_info &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (line_no &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, node_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;, events &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We see that &lt;code&gt;iobj&lt;/code&gt; contains a link to a subsequent instruction, as well as an &lt;code&gt;insn_id&lt;/code&gt; and some other metadata. The instruction is currently &lt;code&gt;YARVINSN_newarray&lt;/code&gt;. If we run &lt;code&gt;next&lt;/code&gt;, that should run &lt;code&gt;iobj-&amp;gt;insn_id = BIN(opt_ary_freeze);&lt;/code&gt;, and our instruction should change:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) next
(lldb) expr &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;(iobj)
(INSN) &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;$&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; YARVINSN_opt_ary_freeze
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It does! The instruction was changed from &lt;code&gt;newarray&lt;/code&gt; to &lt;code&gt;opt_ary_freeze&lt;/code&gt;! The optimization is at least partially complete (I&amp;rsquo;m not sure if more is involved, yet).&lt;/p&gt;
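&lt;p&gt;(Judging from the source we annotated earlier, the rest of the work likely happens in the lines that follow: rewriting the operands to hold &lt;code&gt;rb_cArray_empty_frozen&lt;/code&gt;, and &lt;code&gt;ELEM_REMOVE(next)&lt;/code&gt; dropping the now-redundant &lt;code&gt;send&lt;/code&gt; instruction.)&lt;/p&gt;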
&lt;h3 id=&#34;making-one-small-step-towards-opt_respond_to&#34;&gt;Making one small step towards &lt;code&gt;opt_respond_to&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;This is already the longest and densest post in the series. But I&amp;rsquo;d love to make some actual progress towards a new instruction. Let&amp;rsquo;s pattern match on &lt;code&gt;respond_to?&lt;/code&gt; in the peephole optimizer.&lt;/p&gt;
&lt;p&gt;Here is our sample program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Running this with &lt;code&gt;RUNOPT0=--dump=insns make runruby&lt;/code&gt;, we get the following instructions:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;== disasm: #&amp;lt;ISeq:&amp;lt;main&amp;gt;./test.rb:1 (1,0)-(1,76)&amp;gt;
0000 getglobal                              :$stdout                  (   1)[Li]
0002 putobject                              :write
0004 opt_send_without_block                 &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
0006 branchunless                           14
0008 putself
0009 putchilledstring                       &amp;#34;Did you know you can write to $stdout?&amp;#34;
0011 opt_send_without_block                 &amp;lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&amp;gt;
0013 leave
0014 putnil
0015 leave
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I want to match on this line:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;0004 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Here&amp;rsquo;s my attempt. I&amp;rsquo;m going to copy what the &lt;code&gt;newarray&lt;/code&gt; &lt;code&gt;freeze&lt;/code&gt; optimization is doing, and just try changing a few things to match my example. Right underneath the code we&amp;rsquo;ve been debugging for &lt;code&gt;newarray&lt;/code&gt;, I&amp;rsquo;m adding this:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// If the instruction is `send_without_block`, ie `0004 opt_send_without_block`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (IS_INSN_ID(iobj, send_without_block)) {
    &lt;span style=&#34;color:#75715e&#34;&gt;// Pull the same info the `newarray` optimization does
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ci &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; rb_callinfo &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;);
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);

    &lt;span style=&#34;color:#75715e&#34;&gt;// &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// 1. We have ARGS_SIMPLE, which is probably what `vm_ci_simple(ci)` checks for
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// 2. We have argc:1, which should match `vm_ci_argc(ci) == 1`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// 3. We send without a block, hence blockiseq == NULL
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// 4. The method id (mid) for `vm_ci_mid(ci)` matches `idRespond_to`. I searched around for names
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;//    that seemed similar to idFreeze, but replacing `idFreeze` with `idRespond` and found `idRespond_to`
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idRespond_to) {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;;
    }
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Now I&amp;rsquo;ll follow the same debugging as before, but I&amp;rsquo;ll add a breakpoint in &lt;code&gt;compile.c&lt;/code&gt; where I added my new code. Specifically, I&amp;rsquo;m setting a breakpoint at the &lt;code&gt;int i = 0;&lt;/code&gt; line so that I stop inside the &lt;code&gt;if&lt;/code&gt; statement:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;break&lt;/span&gt; set &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;file ruby.c &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;line &lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;
Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; where &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;process_options &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;4068&lt;/span&gt; at ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;38&lt;/span&gt;
(lldb) run
(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;break&lt;/span&gt; set &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;file compile.c &lt;span style=&#34;color:#f92672&#34;&gt;--&lt;/span&gt;line &lt;span style=&#34;color:#ae81ff&#34;&gt;3491&lt;/span&gt;
Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; where &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2536&lt;/span&gt; at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3491&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
(lldb) &lt;span style=&#34;color:#66d9ef&#34;&gt;continue&lt;/span&gt;
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;61925&lt;/span&gt; resuming
Process &lt;span style=&#34;color:#ae81ff&#34;&gt;61925&lt;/span&gt; stopped
&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;com.apple.main&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;thread&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;#39;&lt;/span&gt;, stop reason &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;2.1&lt;/span&gt;
    frame &lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;#&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;`&lt;/span&gt;iseq_peephole_optimize(...) at compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3491&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;17&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3488&lt;/span&gt;         &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)OPERAND_AT(iobj, &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;);
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3489&lt;/span&gt;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3490&lt;/span&gt;         &lt;span style=&#34;color:#a6e22e&#34;&gt;if&lt;/span&gt; (vm_ci_simple(ci) &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_argc(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; blockiseq &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; NULL &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; vm_ci_mid(ci) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; idRespond_to) {
&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3491&lt;/span&gt;             &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;;
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3492&lt;/span&gt;         }
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3493&lt;/span&gt;     }
   &lt;span style=&#34;color:#ae81ff&#34;&gt;3494&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I think it worked! It pattern matched on the characteristics of the &lt;code&gt;respond_to?&lt;/code&gt; call, and hit the breakpoint set on &lt;code&gt;int i = 0;&lt;/code&gt;. It&amp;rsquo;s a tiny step, but it&amp;rsquo;s a first step in the direction of adding the optimization.&lt;/p&gt;
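&lt;p&gt;As a peek ahead at what the eventual swap might look like, here&amp;rsquo;s a purely speculative sketch that mirrors what the &lt;code&gt;freeze&lt;/code&gt; optimization does inside its matching branch. Note that &lt;code&gt;opt_respond_to&lt;/code&gt; doesn&amp;rsquo;t exist yet - this won&amp;rsquo;t compile until a matching instruction is defined in &lt;code&gt;insns.def&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;if (vm_ci_simple(ci) &amp;amp;&amp;amp; vm_ci_argc(ci) == 1 &amp;amp;&amp;amp; blockiseq == NULL &amp;amp;&amp;amp; vm_ci_mid(ci) == idRespond_to) {
    // Hypothetical: swap in a specialized instruction, the same way
    // the freeze optimization swaps in opt_ary_freeze. This needs an
    // opt_respond_to entry in insns.def before BIN() can resolve it.
    iobj-&amp;gt;insn_id = BIN(opt_respond_to);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;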
&lt;h3 id=&#34;using-gdb&#34;&gt;Using &lt;code&gt;gdb&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;For anyone wanting to do the same work using &lt;code&gt;gdb&lt;/code&gt;, it&amp;rsquo;s pretty similar. Let&amp;rsquo;s start by creating a &lt;code&gt;breakpoints.gdb&lt;/code&gt; file in the root of your project. Similar to how we ran &lt;code&gt;lldb&lt;/code&gt;, this sets up your initial breakpoint before &lt;code&gt;run&lt;/code&gt; is called:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;break ruby.c:2616
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;When you run &lt;code&gt;make gdb-ruby&lt;/code&gt;, you can use the same backtrace command, &lt;code&gt;bt&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; make gdb&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;ruby
Thread &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;ruby&amp;#34;&lt;/span&gt; hit Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;, process_options (...) at ..&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;ruby.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;
&lt;span style=&#34;color:#ae81ff&#34;&gt;2616&lt;/span&gt;	            iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; pm_iseq_new_main(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;pm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;node, opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;script_name, path, parent, optimize, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;error_state);
(gdb) bt
&lt;span style=&#34;color:#75715e&#34;&gt;#0  process_options (...) at ../ruby.c:2616
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#1  in ruby_process_options (...) at ../ruby.c:3169
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#2  in ruby_options (...) at ../eval.c:117
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#3  in rb_main (...) at ../main.c:43
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#4  in main (...) at ../main.c:68
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;(gdb)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;From here, you can set your next breakpoint so that you can see the compilation solely for the &lt;code&gt;newarray&lt;/code&gt; instruction from our &lt;code&gt;test.rb&lt;/code&gt; program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(gdb) &lt;span style=&#34;color:#66d9ef&#34;&gt;break&lt;/span&gt; compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;
Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt; at &lt;span style=&#34;color:#ae81ff&#34;&gt;0xaaaabaa22f14&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; file ..&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;compile.c, line &lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;
(gdb) &lt;span style=&#34;color:#66d9ef&#34;&gt;continue&lt;/span&gt;
Continuing.

Thread &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;ruby&amp;#34;&lt;/span&gt; hit Breakpoint &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;, iseq_peephole_optimize (...) at ..&lt;span style=&#34;color:#f92672&#34;&gt;/&lt;/span&gt;compile.c:&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;
&lt;span style=&#34;color:#ae81ff&#34;&gt;3476&lt;/span&gt;	                iobj&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; BIN(opt_ary_freeze);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Similar to the &lt;code&gt;lldb&lt;/code&gt; command &lt;code&gt;expr&lt;/code&gt;, we can inspect the contents of locals using &lt;code&gt;p&lt;/code&gt; or &lt;code&gt;print&lt;/code&gt; in &lt;code&gt;gdb&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;(gdb) p &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;(iobj)
&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;$&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {link &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {type &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ISEQ_ELEMENT_INSN, next &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0xaaaace797ef0&lt;/span&gt;, prev &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0xaaaace797e70&lt;/span&gt;}, insn_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; YARVINSN_newarray,
  operand_size &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, sc_state &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, operands &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0xaaaace796ac8&lt;/span&gt;, insn_info &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {line_no &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, node_id &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;, events &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;}}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;finishing-up&#34;&gt;Finishing up&lt;/h3&gt;
&lt;p&gt;Ok, this went pretty long. Good on you for sticking it out with me! We&amp;rsquo;ve found the optimizer, and we&amp;rsquo;ve pattern matched our way to a &lt;code&gt;respond_to?&lt;/code&gt; call. Next, we need to add the new instruction definition and try to actually replace the &lt;code&gt;send&lt;/code&gt; with our new instruction. See you next time! 👋🏼&lt;/p&gt;
</description>
      <source:markdown>In [The Ruby Syntax Holy Grail: adding `opt_respond_to` to the Ruby VM, part 3](https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html), I found what I referred to as the &#34;Holy Grail&#34; of Ruby syntax. I&#39;m way overstating it, but it&#39;s a readable, sequential way of viewing how a large portion of the Ruby syntax is compiled. Here&#39;s a snippet of it as a reminder:


```c
// prism_compile.c
static void
pm_compile_node(rb_iseq_t *iseq, const pm_node_t *node, LINK_ANCHOR *const ret, bool popped, pm_scope_node_t *scope_node)
{
    const pm_parser_t *parser = scope_node-&gt;parser;
    //...
    switch (PM_NODE_TYPE(node)) {
      //...
      case PM_ARRAY_NODE: {
        // [foo, bar, baz]
        // ^^^^^^^^^^^^^^^
        const pm_array_node_t *cast = (const pm_array_node_t *) node;
        pm_compile_array_node(iseq, (const pm_node_t *) cast, &amp;cast-&gt;elements, &amp;location, ret, popped, scope_node);
        return;
      }
      //...
      case PM_MODULE_NODE: {
        // module Foo; end
        //...
      }
      //...
}
```
The file that code lives in, `prism_compile.c`, is _enormous_. `pm_compile_node` itself is 1800+ lines, and the overall file is 11 _thousand_ lines. It&#39;s daunting, to say the least, but there are some obvious directions I can ignore - I&#39;m trying to optimize a method call to `respond_to?`, so I can sidestep a majority of the Ruby syntax.


Still, where do I go, specifically?


### Sage wisdom


Helpfully, I got two identical sets of direction based on [part 3](https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html). One from [Kevin Newton](https://x.com/kddnewton), creator of [Prism](https://github.com/ruby/prism):


&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-26-at-12.15.03pm.png&#34; width=&#34;600&#34; height=&#34;143&#34; alt=&#34;&#34;&gt;


&gt; https://x.com/kddnewton/status/1872280281409105925?s=46


And one from [byroot](https://bsky.app/profile/byroot.bsky.social), who inspired this whole series:


&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-26-at-12.24.24pm.png&#34; width=&#34;600&#34; height=&#34;211&#34; alt=&#34;&#34;&gt;


&gt; https://bsky.app/profile/byroot.bsky.social/post/3le6xypzykc2x


I don&#39;t want to jump to conclusions, but I think I need to look at the peephole optimizer 😆.


And exactly what _is_ a &#34;peephole optimizer&#34;? Kevin described the process as &#34;specialization comes after compilation&#34;. From [Wikipedia](https://en.wikipedia.org/wiki/Peephole_optimization):


&gt; Peephole optimization is an optimization technique performed on a small set of compiler-generated instructions, known as a peephole or window, that involves replacing the instructions with a logically equivalent set that has better performance.
&gt; https://en.wikipedia.org/wiki/Peephole_optimization


This seems to fit my goal pretty well. I want to replace the current `opt_send_without_block` instruction with a specialized `opt_respond_to` instruction, optimized for `respond_to?` method calls.
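
To make the &#34;window&#34; idea concrete, here&#39;s a toy sketch - not CRuby code, just an illustration of sliding a two-instruction peephole over a stream and fusing a known pattern into a single specialized instruction:


```c
#include &lt;stdio.h&gt;

// Toy opcodes, purely illustrative - not CRuby&#39;s real instruction set
enum insn { NEWARRAY0, SEND_FREEZE, OPT_ARY_FREEZE, LEAVE, END };
static const char *names[] = {
    &#34;newarray0&#34;, &#34;send(:freeze)&#34;, &#34;opt_ary_freeze&#34;, &#34;leave&#34;, &#34;end&#34;
};

// Slide a two-instruction window over the stream. When it matches
// newarray0 + send(:freeze), fuse the pair into one instruction
// and close the gap left behind.
static void peephole(enum insn *code) {
    for (int i = 0; code[i] != END; i++) {
        if (code[i] == NEWARRAY0 &amp;&amp; code[i + 1] == SEND_FREEZE) {
            code[i] = OPT_ARY_FREEZE;
            for (int j = i + 1; code[j] != END; j++)
                code[j] = code[j + 1];
        }
    }
}

int main(void) {
    enum insn code[] = { NEWARRAY0, SEND_FREEZE, LEAVE, END };
    peephole(code);
    for (int i = 0; code[i] != END; i++)
        puts(names[code[i]]); // =&gt; opt_ary_freeze, leave
    return 0;
}
```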


### Finding the optimizer


So where are peephole optimizations happening in CRuby today? In [Étienne](https://github.com/etiennebarrie)&#39;s [PR](https://github.com/ruby/ruby/pull/11406), he added optimization code to a function called... `iseq_peephole_optimize`. A little on the nose, don&#39;t you think? Kevin&#39;s comment _also_ mentioned `iseq_peephole_optimize` - seems like the winner.


I want to make the link between `iseq_peephole_optimize` and where we left off at `pm_compile_node`. Let&#39;s dig into some code!


### Disassembling an existing optimization


I&#39;m going to use Étienne&#39;s frozen array optimization to get to the optimizer and see how it relates. If you want to follow along, start with the setup instructions from [part 3](https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html#getting-your-own-environment-setup).


His optimization only applies to frozen array and hash literals. So we&#39;ll write a teensy Ruby program to demonstrate, and put it in `test.rb` at the root of our CRuby project:


```rb
# test.rb
pp [].freeze
```
The best way to run `test.rb` here is to use `make`. It will not only run the file, but also make sure things like C files get recompiled as necessary when you make changes. Let&#39;s run our file, but dump the instructions it would generate for the Ruby VM:


```
RUNOPT0=--dump=insns make runruby
```
`RUNOPT0` lets us add an option to the `ruby` call, so it&#39;s _effectively_ `ruby --dump=insns test.rb`. Here are the instructions we see - we can confirm that we are getting the optimized `opt_ary_freeze` instruction from Étienne&#39;s PR:


```text
== disasm: #&lt;ISeq:&lt;main&gt;./test.rb:3 (3,0)-(3,12)&gt;
0000 putself                      (   3)[Li]
0001 opt_ary_freeze               [], &lt;calldata!mid:freeze, argc:0, ARGS_SIMPLE&gt;
0004 opt_send_without_block       &lt;calldata!mid:pp, argc:1, FCALL|ARGS_SIMPLE&gt;
0006 leave
```

You never know what code is truly doing until you run it. So far, I&#39;ve just been reading and navigating the CRuby source. `iseq_peephole_optimize` lives in `compile.c` - let&#39;s set a breakpoint and take a look 🕵🏼‍♂️.


### Using the debugger


We can debug C code in CRuby _almost_ as easily as we can debug Ruby code with a `debugger`/`binding.pry`.


For MacOS, you can use [`lldb`](https://lldb.llvm.org/), and for Docker/Linux, you can use [`gdb`](https://sourceware.org/gdb/). I&#39;m going to do everything in `lldb` to start, but I&#39;ll show some equivalent commands for `gdb` after.


Let&#39;s start by looking at the peephole optimization code for `[].freeze`, inside of `iseq_peephole_optimize`. I&#39;ll add comments above each line to explain what I think it&#39;s doing:


```c
// compile.c
static int
iseq_peephole_optimize(rb_iseq_t *iseq, LINK_ELEMENT *list, const int do_tailcallopt)
{
         // ...
         // if the instruction is a `newarray` of zero length
3469:    if (IS_INSN_ID(iobj, newarray) &amp;&amp; iobj-&gt;operands[0] == INT2FIX(0)) {
             // grab the next element after the current instruction
3470:        LINK_ELEMENT *next = iobj-&gt;link.next;
             // if `next` is an instruction, and the instruction is `send`
3471:        if (IS_INSN(next) &amp;&amp; (IS_INSN_ID(next, send))) {
3472:            const struct rb_callinfo *ci = (struct rb_callinfo *)OPERAND_AT(next, 0);
3473:            const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(next, 1);
3474:
                 // if the callinfo is &#34;simple&#34;, with zero arguments,
                 // and there isn&#39;t a block provided(?), and the method id (mid) is `freeze`
                 // which is represented by `idFreeze`
3475:            if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 0 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idFreeze) {
                     // change the instruction to `opt_ary_freeze`
3476:                iobj-&gt;insn_id = BIN(opt_ary_freeze);
                     // remove the `send` instruction, we don&#39;t need it anymore
3481:                ELEM_REMOVE(next);
```
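Before running anything, it&#39;s worth pausing on what this optimization means at the Ruby level. As we&#39;ll see in the debugger shortly, the rewritten instruction carries a shared `rb_cArray_empty_frozen` object - so if I&#39;m reading this right, every `[].freeze` should hand back the very same array, with no new allocation:


```rb
a = [].freeze
b = [].freeze

# Both calls should return the shared, frozen, empty array
pp a.equal?(b) # =&gt; true
```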
Now I&#39;ll use `lldb` to see where this code runs in relation to our prism compilation. In CRuby, to debug you run `make lldb-ruby` instead of `make runruby`. You&#39;ll see some setup code run, and then you&#39;ll be left at a prompt, prefixed by `(lldb)`:


```text
&gt; make lldb-ruby
lldb  -o &#39;command script import -r ../misc/lldb_cruby.py&#39; ruby --  ../test.rb
(lldb) target create &#34;ruby&#34;
Current executable set to &#39;/Users/johncamara/Projects/ruby/build/ruby&#39; (arm64).
(lldb) settings set -- target.run-args  &#34;../test.rb&#34;
(lldb) command script import -r ../misc/lldb_cruby.py
lldb scripts for ruby has been installed.
(lldb)
```
At this point, we haven&#39;t actually run anything. We can now set our breakpoint, then run the program. I&#39;ll add a breakpoint on the line that runs once all of the `if` checks have succeeded:


```text
(lldb) break set --file compile.c --line 3476
Breakpoint 1: where = ruby`iseq_peephole_optimize + 2276 at compile.c:3476:17
```
With our breakpoint set, we call `run` to run the program:


```text
(lldb) run
```
You&#39;ll see something like the following. It ran the program until it hit our breakpoint, right after identifying a frozen array literal:


```c
(lldb) run
Process 50923 launched: &#39;/ruby/build/ruby&#39; (arm64)
Process 50923 stopped
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 1.1
    frame #0: ruby`iseq_peephole_optimize(...) at compile.c:3476:17
   3473             const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(next, 1);
   3474
   3475             if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 0 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idFreeze) {
-&gt; 3476                 iobj-&gt;insn_id = BIN(opt_ary_freeze);
   3477                 iobj-&gt;operand_size = 2;
   3478                 iobj-&gt;operands = compile_data_calloc2(iseq, iobj-&gt;operand_size, sizeof(VALUE));
   3479                 iobj-&gt;operands[0] = rb_cArray_empty_frozen;
```
I want to see where we are in relation to all our prism compilation code. We can use `bt` to get the backtrace:


```c
(lldb) bt
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 1.1
  * frame #0: ruby`iseq_peephole_optimize(...) at compile.c:3476:29
    frame #1: ruby`iseq_optimize(...) at compile.c:4352:17
    frame #2: ruby`iseq_setup_insn(...) at compile.c:1619:5
    frame #3: ruby`pm_iseq_compile_node(...) at prism_compile.c:10139:5
    frame #4: ruby`pm_iseq_new_with_opt_try(...) at iseq.c:1029:5
    frame #5: ruby`rb_protect(...) at eval.c:1033:18
    frame #6: ruby`pm_iseq_new_with_opt(...) at iseq.c:1082:5
    frame #7: ruby`pm_new_child_iseq(...) at prism_compile.c:1271:27
    frame #8: ruby`pm_compile_node(...) at prism_compile.c:9458:40
    frame #9: ruby`pm_compile_node(...) at prism_compile.c:9911:17
    frame #10: ruby`pm_compile_scope_node(...) at prism_compile.c:6598:13
    frame #11: ruby`pm_compile_node(...) at prism_compile.c:9784:9
    frame #12: ruby`pm_iseq_compile_node(...) at prism_compile.c:10122:9
    frame #13: ruby`pm_iseq_new_with_opt_try(...) at iseq.c:1029:5
    frame #14: ruby`rb_protect(...) at eval.c:1033:18
    frame #15: ruby`pm_iseq_new_with_opt(...) at iseq.c:1082:5
    frame #16: ruby`pm_iseq_new_top(...) at iseq.c:906:12
    frame #17: ruby`load_iseq_eval(...) at load.c:756:24
    frame #18: ruby`require_internal(...) at load.c:1296:21
    frame #19: ruby`rb_require_string_internal(...) at load.c:1402:22
    frame #20: ruby`rb_require_string(...) at load.c:1388:12
    frame #21: ruby`rb_f_require(...) at load.c:1029:12
    frame #22: ruby`ractor_safe_call_cfunc_1(...) at vm_insnhelper.c:3624:12
    frame #23: ruby`vm_call_cfunc_with_frame_(...) at vm_insnhelper.c:3801:11
    frame #24: ruby`vm_call_cfunc_with_frame(...) at vm_insnhelper.c:3847:12
    frame #25: ruby`vm_call_cfunc_other(...) at vm_insnhelper.c:3873:16
    frame #26: ruby`vm_call_cfunc(...) at vm_insnhelper.c:3955:12
    frame #27: ruby`vm_call_method_each_type(...) at vm_insnhelper.c:4779:16
    frame #28: ruby`vm_call_method(...) at vm_insnhelper.c:4916:20
    frame #29: ruby`vm_call_general(...) at vm_insnhelper.c:4949:12
    frame #30: ruby`vm_sendish(...) at vm_insnhelper.c:5968:15
    frame #31: ruby`vm_exec_core(...) at insns.def:898:11
    frame #32: ruby`rb_vm_exec(...) at vm.c:2595:22
    frame #33: ruby`rb_iseq_eval(...) at vm.c:2850:11
    frame #34: ruby`rb_load_with_builtin_functions(...) at builtin.c:54:5
    frame #35: ruby`Init_builtin_features at builtin.c:74:5
    frame #36: ruby`ruby_init_prelude at ruby.c:1750:5
    frame #37: ruby`ruby_opt_init(...) at ruby.c:1811:5
    frame #38: ruby`prism_script(...) at ruby.c:2215:13
    frame #39: ruby`process_options(...) at ruby.c:2538:9
    frame #40: ruby`ruby_process_options(...) at ruby.c:3169:12
    frame #41: ruby`ruby_options(...) at eval.c:117:16
    frame #42: ruby`rb_main(...) at main.c:43:26
    frame #43: ruby`main(...) at main.c:68:12
```
Whoa. That thing is huge! This is not the backtrace I was expecting! Seems like I missed a codepath in my earlier explorations. I got it right up until `prism_script`:


- `main`
- which calls `rb_main`
- which calls `ruby_options`, then `ruby_process_options`, then `process_options`
- which calls `prism_script`
- The next function I expected was `pm_iseq_new_main`, but instead we head into `ruby_opt_init`
- which calls `Init_builtin_features`


This path seems to go through some gem preloading logic, which is why we see the `rb_require` calls:


```c
void
Init_builtin_features(void)
{
    rb_load_with_builtin_functions(&#34;gem_prelude&#34;, NULL);
}
```
By default, CRuby loads `gem_prelude`, which lives in `ruby/gem_prelude.rb`. Here&#39;s that file, shortened for brevity:


```rb
require &#39;rubygems&#39;
require &#39;error_highlight&#39;
require &#39;did_you_mean&#39;
require &#39;syntax_suggest/core_ext&#39;
```
### Compiling on-the-fly


There&#39;s something I&#39;ve learned here that seems obvious in hindsight, but I hadn&#39;t considered. Ruby will only compile what is actually _loaded_, and only at the point it gets loaded. If I never load a particular piece of code, it never gets compiled. If I defer loading it until later, it doesn&#39;t get compiled until later.


We can actually demonstrate this by deferring a require:


```rb
sleep 10

require &#34;net/http&#34;
```
If we run this using `make lldb-ruby`, we can see the delayed compilation in action:


```text
(lldb) break set --file ruby.c --line 2616
(lldb) run
// hits our prism compile code
(lldb) next
(lldb) break set --file compile.c --line 3476
(lldb) continue
// waits 10 seconds, then compiles the contents of &#34;net/http&#34;
```
### Getting to our `test.rb` file


I&#39;d rather see just my code in `test.rb` get compiled, so I&#39;m going to set a breakpoint directly on `pm_iseq_new_main`, which for me is in `ruby.c` on line `2616`:


```c
(lldb) break set --file ruby.c --line 2616
(lldb) run
Process 32534 launched: &#39;/ruby/build/ruby&#39; (arm64)
Process 32534 stopped
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 1.1
    frame #0: ruby`process_options(...) at ruby.c:2616:38
   2613         if (!result.ast) {
   2614             pm_parse_result_t *pm = &amp;result.prism;
   2615             int error_state;
-&gt; 2616             iseq = pm_iseq_new_main(&amp;pm-&gt;node, opt-&gt;script_name, path, parent, optimize, &amp;error_state);
   2617
   2618             pm_parse_result_free(pm);
   2619
```
Now when we run the backtrace, we see what I expected, because we&#39;ve skipped the `gem_prelude` compilation. This is the exact flow I walked through in [part 2](https://jpcamara.com/2024/12/22/finding-the-compiler.html):


```c
(lldb) bt
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 1.1
  * frame #0: ruby`process_options(...) at ruby.c:2616:38
    frame #1: ruby`ruby_process_options(...) at ruby.c:3169:12
    frame #2: ruby`ruby_options(...) at eval.c:117:16
    frame #3: ruby`rb_main(...) at main.c:43:26
    frame #4: ruby`main(...) at main.c:68:12
```
From here, we can set our `iseq_peephole_optimize` breakpoint and see only our specific code get compiled. Since we&#39;re already in the running program, we call `continue` to keep executing:


```c
(lldb) break set --file compile.c --line 3476
Breakpoint 2: where = ruby`iseq_peephole_optimize + 2276 at compile.c:3476:17
(lldb) continue
Process 55336 resuming
Process 55336 stopped
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 2.1
    frame #0: ruby`iseq_peephole_optimize() at compile.c:3476:17
   3473             const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(next, 1);
   3474
   3475             if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 0 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idFreeze) {
-&gt; 3476                 iobj-&gt;insn_id = BIN(opt_ary_freeze);
   3477                 iobj-&gt;operand_size = 2;
   3478                 iobj-&gt;operands = compile_data_calloc2(iseq, iobj-&gt;operand_size, sizeof(VALUE));
   3479                 iobj-&gt;operands[0] = rb_cArray_empty_frozen;
```
If we call `bt` from here to get the backtrace, we finally see the connection between `prism_compile.c` and `compile.c`. `pm_iseq_compile_node` calls `iseq_setup_insn`, which runs the optimization logic. In the previous post, I saw `iseq_setup_insn`, but I didn&#39;t know what it meant or what it did. Now we know. This is what Kevin Newton referred to earlier: specialization comes after compilation. Prism compiles the node in the standard way, then the peephole optimization layer - the specialization - is applied after:


```c
(lldb) bt
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 2.1
  * frame #0: ruby`iseq_peephole_optimize(...) at compile.c:3476:17
    frame #1: ruby`iseq_optimize(...) at compile.c:4352:17
    frame #2: ruby`iseq_setup_insn(...) at compile.c:1619:5
    frame #3: ruby`pm_iseq_compile_node(...) at prism_compile.c:10139:5
    frame #4: ruby`pm_iseq_new_with_opt_try(...) at iseq.c:1029:5
    frame #5: ruby`rb_protect(...) at eval.c:1033:18
    frame #6: ruby`pm_iseq_new_with_opt(...) at iseq.c:1082:5
    frame #7: ruby`pm_iseq_new_main(...) at iseq.c:930:12
    frame #8: ruby`process_options(...) at ruby.c:2616:20
    frame #9: ruby`ruby_process_options(...) at ruby.c:3169:12
    frame #10: ruby`ruby_options(...) at eval.c:117:16
    frame #11: ruby`rb_main(...) at main.c:43:26
    frame #12: ruby`main(...) at main.c:68:12
```
From here, we can inspect the current instruction using `expr`:


```c
(lldb) expr *(iobj)
(INSN) $4 = {
  link = {
    type = ISEQ_ELEMENT_INSN
    next = 0x000000011f6568d0
    prev = 0x000000011f656850
  }
  insn_id = YARVINSN_newarray
  operand_size = 1
  sc_state = 0
  operands = 0x000000011f640118
  insn_info = (line_no = 1, node_id = 3, events = 0)
}
```
We see that `iobj` contains links to the previous and next instructions, as well as an `insn_id` and some other metadata. The instruction is currently `YARVINSN_newarray`. If we run `next`, that should execute `iobj-&gt;insn_id = BIN(opt_ary_freeze);`, and our instruction should change:


```c
(lldb) next
(lldb) expr *(iobj)
(INSN) $5 = {
  //...
  insn_id = YARVINSN_opt_ary_freeze
  //...
}
```
It does! The instruction was changed from `newarray` to `opt_ary_freeze`! The optimization is at least partially complete (I&#39;m not sure if more is involved yet).


### Making one small step towards `opt_respond_to`


This is already the longest and densest post in the series. But I&#39;d love to make some actual progress towards a new instruction. Let&#39;s pattern match on `respond_to?` in the peephole optimizer.


Here is our sample program:


```ruby
puts &#34;Did you know you can write to $stdout?&#34; if $stdout.respond_to?(:write)
```
Running it with `RUNOPT0=--dump=insns make runruby`, we get the following instructions:


```text
== disasm: #&lt;ISeq:&lt;main&gt;./test.rb:1 (1,0)-(1,76)&gt;
0000 getglobal                              :$stdout                  (   1)[Li]
0002 putobject                              :write
0004 opt_send_without_block                 &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
0006 branchunless                           14
0008 putself
0009 putchilledstring                       &#34;Did you know you can write to $stdout?&#34;
0011 opt_send_without_block                 &lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&gt;
0013 leave
0014 putnil
0015 leave
```
I want to match on this line:


```text
0004 opt_send_without_block       &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
```
Here&#39;s my attempt. I&#39;m going to copy what the `newarray`/`freeze` optimization is doing, and just try changing a few things to match my example. Right underneath the code we&#39;ve been debugging for `newarray`, I&#39;m adding this:


```c
// If the instruction is `send_without_block`, ie `0004 opt_send_without_block`
if (IS_INSN_ID(iobj, send_without_block)) {
    // Pull the same info the `newarray` optimization does
    const struct rb_callinfo *ci = (struct rb_callinfo *)OPERAND_AT(iobj, 0);
    const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(iobj, 1);

    // &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
    // 1. We have ARGS_SIMPLE, which is probably what `vm_ci_simple(ci)` checks for
    // 2. We have argc:1, which should match `vm_ci_argc(ci) == 1`
    // 3. We send without a block, hence blockiseq == NULL
    // 4. The method id (mid) for `vm_ci_mid(ci)` matches `idRespond_to`. I found it by searching
    //    for names similar to `idFreeze` - swapping in `idRespond` turned up `idRespond_to`
    if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 1 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idRespond_to) {
        int i = 0;
    }
}
```
Now I&#39;ll follow the same debugging as before, but I&#39;ll add a breakpoint in `compile.c` where I added my new code. Specifically, I&#39;m setting a breakpoint at the `int i = 0;` line so I am inside the `if` statement:


```c
(lldb) break set --file ruby.c --line 2616
Breakpoint 1: where = ruby`process_options + 4068 at ruby.c:2616:38
(lldb) run
(lldb) break set --file compile.c --line 3491
Breakpoint 2: where = ruby`iseq_peephole_optimize + 2536 at compile.c:3491:17
(lldb) continue
Process 61925 resuming
Process 61925 stopped
* thread #1, queue = &#39;com.apple.main-thread&#39;, stop reason = breakpoint 2.1
    frame #0: ruby`iseq_peephole_optimize(...) at compile.c:3491:17
   3488         const rb_iseq_t *blockiseq = (rb_iseq_t *)OPERAND_AT(iobj, 1);
   3489
   3490         if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 1 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idRespond_to) {
-&gt; 3491             int i = 0;
   3492         }
   3493     }
   3494
```
I think it worked! It pattern matched on the characteristics of the `respond_to?` call, and hit the breakpoint set on `int i = 0;`. It&#39;s a tiny step, but it&#39;s a first step toward adding the optimization.
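

To make the destination concrete, here&#39;s a rough sketch of where this is heading, mirroring the `opt_ary_freeze` rewrite we debugged earlier. It&#39;s purely hypothetical - `opt_respond_to` isn&#39;t defined yet, so this wouldn&#39;t even compile until the instruction is added to `insns.def`:


```c
if (vm_ci_simple(ci) &amp;&amp; vm_ci_argc(ci) == 1 &amp;&amp; blockiseq == NULL &amp;&amp; vm_ci_mid(ci) == idRespond_to) {
    // Hypothetical: swap the generic send for a specialized instruction,
    // the same way the freeze optimization swaps in `opt_ary_freeze`
    iobj-&gt;insn_id = BIN(opt_respond_to);
}
```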


### Using `gdb`


For anyone wanting to do the same work using `gdb`, it&#39;s pretty similar. Let&#39;s start by creating a `breakpoints.gdb` file in the root of your project. It plays the same role as our interactive `lldb` setup, except the breakpoint is registered before `run` is ever called:


```text
break ruby.c:2616
```
When you run `make gdb-ruby`, you can use the same backtrace command, `bt`:


```c
&gt; make gdb-ruby
Thread 1 &#34;ruby&#34; hit Breakpoint 4, process_options (...) at ../ruby.c:2616
2616	            iseq = pm_iseq_new_main(&amp;pm-&gt;node, opt-&gt;script_name, path, parent, optimize, &amp;error_state);
(gdb) bt
#0  process_options (...) at ../ruby.c:2616
#1  in ruby_process_options (...) at ../ruby.c:3169
#2  in ruby_options (...) at ../eval.c:117
#3  in rb_main (...) at ../main.c:43
#4  in main (...) at ../main.c:68
(gdb)
```
From here, you can set your next breakpoint so that you can see the compilation solely for the `newarray` instruction from our `test.rb` program:


```c
(gdb) break compile.c:3476
Breakpoint 5 at 0xaaaabaa22f14: file ../compile.c, line 3476
(gdb) continue
Continuing.

Thread 1 &#34;ruby&#34; hit Breakpoint 5, iseq_peephole_optimize (...) at ../compile.c:3476
3476	                iobj-&gt;insn_id = BIN(opt_ary_freeze);
```
Similar to the `lldb` command `expr`, we can inspect the contents of locals using `p` or `print` in `gdb`:


```c
(gdb) p *(iobj)
$2 = {link = {type = ISEQ_ELEMENT_INSN, next = 0xaaaace797ef0, prev = 0xaaaace797e70}, insn_id = YARVINSN_newarray,
  operand_size = 1, sc_state = 0, operands = 0xaaaace796ac8, insn_info = {line_no = 1, node_id = 3, events = 0}}
```
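And just like in `lldb`, we can step over the assignment with `next` and confirm the rewrite. The exact formatting may vary by `gdb` version, but it should look roughly like this:


```c
(gdb) next
(gdb) p iobj-&gt;insn_id
$3 = YARVINSN_opt_ary_freeze
```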
### Finishing up


Ok, this went pretty long. Good on you for sticking in there with me! We&#39;ve found the optimizer, and we&#39;ve pattern matched our way to a `respond_to?` call. Next, we need to add the new instruction definition and try to actually replace the `send` with our new instruction. See you next time! 👋🏼

</source:markdown>
    </item>
    
    <item>
      <title>The Ruby Syntax Holy Grail: adding `opt_respond_to` to the Ruby VM, part 3</title>
      <link>https://jpcamara.com/2024/12/25/the-ruby-syntax-holy-grail.html</link>
      <pubDate>Wed, 25 Dec 2024 23:37:06 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/25/the-ruby-syntax-holy-grail.html</guid>
      <description>&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/12/22/finding-the-compiler.html&#34;&gt;Finding the compiler: adding &lt;code&gt;opt_respond_to&lt;/code&gt; to the Ruby VM, part 2&lt;/a&gt;, I found the entrypoint into the compiler! It takes the root of our abstract syntax tree - &lt;code&gt;pm-&amp;gt;node&lt;/code&gt; - and produces a &lt;code&gt;rb_iseq_t&lt;/code&gt;. &lt;code&gt;rb_iseq_t&lt;/code&gt; is an &amp;ldquo;InstructionSequence&amp;rdquo;, which represents our virtual machine bytecode. Here&amp;rsquo;s the code where we left off:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// ruby.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;process_options&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv, ruby_cmdline_options_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;opt)
{
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    pm_parse_result_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;pm &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;result.prism;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; error_state;
    iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; pm_iseq_new_main(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;pm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;node, opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;script_name, path, parent, optimize, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;error_state);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Nothing about this code screams &amp;ldquo;I&amp;rsquo;m the compiler!&amp;rdquo;. But I am taking an educated guess, since:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The compiler should produce instruction sequences, or &lt;code&gt;iseq&lt;/code&gt;s&lt;/li&gt;
&lt;li&gt;We know this is the function that returns our program&amp;rsquo;s &lt;code&gt;iseq&lt;/code&gt; when &lt;code&gt;main.c&lt;/code&gt; is called&lt;/li&gt;
&lt;li&gt;This is the only line that produces an &lt;code&gt;iseq&lt;/code&gt; in this function&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;With all that lining up, I&amp;rsquo;m confident this is the place we need to investigate further. Now let&amp;rsquo;s find what needs to change to add our new bytecode. Stepping into &lt;code&gt;pm_iseq_new_main&lt;/code&gt;, there are a few layers I need to wade through to get to something that seems promising.&lt;/p&gt;
&lt;h3 id=&#34;getting-your-own-environment-setup&#34;&gt;Getting your own environment setup&lt;/h3&gt;
&lt;p&gt;Before I dig in further, let&amp;rsquo;s take a quick step back. In case you want to join in at home, here are some simple(ish) steps for doing that.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Check out my guides on how to setup your development environment to hack on CRuby. I have a &lt;a href=&#34;https://jpcamara.com/2024/11/27/my-docker-setup.html&#34;&gt;docker&lt;/a&gt; guide and a &lt;a href=&#34;https://jpcamara.com/2024/12/02/my-macos-setup.html&#34;&gt;MacOS&lt;/a&gt; guide. The one thing I didn&amp;rsquo;t add to them was cloning the repo. So you&amp;rsquo;ll need to &lt;code&gt;git clone&lt;/code&gt; the &lt;a href=&#34;https://github.com/ruby/ruby&#34;&gt;Ruby&lt;/a&gt; repository.&lt;/li&gt;
&lt;li&gt;Building CRuby can take a few minutes the first time you run it. After you&amp;rsquo;ve built everything, you can easily test your local setup using a file named &lt;code&gt;test.rb&lt;/code&gt;. Create a &lt;code&gt;test.rb&lt;/code&gt; file in the root of your CRuby folder.&lt;/li&gt;
&lt;li&gt;You can run &lt;code&gt;make runruby&lt;/code&gt;, and it will run whatever is inside your &lt;code&gt;test.rb&lt;/code&gt; file. You can even use &lt;em&gt;debug&lt;/em&gt; tools to debug the C code you&amp;rsquo;re running and inspecting - we&amp;rsquo;ll talk more about those later.&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id=&#34;back-to-the-investigation&#34;&gt;Back to the investigation&lt;/h3&gt;
&lt;p&gt;First we&amp;rsquo;ve got the function &lt;code&gt;pm_iseq_new_main&lt;/code&gt;, which seems to set us up as the &lt;code&gt;&amp;lt;main&amp;gt;&lt;/code&gt; &lt;code&gt;rb_iseq_t&lt;/code&gt;.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// iseq.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_iseq_new_main&lt;/span&gt;(pm_scope_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;node, VALUE path, VALUE realpath, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;parent, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; opt, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;error_state)
{
    iseq_new_setup_coverage(path, (&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;) (node&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;parser&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;newline_list.size &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;));

    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; pm_iseq_new_with_opt(node, rb_fstring_lit(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&amp;lt;main&amp;gt;&amp;#34;&lt;/span&gt;),
                                path, realpath, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;,
                                parent, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;, ISEQ_TYPE_MAIN, opt &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;COMPILE_OPTION_DEFAULT : &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;COMPILE_OPTION_FALSE, error_state);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This looked immediately familiar to me. It sticks out because I&amp;rsquo;ve seen that &lt;code&gt;&amp;lt;main&amp;gt;&lt;/code&gt; before. Let&amp;rsquo;s run a simple Ruby program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;begin&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;rescue&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; e
  puts e&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;backtrace
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;All our program does is &lt;code&gt;raise&lt;/code&gt; an error, &lt;code&gt;rescue&lt;/code&gt; the error, then &lt;code&gt;puts&lt;/code&gt; the backtrace. What does that backtrace look like?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;../test.rb:2:in &amp;#39;&amp;lt;main&amp;gt;&amp;#39;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Oh yea! We are executing our code at the top-level of the program. And that top level is referred to as &lt;code&gt;&amp;lt;main&amp;gt;&lt;/code&gt;. I &lt;em&gt;think&lt;/em&gt; that&amp;rsquo;s being named by our &lt;code&gt;pm_iseq_new_with_opt(node, rb_fstring_lit(&amp;quot;&amp;lt;main&amp;gt;&amp;quot;)...&lt;/code&gt; call - neat!&lt;/p&gt;
&lt;p&gt;&lt;code&gt;iseq_new_setup_coverage&lt;/code&gt; just sets up some optional coverage information, so let&amp;rsquo;s move to &lt;code&gt;pm_iseq_new_with_opt&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_iseq_new_with_opt&lt;/span&gt;(pm_scope_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;node, VALUE name, VALUE path, VALUE realpath,
                     &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; first_lineno, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;parent, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; isolated_depth,
                     &lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; rb_iseq_type type, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; rb_compile_option_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;option, &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;error_state)
{
    rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; iseq_alloc();
    ISEQ_BODY(iseq)&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;prism &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; true;
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; pm_iseq_new_with_opt_data data &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {
        .iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; iseq,
        .node &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; node
    };
    rb_protect(pm_iseq_new_with_opt_try, (VALUE)&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;data, error_state);

    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;error_state) &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; NULL;

    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; iseq_translate(iseq);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This code allocates (&lt;code&gt;iseq_alloc&lt;/code&gt;) an &lt;code&gt;rb_iseq_t&lt;/code&gt; struct and sets it as being part of prism. I believe &lt;code&gt;rb_protect&lt;/code&gt; is to allow handling of errors that might be raised while running a particular function? Looking at the git blame I see &lt;a href=&#34;https://github.com/peterzhu2118&#34;&gt;Peter Zhu&lt;/a&gt; added it to &lt;a href=&#34;https://github.com/ruby/ruby/commit/51ffef281996727c60571771cd07c1459ba58cd2&#34;&gt;catch errors&lt;/a&gt;, so confirmed ✅. Not a lot is happening here otherwise, so let&amp;rsquo;s jump into &lt;code&gt;pm_iseq_new_with_opt_try&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_iseq_new_with_opt_try&lt;/span&gt;(VALUE d)
{
    &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; pm_iseq_new_with_opt_data &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;data &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; pm_iseq_new_with_opt_data &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)d;

    &lt;span style=&#34;color:#75715e&#34;&gt;// This can compile child iseqs, which can raise syntax errors
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    pm_iseq_compile_node(data&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;iseq, data&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;node);

    &lt;span style=&#34;color:#75715e&#34;&gt;// This raises an exception if there is a syntax error
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    finish_iseq_build(data&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;iseq);

    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qundef;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This is the most promising piece of code so far. It&amp;rsquo;s the first thing that kindly tells me in explicit terms: &amp;ldquo;I am going to compile something&amp;rdquo;. Presumably &lt;code&gt;pm_iseq_compile_node&lt;/code&gt; compiles &lt;code&gt;data-&amp;gt;node&lt;/code&gt; into the &lt;code&gt;data-&amp;gt;iseq&lt;/code&gt;. It&amp;rsquo;s in a new file called &lt;code&gt;prism_compile.c&lt;/code&gt;. Let&amp;rsquo;s check it out!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// prism_compile.c
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_iseq_compile_node&lt;/span&gt;(rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq, pm_scope_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;node)
{
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (pm_iseq_pre_execution_p(iseq)) {
        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        pm_compile_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node, body, false, node);
        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        pm_compile_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node, ret, false, node);
    }

    CHECK(iseq_setup_insn(iseq, ret));
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; iseq_setup(iseq, ret);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;😮‍💨. There are many layers to this compilation. Primarily, this function seems to do two things: &amp;ldquo;compile&amp;rdquo; the node, then &amp;ldquo;setup&amp;rdquo; the iseq. I don&amp;rsquo;t know why the &lt;code&gt;iseq&lt;/code&gt; &amp;ldquo;setup&amp;rdquo; is required yet. Let&amp;rsquo;s start with &lt;code&gt;pm_compile_node&lt;/code&gt; and I&amp;rsquo;ll come back to the rest:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;pm_compile_node&lt;/span&gt;(rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq, &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;node, LINK_ANCHOR &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; ret, &lt;span style=&#34;color:#66d9ef&#34;&gt;bool&lt;/span&gt; popped, pm_scope_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;scope_node)
{
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_parser_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;parser &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; scope_node&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;parser;
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;switch&lt;/span&gt; (PM_NODE_TYPE(node)) {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_ALIAS_GLOBAL_VARIABLE_NODE:
        &lt;span style=&#34;color:#75715e&#34;&gt;// alias $foo $bar
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^^^^^^^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        pm_compile_alias_global_variable_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_alias_global_variable_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;location, ret, popped, scope_node);
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt;;
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_ARRAY_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// [foo, bar, baz]
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^^^^^^^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_array_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;cast &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_array_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node;
        pm_compile_array_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) cast, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;cast&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;elements, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;location, ret, popped, scope_node);
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt;;
      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_FLIP_FLOP_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// if foo .. bar; end
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;//    ^^^^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_flip_flop_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;cast &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_flip_flop_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node;
        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        pm_compile_flip_flop(cast, else_label, then_label, iseq, location.line, ret, popped, scope_node);
        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_IT_LOCAL_VARIABLE_READ_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// -&amp;gt; { it }
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;//      ^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;popped) {
            PUSH_GETLOCAL(ret, location, scope_node&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;local_table_for_iseq_size, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;);
        }

        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt;;
      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_MODULE_NODE: {
        &lt;span style=&#34;color:#75715e&#34;&gt;// module Foo; end
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;        &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      }
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;pm_compile_node&lt;/code&gt; puts the &amp;ldquo;fun&amp;rdquo; in &amp;ldquo;function&amp;rdquo;. It&amp;rsquo;s really cool! This 1800+ line monster seems to cover a huge swath of Ruby syntax. Maybe all of it? The &lt;code&gt;prism_compile.c&lt;/code&gt; file is 11 &lt;em&gt;thousand&lt;/em&gt; lines long, as each &lt;code&gt;case&lt;/code&gt; of this &lt;code&gt;switch&lt;/code&gt; statement branches off into more granular node compilations, like &lt;code&gt;pm_compile_array_node&lt;/code&gt; and &lt;code&gt;pm_compile_flip_flop&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;With that in mind, it is also an utterly daunting file to consider for the &lt;code&gt;opt_respond_to&lt;/code&gt; instruction. Do I edit this file? Where would I even start? I need to swap out a method call to &lt;code&gt;respond_to?&lt;/code&gt; - there is code that seems to handle method calls:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; PM_CALL_NODE:
    &lt;span style=&#34;color:#75715e&#34;&gt;// foo
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;//
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// foo.bar
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;//
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// foo.bar() {}
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;// ^^^^^^^^^^^^
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    pm_compile_call_node(iseq, (&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; pm_call_node_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;) node, ret, popped, scope_node);
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt;;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Maybe that&amp;rsquo;s it?&lt;/p&gt;
&lt;p&gt;I think I need to use a cheat code here to give me some direction. In the previous post, I mentioned &lt;a href=&#34;https://github.com/etiennebarrie&#34;&gt;Étienne Barrié&lt;/a&gt;&amp;rsquo;s PR to &lt;a href=&#34;https://github.com/ruby/ruby/pull/11406&#34;&gt;add optimized instructions for frozen literal Hash and Array&lt;/a&gt;. I&amp;rsquo;ve been mostly ignoring it so far, but I think it&amp;rsquo;s time I use that for a bit of direction on where to go from here.&lt;/p&gt;
&lt;p&gt;I think we&amp;rsquo;re close! So far, I&amp;rsquo;ve navigated the code manually. In the next post, we&amp;rsquo;re going to actually run and debug some code, and dig a bit into Étienne&amp;rsquo;s work. See you then! 👋🏼&lt;/p&gt;
</description>
      <source:markdown>In [Finding the compiler: adding `opt_respond_to` to the Ruby VM, part 2](https://jpcamara.com/2024/12/22/finding-the-compiler.html), I found the entrypoint into the compiler! It takes the root of our abstract syntax tree - `pm-&gt;node` - and produces a `rb_iseq_t`. `rb_iseq_t` is an &#34;InstructionSequence&#34;, which represents our virtual machine bytecode. Here&#39;s the code where we left off:


```c
// ruby.c
static VALUE
process_options(int argc, char **argv, ruby_cmdline_options_t *opt)
{
    //...
    pm_parse_result_t *pm = &amp;result.prism;
    int error_state;
    iseq = pm_iseq_new_main(&amp;pm-&gt;node, opt-&gt;script_name, path, parent, optimize, &amp;error_state);
```
Nothing about this code screams &#34;I&#39;m the compiler!&#34;. But I am taking an educated guess, since:


- The compiler should produce instruction sequences, or `iseq`s
- We know this is the function that returns our program&#39;s `iseq` when `main.c` is called
- This is the only line that produces an `iseq` in this function


With all that lining up, I&#39;m confident this is the place we need to investigate further. Now let&#39;s find what needs to change to add our new bytecode. Stepping into `pm_iseq_new_main`, there are a few layers I need to wade through to get to something that seems promising.


### Getting your own environment setup


Before I dig in further, let&#39;s take a quick step back. In case you want to join in at home, here are some simple(ish) steps for doing that.


1. Check out my guides on how to setup your development environment to hack on CRuby. I have a [docker](https://jpcamara.com/2024/11/27/my-docker-setup.html) guide and a [MacOS](https://jpcamara.com/2024/12/02/my-macos-setup.html) guide. The one thing I didn&#39;t add to them was cloning the repo. So you&#39;ll need to `git clone` the [Ruby](https://github.com/ruby/ruby) repository.
1. Building CRuby can take a few minutes the first time you run it. After you&#39;ve built everything, you can easily test your local setup using a file named `test.rb`. Create a `test.rb` file in the root of your CRuby folder.
1. You can run `make runruby`, and it will run whatever is inside your `test.rb` file (there&#39;s a quick example after this list). You can even use _debug_ tools to debug the C code you&#39;re running and inspecting - we&#39;ll talk more about those later.
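

For example, a `test.rb` like this is enough to prove your build works (the message is arbitrary):


```rb
# test.rb
puts &#34;Hello from my local CRuby build!&#34;
```
Running `make runruby` should then print that message using your freshly built `ruby` binary.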


### Back to the investigation


First we&#39;ve got the function `pm_iseq_new_main`, which seems to set us up as the `&lt;main&gt;` `rb_iseq_t`.


```c
// iseq.c
rb_iseq_t *
pm_iseq_new_main(pm_scope_node_t *node, VALUE path, VALUE realpath, const rb_iseq_t *parent, int opt, int *error_state)
{
    iseq_new_setup_coverage(path, (int) (node-&gt;parser-&gt;newline_list.size - 1));

    return pm_iseq_new_with_opt(node, rb_fstring_lit(&#34;&lt;main&gt;&#34;),
                                path, realpath, 0,
                                parent, 0, ISEQ_TYPE_MAIN, opt ? &amp;COMPILE_OPTION_DEFAULT : &amp;COMPILE_OPTION_FALSE, error_state);
}
```
This looked immediately familiar to me. It sticks out because I&#39;ve seen that `&lt;main&gt;` before. Let&#39;s run a simple Ruby program:


```ruby
begin
  raise
rescue =&gt; e
  puts e.backtrace
end
```
All our program does is `raise` an error, `rescue` the error, then `puts` the backtrace. What does that backtrace look like?


```text
../test.rb:2:in &#39;&lt;main&gt;&#39;
```
Oh yea! We are executing our code at the top-level of the program. And that top level is referred to as `&lt;main&gt;`. I _think_ that&#39;s being named by our `pm_iseq_new_with_opt(node, rb_fstring_lit(&#34;&lt;main&gt;&#34;)...` call - neat!
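

We can poke at this from Ruby, too. If I&#39;m right about the naming, the label on the top-level instruction sequence should be `&lt;main&gt;`. `RubyVM::InstructionSequence` lets us check:


```rb
iseq = RubyVM::InstructionSequence.compile_file(&#34;test.rb&#34;)
pp iseq.label # =&gt; &#34;&lt;main&gt;&#34;
```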


`iseq_new_setup_coverage` just sets up some optional coverage information, so let&#39;s move to `pm_iseq_new_with_opt`:


```c
rb_iseq_t *
pm_iseq_new_with_opt(pm_scope_node_t *node, VALUE name, VALUE path, VALUE realpath,
                     int first_lineno, const rb_iseq_t *parent, int isolated_depth,
                     enum rb_iseq_type type, const rb_compile_option_t *option, int *error_state)
{
    rb_iseq_t *iseq = iseq_alloc();
    ISEQ_BODY(iseq)-&gt;prism = true;
    //...
    struct pm_iseq_new_with_opt_data data = {
        .iseq = iseq,
        .node = node
    };
    rb_protect(pm_iseq_new_with_opt_try, (VALUE)&amp;data, error_state);

    if (*error_state) return NULL;

    return iseq_translate(iseq);
}
```
This code allocates (`iseq_alloc`) an `rb_iseq_t` struct and sets it as being part of prism. I believe `rb_protect` is to allow handling of errors that might be raised while running a particular function? Looking at the git blame I see [Peter Zhu](https://github.com/peterzhu2118) added it to [catch errors](https://github.com/ruby/ruby/commit/51ffef281996727c60571771cd07c1459ba58cd2), so confirmed ✅. Not a lot is happening here otherwise.
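

`rb_protect` follows a general pattern in the CRuby C API: run a function, and capture whether it raised instead of letting the exception unwind past you. Here&#39;s a minimal sketch of that pattern - the function and variable names are my own, not from the CRuby source:


```c
int state = 0;

// Run `compile_something` with `data` as its argument. If it raises,
// `state` is set to a non-zero tag instead of the exception propagating
VALUE result = rb_protect(compile_something, (VALUE)data, &amp;state);

if (state) {
    // handle (or re-raise) the captured error
}
```
Now let&#39;s jump into `pm_iseq_new_with_opt_try`: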


```c
VALUE
pm_iseq_new_with_opt_try(VALUE d)
{
    struct pm_iseq_new_with_opt_data *data = (struct pm_iseq_new_with_opt_data *)d;

    // This can compile child iseqs, which can raise syntax errors
    pm_iseq_compile_node(data-&gt;iseq, data-&gt;node);

    // This raises an exception if there is a syntax error
    finish_iseq_build(data-&gt;iseq);

    return Qundef;
}
```
This is the most promising piece of code so far. It&#39;s the first thing that kindly tells me in explicit terms: &#34;I am going to compile something&#34;. Presumably `pm_iseq_compile_node` compiles `data-&gt;node` into the `data-&gt;iseq`. It&#39;s in a new file called `prism_compile.c`. Let&#39;s check it out!


```c
// prism_compile.c
VALUE
pm_iseq_compile_node(rb_iseq_t *iseq, pm_scope_node_t *node)
{
    //...
    if (pm_iseq_pre_execution_p(iseq)) {
        //...
        pm_compile_node(iseq, (const pm_node_t *) node, body, false, node);
        //...
    }
    else {
        //...
        pm_compile_node(iseq, (const pm_node_t *) node, ret, false, node);
    }

    CHECK(iseq_setup_insn(iseq, ret));
    return iseq_setup(iseq, ret);
}
```
😮‍💨. There are many layers to this compilation. Primarily, this function seems to do two things: &#34;compile&#34; the node, then &#34;setup&#34; the iseq. I don&#39;t know why the `iseq` &#34;setup&#34; is required yet. Let&#39;s start with `pm_compile_node` and I&#39;ll come back to the rest:


```c
static void
pm_compile_node(rb_iseq_t *iseq, const pm_node_t *node, LINK_ANCHOR *const ret, bool popped, pm_scope_node_t *scope_node)
{
    const pm_parser_t *parser = scope_node-&gt;parser;
    //...
    switch (PM_NODE_TYPE(node)) {
      case PM_ALIAS_GLOBAL_VARIABLE_NODE:
        // alias $foo $bar
        // ^^^^^^^^^^^^^^^
        pm_compile_alias_global_variable_node(iseq, (const pm_alias_global_variable_node_t *) node, &amp;location, ret, popped, scope_node);
        return;
      //...
      case PM_ARRAY_NODE: {
        // [foo, bar, baz]
        // ^^^^^^^^^^^^^^^
        const pm_array_node_t *cast = (const pm_array_node_t *) node;
        pm_compile_array_node(iseq, (const pm_node_t *) cast, &amp;cast-&gt;elements, &amp;location, ret, popped, scope_node);
        return;
      }
      //...
      case PM_FLIP_FLOP_NODE: {
        // if foo .. bar; end
        //    ^^^^^^^^^^
        const pm_flip_flop_node_t *cast = (const pm_flip_flop_node_t *) node;
        //...
        pm_compile_flip_flop(cast, else_label, then_label, iseq, location.line, ret, popped, scope_node);
        //...
      }
      //...
      case PM_IT_LOCAL_VARIABLE_READ_NODE: {
        // -&gt; { it }
        //      ^^
        if (!popped) {
            PUSH_GETLOCAL(ret, location, scope_node-&gt;local_table_for_iseq_size, 0);
        }

        return;
      }
      //...
      case PM_MODULE_NODE: {
        // module Foo; end
        //...
      }
      //...
}
```
`pm_compile_node` puts the &#34;fun&#34; in &#34;function&#34;. It&#39;s really cool! This 1800+ line monster seems to cover a huge swath of Ruby syntax. Maybe all of it? The `prism_compile.c` file is 11 _thousand_ lines long, as each `case` of this `switch` statement branches off into more granular node compilations, like `pm_compile_array_node` and `pm_compile_flip_flop`.


With that in mind, it is also an utterly daunting file to consider for the `opt_respond_to` instruction. Do I edit this file? Where would I even start? I need to swap out a method call to `respond_to?` - there is code that seems to handle method calls:


```c
case PM_CALL_NODE:
    // foo
    // ^^^
    //
    // foo.bar
    // ^^^^^^^
    //
    // foo.bar() {}
    // ^^^^^^^^^^^^
    pm_compile_call_node(iseq, (const pm_call_node_t *) node, ret, popped, scope_node);
    return;
```
Maybe that&#39;s it?

I think I need to use a cheat code here to give me some direction. In the previous post, I mentioned [Étienne Barrié](https://github.com/etiennebarrie)&#39;s PR to [add optimized instructions for frozen literal Hash and Array](https://github.com/ruby/ruby/pull/11406). I&#39;ve been mostly ignoring it so far, but I think it&#39;s time I use that for a bit of direction on where to go from here.


I think we&#39;re close! So far, I&#39;ve navigated the code manually. In the next post, we&#39;re going to actually run and debug some code, and dig a bit into Étienne&#39;s work. See you then! 👋🏼

</source:markdown>
    </item>
    
    <item>
      <title>Finding the compiler: adding `opt_respond_to` to the Ruby VM, part 2</title>
      <link>https://jpcamara.com/2024/12/22/finding-the-compiler.html</link>
      <pubDate>Mon, 23 Dec 2024 00:41:51 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/22/finding-the-compiler.html</guid>
      <description>&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/12/22/adding-optrespondto-to.html&#34;&gt;Adding &lt;code&gt;opt_respond_to&lt;/code&gt; to the Ruby VM: part 1&lt;/a&gt;, inspired by recent JSON gem optimizations, I setup my goal: I want to add a new bytecode instruction to the Ruby VM which optimizes &lt;code&gt;respond_to?&lt;/code&gt; calls. I took this Ruby code:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And identified what bytecode instructions matter most (I think):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;== disasm: #&amp;lt;ISeq:&amp;lt;compiled&amp;gt;@&amp;lt;compiled&amp;gt;:1 (1,0)-(3,3)&amp;gt;
# ...
0002 putobject                 :write
0004 opt_send_without_block    &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This seems pretty low-level! But it&amp;rsquo;s still very high-level in terms of what I need to actually &lt;em&gt;do&lt;/em&gt;. I know what instructions matter, but how can I &lt;em&gt;change&lt;/em&gt; them?&lt;/p&gt;
&lt;h3 id=&#34;a-little-help-from-git&#34;&gt;A little help from Git&lt;/h3&gt;
&lt;p&gt;Thankfully, &lt;a href=&#34;https://bsky.app/profile/byroot.bsky.social&#34;&gt;@byroot&lt;/a&gt; gave me some helpful direction in the form of a recent PR. &lt;a href=&#34;https://github.com/etiennebarrie&#34;&gt;Étienne Barrié&lt;/a&gt; recently merged a PR to &lt;a href=&#34;https://github.com/ruby/ruby/pull/11406&#34;&gt;add optimized instructions for frozen literal Hash and Array&lt;/a&gt;. It adds special handling for frozen, empty arrays and hashes so that when you use them in your code, calls like &lt;code&gt;[].freeze&lt;/code&gt; will not result in any additional object allocations. Cool! A neat enhancement, and kind of perfect for me to analyze.&lt;/p&gt;
&lt;p&gt;I think the use-case is simpler than what I&amp;rsquo;d need for &lt;code&gt;opt_respond_to&lt;/code&gt;, but looking at that PR I can see that an important part of adding a new bytecode instruction lives in &lt;code&gt;compile.c&lt;/code&gt;. I&amp;rsquo;ll eventually need to add some new logic there, but what steps does CRuby take to &lt;em&gt;get&lt;/em&gt; to &lt;code&gt;compile.c&lt;/code&gt;?&lt;/p&gt;
&lt;h3 id=&#34;starting-from-main&#34;&gt;Starting from main&lt;/h3&gt;
&lt;p&gt;Knowing a bit about the CRuby source, and how C programs start, I know there is a &lt;code&gt;main&lt;/code&gt; function that kicks everything off. In CRuby, it&amp;rsquo;s helpfully in &lt;code&gt;main.c&lt;/code&gt;, at the root of the project:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;main&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv)
{
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; rb_main(argc, argv);
}

&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;rb_main&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv)
{
    RUBY_INIT_STACK;
    ruby_init();
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; ruby_run_node(ruby_options(argc, argv));
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I&amp;rsquo;m going to guess that I don&amp;rsquo;t need &lt;code&gt;RUBY_INIT_STACK&lt;/code&gt; or &lt;code&gt;ruby_init&lt;/code&gt; for adding a new instruction. I took a peek at them, and they seem related to setting up the runtime and creating the data structures needed by the Ruby virtual machine. Past that, there are only two function calls: &lt;code&gt;ruby_options&lt;/code&gt; and &lt;code&gt;ruby_run_node&lt;/code&gt;. &lt;code&gt;ruby_options&lt;/code&gt; sounds like it would just gather the options needed for the program. Maybe we need to go into &lt;code&gt;ruby_run_node&lt;/code&gt;?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_run_node&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;n)
{
    rb_execution_context_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ec &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; GET_EC();
    &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; status;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;ruby_executable_node(n, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;status)) {
        rb_ec_cleanup(ec, (NIL_P(ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;errinfo) &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; TAG_NONE : TAG_RAISE));
        &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; status;
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; rb_ec_cleanup(ec, rb_ec_exec_node(ec, n));
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Maybe? It looks like if the &amp;ldquo;node&amp;rdquo; &lt;code&gt;n&lt;/code&gt; isn&amp;rsquo;t executable, it fails. I&amp;rsquo;ll concentrate instead on the success path on the last line. &lt;code&gt;rb_ec_exec_node&lt;/code&gt; will run first, followed by &lt;code&gt;rb_ec_cleanup&lt;/code&gt;.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 All this &lt;code&gt;ec_*&lt;/code&gt; stuff seems to stand for &lt;code&gt;execution_context&lt;/code&gt;, which presumably is the state of the runtime at any given point?&lt;/p&gt;
&lt;p&gt;📝 I&amp;rsquo;m not Mr. C programmer - so I didn&amp;rsquo;t know what &lt;code&gt;void *n&lt;/code&gt; meant. Looking it up, this seems to be a way of specifying a generic pointer type that can point to any data type.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Let&amp;rsquo;s start by checking &lt;code&gt;rb_ec_exec_node&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;rb_ec_exec_node&lt;/span&gt;(rb_execution_context_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ec, &lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;n)
{
    &lt;span style=&#34;color:#66d9ef&#34;&gt;volatile&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; state;
    rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (rb_iseq_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)n;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;n) &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;;

    EC_PUSH_TAG(ec);
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; ((state &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; EC_EXEC_TAG()) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; TAG_NONE) {
        rb_iseq_eval_main(iseq);
    }
    EC_POP_TAG();
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; state;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Hmmmm. This is the first place where I don&amp;rsquo;t like what I&amp;rsquo;m seeing. My primary concern is that &lt;code&gt;void *n&lt;/code&gt; is getting cast to &lt;code&gt;rb_iseq_t&lt;/code&gt;. The class we used to &lt;code&gt;compile&lt;/code&gt; our Ruby sample in part 1 - &lt;code&gt;RubyVM::InstructionSequence&lt;/code&gt; - is defined in a C file called &lt;code&gt;iseq.c&lt;/code&gt;. So in CRuby, &lt;code&gt;iseq&lt;/code&gt; stands for &amp;ldquo;InstructionSequence&amp;rdquo;. If we already have an &lt;code&gt;iseq&lt;/code&gt;, I think it means our code has already been compiled and we&amp;rsquo;ve gone too far.&lt;/p&gt;
&lt;h3 id=&#34;stepping-back-to-ruby_options&#34;&gt;Stepping back to &lt;code&gt;ruby_options&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;&lt;code&gt;ruby_run_node&lt;/code&gt; doesn&amp;rsquo;t do much aside from calling &lt;code&gt;rb_ec_exec_node&lt;/code&gt;. So if &lt;code&gt;ruby_run_node&lt;/code&gt; and &lt;code&gt;rb_ec_exec_node&lt;/code&gt; are not the right functions&amp;hellip; that only leaves &lt;code&gt;ruby_options&lt;/code&gt;. Not what I would expect, but let&amp;rsquo;s check:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_options&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv)
{
    rb_execution_context_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ec &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; GET_EC();
    &lt;span style=&#34;color:#66d9ef&#34;&gt;enum&lt;/span&gt; ruby_tag_type state;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;volatile&lt;/span&gt; iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;;

    EC_PUSH_TAG(ec);
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; ((state &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; EC_EXEC_TAG()) &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; TAG_NONE) {
        iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; ruby_process_options(argc, argv);
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
        rb_ec_clear_current_thread_trace_func(ec);
        &lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; exitcode &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; error_handle(ec, ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;errinfo, state);
        ec&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;errinfo &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Qnil; &lt;span style=&#34;color:#75715e&#34;&gt;/* just been handled */&lt;/span&gt;
        iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)INT2FIX(exitcode);
    }
    EC_POP_TAG();
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; iseq;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There&amp;rsquo;s a lot going on in here, but I&amp;rsquo;m drawn to the &lt;code&gt;iseq = ruby_process_options(argc, argv)&lt;/code&gt; line. Let&amp;rsquo;s dig into &lt;code&gt;ruby_process_options&lt;/code&gt;. This is a big one:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;ruby_process_options&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv)
{
    ruby_cmdline_options_t opt;
    VALUE iseq;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;script_name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (argc &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; argv[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;]) &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; argv[&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; ruby_engine;

    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;origarg.argv &lt;span style=&#34;color:#f92672&#34;&gt;||&lt;/span&gt; origarg.argc &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;) {
        origarg.argc &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; argc;
        origarg.argv &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; argv;
    }
    set_progname(external_str_new_cstr(script_name));  &lt;span style=&#34;color:#75715e&#34;&gt;/* for the time being */&lt;/span&gt;
    rb_argv0 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_str_new4(rb_progname);
    rb_vm_register_global_object(rb_argv0);

&lt;span style=&#34;color:#75715e&#34;&gt;#ifndef HAVE_SETPROCTITLE
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    ruby_init_setproctitle(argc, argv);
&lt;span style=&#34;color:#75715e&#34;&gt;#endif
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;
    iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; process_options(argc, argv, cmdline_options_init(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;opt));

    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)(&lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; RData&lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;)iseq;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Most of this function seems to be VM setup. But I think we&amp;rsquo;re getting closer with &lt;code&gt;iseq = process_options(...)&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;Checking &lt;code&gt;process_options&lt;/code&gt;&amp;hellip; whoa, this is a ~350 line function! It&amp;rsquo;s a bit much to all paste in here, but scanning the code, I think we&amp;rsquo;re on the right track. There are all sorts of option initializations here:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;process_options&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv, ruby_cmdline_options_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;opt)
{
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (FEATURE_SET_P(opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;features, yjit)) {
        &lt;span style=&#34;color:#66d9ef&#34;&gt;bool&lt;/span&gt; rb_yjit_option_disable(&lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;);
        opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;yjit &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;rb_yjit_option_disable(); &lt;span style=&#34;color:#75715e&#34;&gt;// set opt-&amp;gt;yjit for Init_ruby_description() and calling rb_yjit_init()
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    }
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    ruby_mn_threads_params();
    Init_ruby_description(opt);
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    ruby_gc_set_params();
    ruby_init_loadpath();
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Among many other things, it sets up options for yjit, &lt;a href=&#34;https://bugs.ruby-lang.org/issues/19842&#34;&gt;mn threads&lt;/a&gt;, the program description, garbage collection params, and the loadpath. That&amp;rsquo;s just scratching the surface of this function. Then around 240 lines into the function, I see a very promising &lt;code&gt;if&lt;/code&gt; statement:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; VALUE
&lt;span style=&#34;color:#a6e22e&#34;&gt;process_options&lt;/span&gt;(&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; argc, &lt;span style=&#34;color:#66d9ef&#34;&gt;char&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt;argv, ruby_cmdline_options_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;opt)
{
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; {
        rb_ast_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ast;
        pm_parse_result_t prism;
    } result &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;};
    &lt;span style=&#34;color:#75715e&#34;&gt;// ... ~240 lines of option handling
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;rb_ruby_prism_p()) {
        ast_value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; process_script(opt);
        &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;(result.ast &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rb_ruby_ast_data_get(ast_value))) &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; Qfalse;
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
        prism_script(opt, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;result.prism);
    }
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The beginning of the function sets up a struct that contains either a &lt;code&gt;rb_ast_t&lt;/code&gt;, or a &lt;code&gt;pm_parse_result_t&lt;/code&gt;. &lt;a href=&#34;https://github.com/ruby/prism&#34;&gt;Prism&lt;/a&gt; is the new default Ruby parser as of Ruby 3.4, so we&amp;rsquo;re getting close. &lt;code&gt;rb_ast_t&lt;/code&gt; must be the format for the prior CRuby parser.&lt;/p&gt;
&lt;p&gt;From a naming perspective, I would &lt;em&gt;never&lt;/em&gt; have guessed that &lt;code&gt;ruby_options&lt;/code&gt; is the place that parses our Ruby code. In principle I guess this is all preamble to actually running the program, so it &lt;em&gt;kind of&lt;/em&gt; relates.&lt;/p&gt;
&lt;p&gt;I won&amp;rsquo;t dig into &lt;code&gt;prism_script&lt;/code&gt;, since it would create our &lt;a href=&#34;https://en.wikipedia.org/wiki/Abstract_syntax_tree&#34;&gt;Abstract Syntax Tree (AST)&lt;/a&gt;, which I expect later will be used by the compiler:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;typedef&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;struct&lt;/span&gt; {
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;    &lt;span style=&#34;color:#75715e&#34;&gt;/** The resulting scope node that will hold the generated AST. */&lt;/span&gt;
    pm_scope_node_t node;
    &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;} pm_parse_result_t;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Ok, here we go! I think we&amp;rsquo;ve got it with this next section! The &lt;code&gt;pm_scope_node_t node&lt;/code&gt; (which should be set by &lt;code&gt;prism_script&lt;/code&gt;) is used to create our &lt;code&gt;rb_iseq_t *iseq&lt;/code&gt; inside of &lt;code&gt;pm_iseq_new_main&lt;/code&gt;!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// ~320 lines into the function
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;pm_parse_result_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;pm &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;result.prism;
&lt;span style=&#34;color:#66d9ef&#34;&gt;int&lt;/span&gt; error_state;
iseq &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; pm_iseq_new_main(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;pm&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;node, opt&lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;script_name, path, parent, optimize, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;error_state);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We now have the entrypoint into creating our &lt;code&gt;InstructionSequence&lt;/code&gt; (our &lt;code&gt;iseq&lt;/code&gt;, or &lt;code&gt;rb_iseq_t&lt;/code&gt;). I wanted to start digging into the actual compiler, but I think I&amp;rsquo;ll stop here for today.&lt;/p&gt;
&lt;p&gt;Now that we know the entrypoint into the compiler, we can start figuring out what code might need to change to add a new bytecode instruction. Next up, I&amp;rsquo;m hoping we can find the appropriate area that needs that change. See you then! 👋🏼&lt;/p&gt;
</description>
      <source:markdown>In [Adding `opt_respond_to` to the Ruby VM: part 1](https://jpcamara.com/2024/12/22/adding-optrespondto-to.html), inspired by recent JSON gem optimizations, I set out my goal: I want to add a new bytecode instruction to the Ruby VM which optimizes `respond_to?` calls. I took this Ruby code:


```ruby
if $stdout.respond_to?(:write)
  puts &#34;Did you know you can write to $stdout?&#34;
end
```
And identified what bytecode instructions matter most (I think):


```text
== disasm: #&lt;ISeq:&lt;compiled&gt;@&lt;compiled&gt;:1 (1,0)-(3,3)&gt;
# ...
0002 putobject                 :write
0004 opt_send_without_block    &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
```
This seems pretty low-level! But it&#39;s still very high-level in terms of what I need to actually _do_. I know what instructions matter, but how can I _change_ them?


### A little help from Git


Thankfully, [@byroot](https://bsky.app/profile/byroot.bsky.social) gave me some helpful direction in the form of a recent PR. [Étienne Barrié](https://github.com/etiennebarrie) recently merged a PR to [add optimized instructions for frozen literal Hash and Array](https://github.com/ruby/ruby/pull/11406). It adds special handling for frozen, empty arrays and hashes so that when you use them in your code, calls like `[].freeze` will not result in any additional object allocations. Cool! A neat enhancement, and kind of perfect for me to analyze.


I think the use-case is simpler than what I&#39;d need for `opt_respond_to`, but looking at that PR I can see that an important part of adding a new bytecode instruction lives in `compile.c`. I&#39;ll eventually need to add some new logic there, but what steps does CRuby take to _get_ to `compile.c`?


### Starting from main


Knowing a bit about the CRuby source, and how C programs start, I know there is a `main` function that kicks everything off. In CRuby, it&#39;s helpfully in `main.c`, at the root of the project:


```c
int
main(int argc, char **argv)
{
    //...
    return rb_main(argc, argv);
}

static int
rb_main(int argc, char **argv)
{
    RUBY_INIT_STACK;
    ruby_init();
    return ruby_run_node(ruby_options(argc, argv));
}
```
I&#39;m going to guess that I don&#39;t need `RUBY_INIT_STACK` or `ruby_init` for adding a new instruction. I took a peek at them, and they seem related to setting up the runtime and creating the data structures needed by the Ruby virtual machine. Past that, there are only two function calls: `ruby_options` and `ruby_run_node`. `ruby_options` sounds like it would just gather the options needed for the program. Maybe we need to go into `ruby_run_node`?


```c
int
ruby_run_node(void *n)
{
    rb_execution_context_t *ec = GET_EC();
    int status;
    if (!ruby_executable_node(n, &amp;status)) {
        rb_ec_cleanup(ec, (NIL_P(ec-&gt;errinfo) ? TAG_NONE : TAG_RAISE));
        return status;
    }
    return rb_ec_cleanup(ec, rb_ec_exec_node(ec, n));
}
```
Maybe? It looks like if the &#34;node&#34; `n` isn&#39;t executable, it fails. I&#39;ll concentrate instead on the success path on the last line. `rb_ec_exec_node` will run first, followed by `rb_ec_cleanup`.


&gt; 📝 All this `ec_*` stuff seems to stand for `execution_context`, which presumably is the state of the runtime at any given point?
&gt;
&gt; 📝 I&#39;m not Mr. C programmer - so I didn&#39;t know what `void *n` meant. Looking it up, this seems to be a way of specifying a generic pointer type that can point to any data type.


Let&#39;s start by checking `rb_ec_exec_node`:


```c
static int
rb_ec_exec_node(rb_execution_context_t *ec, void *n)
{
    volatile int state;
    rb_iseq_t *iseq = (rb_iseq_t *)n;
    if (!n) return 0;

    EC_PUSH_TAG(ec);
    if ((state = EC_EXEC_TAG()) == TAG_NONE) {
        rb_iseq_eval_main(iseq);
    }
    EC_POP_TAG();
    return state;
}
```
Hmmmm. This is the first place where I don&#39;t like what I&#39;m seeing. My primary concern is that `void *n` is getting cast to `rb_iseq_t`. The class we used to `compile` our Ruby sample in part 1 - `RubyVM::InstructionSequence` - is defined in a C file called `iseq.c`. So in CRuby, `iseq` stands for &#34;InstructionSequence&#34;. If we already have an `iseq`, I think it means our code has already been compiled and we&#39;ve gone too far.
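
To make this concrete at the Ruby level (a rough analogy, not the C path itself): compiling source gives you an `InstructionSequence`, and evaluating it is a separate, later step. If `rb_ec_exec_node` is already holding an iseq, compilation must have happened before we got here:


```ruby
# Compiling and evaluating are distinct steps - by the time you hold
# an iseq, the compiler has already run
iseq = RubyVM::InstructionSequence.compile(&#34;1 + 2&#34;)
p iseq.class # =&gt; RubyVM::InstructionSequence
p iseq.eval  # =&gt; 3
```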


### Stepping back to `ruby_options`


`ruby_run_node` doesn&#39;t do much aside from calling `rb_ec_exec_node`. So if `ruby_run_node` and `rb_ec_exec_node` are not the right functions... that only leaves `ruby_options`. Not what I would expect, but let&#39;s check:


```c
void *
ruby_options(int argc, char **argv)
{
    rb_execution_context_t *ec = GET_EC();
    enum ruby_tag_type state;
    void *volatile iseq = 0;

    EC_PUSH_TAG(ec);
    if ((state = EC_EXEC_TAG()) == TAG_NONE) {
        iseq = ruby_process_options(argc, argv);
    }
    else {
        rb_ec_clear_current_thread_trace_func(ec);
        int exitcode = error_handle(ec, ec-&gt;errinfo, state);
        ec-&gt;errinfo = Qnil; /* just been handled */
        iseq = (void *)INT2FIX(exitcode);
    }
    EC_POP_TAG();
    return iseq;
}
```

There&#39;s a lot going on in here, but I&#39;m drawn to the `iseq = ruby_process_options(argc, argv)` line. Let&#39;s dig into `ruby_process_options`. This is a big one:

```c
void *
ruby_process_options(int argc, char **argv)
{
    ruby_cmdline_options_t opt;
    VALUE iseq;
    const char *script_name = (argc &gt; 0 &amp;&amp; argv[0]) ? argv[0] : ruby_engine;

    if (!origarg.argv || origarg.argc &lt;= 0) {
        origarg.argc = argc;
        origarg.argv = argv;
    }
    set_progname(external_str_new_cstr(script_name));  /* for the time being */
    rb_argv0 = rb_str_new4(rb_progname);
    rb_vm_register_global_object(rb_argv0);

#ifndef HAVE_SETPROCTITLE
    ruby_init_setproctitle(argc, argv);
#endif

    iseq = process_options(argc, argv, cmdline_options_init(&amp;opt));

    //...

    return (void*)(struct RData*)iseq;
}
```
Most of this function seems to be VM setup. But I think we&#39;re getting closer with `iseq = process_options(...)`.


Checking `process_options`... whoa, this is a ~350 line function! It&#39;s a bit much to all paste in here, but scanning the code, I think we&#39;re on the right track. There are all sorts of option initializations here:


```c
static VALUE
process_options(int argc, char **argv, ruby_cmdline_options_t *opt)
{
    //...
    if (FEATURE_SET_P(opt-&gt;features, yjit)) {
        bool rb_yjit_option_disable(void);
        opt-&gt;yjit = !rb_yjit_option_disable(); // set opt-&gt;yjit for Init_ruby_description() and calling rb_yjit_init()
    }
    //...
    ruby_mn_threads_params();
    Init_ruby_description(opt);
    //...
    ruby_gc_set_params();
    ruby_init_loadpath();
    //...
}
```
Among many other things, it sets up options for yjit, [mn threads](https://bugs.ruby-lang.org/issues/19842), the program description, garbage collection params, and the loadpath. That&#39;s just scratching the surface of this function. Then around 240 lines into the function, I see a very promising `if` statement:


```c
static VALUE
process_options(int argc, char **argv, ruby_cmdline_options_t *opt)
{
    //...
    struct {
        rb_ast_t *ast;
        pm_parse_result_t prism;
    } result = {0};
    // ... ~240 lines of option handling
    if (!rb_ruby_prism_p()) {
        ast_value = process_script(opt);
        if (!(result.ast = rb_ruby_ast_data_get(ast_value))) return Qfalse;
    }
    else {
        prism_script(opt, &amp;result.prism);
    }
```
The beginning of the function sets up a struct that contains either a `rb_ast_t`, or a `pm_parse_result_t`. [Prism](https://github.com/ruby/prism) is the new default Ruby parser as of Ruby 3.4, so we&#39;re getting close. `rb_ast_t` must be the format for the prior CRuby parser.
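
As a Ruby-level aside, the `prism` gem exposes a `ParseResult` that appears to be the counterpart of `pm_parse_result_t` - the mapping is my educated guess from the names, but it holds the AST the compiler will consume:


```ruby
require &#34;prism&#34;

result = Prism.parse(&#34;1 + 2&#34;)
p result.class       # =&gt; Prism::ParseResult
p result.value.class # =&gt; Prism::ProgramNode, the root of the AST
```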


From a naming perspective, I would _never_ have guessed that `ruby_options` is the place that parses our Ruby code. In principle I guess this is all preamble to actually running the program, so it _kind of_ relates.


I won&#39;t dig into `prism_script`, since it would create our [Abstract Syntax Tree (AST)](https://en.wikipedia.org/wiki/Abstract_syntax_tree), which I expect later will be used by the compiler:


```c
typedef struct {
    //...
    /** The resulting scope node that will hold the generated AST. */
    pm_scope_node_t node;
    //...
} pm_parse_result_t;
```
Ok, here we go! I think we&#39;ve got it with this next section! The `pm_scope_node_t node` (which should be set by `prism_script`) is used to create our `rb_iseq_t *iseq` inside of `pm_iseq_new_main`!


```c
// ~320 lines into the function
pm_parse_result_t *pm = &amp;result.prism;
int error_state;
iseq = pm_iseq_new_main(&amp;pm-&gt;node, opt-&gt;script_name, path, parent, optimize, &amp;error_state);
```
We now have the entrypoint into creating our `InstructionSequence` (our `iseq`, or `rb_iseq_t`). I wanted to start digging into the actual compiler, but I think I&#39;ll stop here for today.


Now that we know the entrypoint into the compiler, we can start figuring out what code might need to change to add a new bytecode instruction. Next up, I&#39;m hoping we can find the appropriate area that needs that change. See you then! 👋🏼

</source:markdown>
    </item>
    
    <item>
      <title>Adding `opt_respond_to` to the Ruby VM: part 1</title>
      <link>https://jpcamara.com/2024/12/22/adding-optrespondto-to.html</link>
      <pubDate>Sun, 22 Dec 2024 01:22:24 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/22/adding-optrespondto-to.html</guid>
      <description>&lt;p&gt;&lt;a href=&#34;https://bsky.app/profile/byroot.bsky.social&#34;&gt;@byroot&lt;/a&gt; has been posting &lt;a href=&#34;https://byroot.github.io/ruby/json/2024/12/15/optimizing-ruby-json-part-1.html&#34;&gt;a series on optimizations he and others have made&lt;/a&gt; to the &lt;a href=&#34;https://github.com/ruby/json&#34;&gt;json&lt;/a&gt; gem, and it&amp;rsquo;s been 🔥🔥🔥. Enjoyable and informative, I highly recommend reading what he&amp;rsquo;s posted so far.&lt;/p&gt;
&lt;p&gt;In his &lt;a href=&#34;https://byroot.github.io/ruby/json/2024/12/18/optimizing-ruby-json-part-2.html#inline-caches&#34;&gt;second post&lt;/a&gt;, he mentions the possibility of improving performance by adding an additional method cache. This would involve compiling &lt;code&gt;respond_to?&lt;/code&gt; calls in a special way:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It actually wouldn’t be too hard to add such a cache, we’d need to modify the Ruby compiler to compile respond_to? calls into a specialized opt_respond_to instruction that does have two caches instead of one. The first cache would be used to look up respond_to? on the object to make sure it wasn’t redefined, and the second one to look up the method we’re interested in. Or perhaps even 3 caches, as you also need to check if the object has a respond_to_missing? method defined in some cases.&lt;/p&gt;
&lt;p&gt;That’s an idea I remember discussing in the past with some fellow committers, but I can’t quite remember if there was a reason we didn’t do it yet.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Inspired by his comment, I&amp;rsquo;m going to add a new bytecode instruction - &lt;code&gt;opt_respond_to&lt;/code&gt; - to the Ruby VM, for fun. I don&amp;rsquo;t know how to add a new bytecode instruction (yet). I don&amp;rsquo;t know if one would get accepted by the Ruby team. I don&amp;rsquo;t know if adding it will actually provide a meaningful enhancement to performance. But let&amp;rsquo;s give it a try, shall we?&lt;/p&gt;
&lt;h3 id=&#34;understanding-the-requirements&#34;&gt;Understanding the requirements&lt;/h3&gt;
&lt;p&gt;I know I want to add a new Ruby Virtual Machine bytecode instruction called &lt;code&gt;opt_respond_to&lt;/code&gt;. What does code using &lt;code&gt;respond_to?&lt;/code&gt; look like today, after being compiled? Here&amp;rsquo;s some code to evaluate:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; $stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;)
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Did you know you can write to $stdout?&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We can compile it using &lt;code&gt;RubyVM::InstructionSequence&lt;/code&gt;. We &lt;code&gt;compile&lt;/code&gt; the code, then we &lt;code&gt;disassemble&lt;/code&gt; it to see the actual Ruby bytecode:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;puts &lt;span style=&#34;color:#66d9ef&#34;&gt;RubyVM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;InstructionSequence&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;compile(&lt;span style=&#34;color:#66d9ef&#34;&gt;DATA&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;disassemble

&lt;span style=&#34;color:#75715e&#34;&gt;__END__
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;if $stdout.respond_to?(:write)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;  puts &amp;#34;Did you know you can write to $stdout?&amp;#34;
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;end
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 The &lt;code&gt;__END__&lt;/code&gt; format is just a convenient way of supplying some text to your program. Here we put all our Ruby code we want to compile after &lt;code&gt;__END__&lt;/code&gt; and it will be available to our program as an IO object called &lt;code&gt;DATA&lt;/code&gt;. Thanks to &lt;a href=&#34;https://bsky.app/profile/drbragg.dev/post/3lcdf5deh7k2a&#34;&gt;Drew Bragg&lt;/a&gt; for the tip!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Compiling using &lt;code&gt;InstructionSequence&lt;/code&gt; gives us:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;== disasm: #&amp;lt;ISeq:&amp;lt;compiled&amp;gt;@&amp;lt;compiled&amp;gt;:1 (1,0)-(3,3)&amp;gt;
0000 getglobal                    :$stdout                  (   1)[Li]
0002 putobject                    :write
0004 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
0006 branchunless                 14
0008 putself                                                (   2)[Li]
0009 putstring                    &amp;#34;Did you know you can write to $stdout?&amp;#34;
0011 opt_send_without_block       &amp;lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&amp;gt;
0013 leave
0014 putnil
0015 leave
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I&amp;rsquo;m going to guess at the parts I think matter most - &lt;code&gt;putobject&lt;/code&gt;, &lt;code&gt;opt_send_without_block&lt;/code&gt; and &lt;code&gt;branchunless&lt;/code&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;putobject&lt;/code&gt; pushes the symbol &lt;code&gt;:write&lt;/code&gt; onto the vm stack&lt;/li&gt;
&lt;li&gt;&lt;code&gt;opt_send_without_block&lt;/code&gt; is given &lt;code&gt;&amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;&lt;/code&gt;. I&amp;rsquo;m guessing &lt;code&gt;calldata&lt;/code&gt; is a format for specifying metadata about what is being called. We&amp;rsquo;ve got the method name, &lt;code&gt;respond_to?&lt;/code&gt;, how many args are being used, &lt;code&gt;argc:1&lt;/code&gt;, and that the arguments are &amp;ldquo;simple&amp;rdquo;, &lt;code&gt;ARGS_SIMPLE&lt;/code&gt;. &lt;code&gt;mid&lt;/code&gt; stands for&amp;hellip; &amp;ldquo;method id&amp;rdquo;?&lt;/li&gt;
&lt;li&gt;&lt;code&gt;branchunless&lt;/code&gt; wouldn&amp;rsquo;t be specifically related to creating a new instruction, but I just think it&amp;rsquo;s informative for how the &lt;code&gt;respond_to?&lt;/code&gt; result is used. I believe it means &amp;ldquo;if the value on top of the stack is falsy, jump to offset 14&amp;rdquo;. 14 in this case is the &lt;code&gt;putnil&lt;/code&gt; near the bottom of the bytecode. Each instruction is prefixed with a zero-padded decimal offset (not hex) that identifies its location in the sequence. &lt;code&gt;putobject&lt;/code&gt; is located at &lt;code&gt;0002&lt;/code&gt;, &lt;code&gt;opt_send_without_block&lt;/code&gt; is located at &lt;code&gt;0004&lt;/code&gt;, &lt;code&gt;branchunless&lt;/code&gt; is located at &lt;code&gt;0006&lt;/code&gt; and &lt;code&gt;putnil&lt;/code&gt; is located at &lt;code&gt;0014&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;I think the only thing I will be changing is taking calls to &lt;code&gt;respond_to?&lt;/code&gt; and compiling them to an &lt;code&gt;opt_respond_to&lt;/code&gt; instruction instead of &lt;code&gt;opt_send_without_block&lt;/code&gt;. We&amp;rsquo;ll see!&lt;/p&gt;
&lt;p&gt;There are actually a few variations for &lt;code&gt;respond_to?&lt;/code&gt; that I wasn&amp;rsquo;t aware of. The interface takes a symbol or a string as the first parameter, and then a boolean for whether to include &lt;code&gt;private&lt;/code&gt; and &lt;code&gt;protected&lt;/code&gt; methods:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;$stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;write&amp;#34;&lt;/span&gt;)
$stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;write&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;)
$stdout&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;respond_to?(&lt;span style=&#34;color:#e6db74&#34;&gt;:write&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Let&amp;rsquo;s see the bytecode for these variations:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;== disasm: #&amp;lt;ISeq:&amp;lt;compiled&amp;gt;@&amp;lt;compiled&amp;gt;:1 (1,0)-(3,33)&amp;gt;
0000 getglobal                    :$stdout                  (   1)[Li]
0002 putstring                    &amp;#34;write&amp;#34;
0004 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&amp;gt;
0006 pop
0007 getglobal                    :$stdout                  (   2)[Li]
0009 putstring                    &amp;#34;write&amp;#34;
0011 putobject                    true
0013 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:2, ARGS_SIMPLE&amp;gt;
0015 pop
0016 getglobal                    :$stdout                  (   3)[Li]
0018 putobject                    :write
0020 putobject                    true
0022 opt_send_without_block       &amp;lt;calldata!mid:respond_to?, argc:2, ARGS_SIMPLE&amp;gt;
0024 leave
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It doesn&amp;rsquo;t make a huge difference. The string version is nearly identical, except the string is pushed with a &lt;code&gt;putstring&lt;/code&gt; instruction instead of &lt;code&gt;putobject&lt;/code&gt;. The boolean versions add an additional &lt;code&gt;putobject&lt;/code&gt; which pushes the boolean onto the stack. Then the &lt;code&gt;opt_send_without_block&lt;/code&gt; call has &lt;code&gt;argc:2&lt;/code&gt; instead of &lt;code&gt;argc:1&lt;/code&gt;. Good to understand, but functionally the same.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;m going to keep these explorations shorter, and break them into parts, so I&amp;rsquo;m going to stop here. So far we&amp;rsquo;ve:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Identified that I want to create an &lt;code&gt;opt_respond_to&lt;/code&gt; instruction for the Ruby VM&lt;/li&gt;
&lt;li&gt;Compiled a simple &lt;code&gt;respond_to?&lt;/code&gt; example, and examined what the current bytecode looks like&lt;/li&gt;
&lt;li&gt;Identified what I &lt;em&gt;think&lt;/em&gt; are the relevant instructions that need to be converted to &lt;code&gt;opt_respond_to&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Next up, I&amp;rsquo;m going to walk through the path CRuby takes from starting the program to compiling our code, starting with &lt;code&gt;main.c&lt;/code&gt;. Here we go!&lt;/p&gt;
</description>
      <source:markdown>[@byroot](https://bsky.app/profile/byroot.bsky.social) has been posting [a series on optimizations he and others have made](https://byroot.github.io/ruby/json/2024/12/15/optimizing-ruby-json-part-1.html) to the [json](https://github.com/ruby/json) gem, and it&#39;s been 🔥🔥🔥. Enjoyable and informative, I highly recommend reading what he&#39;s posted so far.

In his [second post](https://byroot.github.io/ruby/json/2024/12/18/optimizing-ruby-json-part-2.html#inline-caches), he mentions the possibility of improving performance by adding an additional method cache. This would involve compiling `respond_to?` calls in a special way:

&gt; It actually wouldn’t be too hard to add such a cache, we’d need to modify the Ruby compiler to compile respond_to? calls into a specialized opt_respond_to instruction that does have two caches instead of one. The first cache would be used to look up respond_to? on the object to make sure it wasn’t redefined, and the second one to look up the method we’re interested in. Or perhaps even 3 caches, as you also need to check if the object has a respond_to_missing? method defined in some cases.
&gt;
&gt; That’s an idea I remember discussing in the past with some fellow committers, but I can’t quite remember if there was a reason we didn’t do it yet.

Inspired by his comment, I&#39;m going to add a new bytecode instruction - `opt_respond_to` - to the Ruby VM, for fun. I don&#39;t know how to add a new bytecode instruction (yet). I don&#39;t know if one would get accepted by the Ruby team. I don&#39;t know if adding it will actually provide a meaningful enhancement to performance. But let&#39;s give it a try, shall we?

### Understanding the requirements

I know I want to add a new Ruby Virtual Machine bytecode instruction called `opt_respond_to`. What does code using `respond_to?` look like today, after being compiled? Here&#39;s some code to evaluate:


```ruby
if $stdout.respond_to?(:write)
  puts &#34;Did you know you can write to $stdout?&#34;
end
```
We can compile it using `RubyVM::InstructionSequence`. We `compile` the code, then we `disassemble` it to see the actual Ruby bytecode:


```ruby
puts RubyVM::InstructionSequence.compile(DATA.read).disassemble

__END__
if $stdout.respond_to?(:write)
  puts &#34;Did you know you can write to $stdout?&#34;
end
```
&gt; 📝 The `__END__` format is just a convenient way of supplying some text to your program. Here we put all our Ruby code we want to compile after `__END__` and it will be available to our program as an IO object called `DATA`. Thanks to [Drew Bragg](https://bsky.app/profile/drbragg.dev/post/3lcdf5deh7k2a) for the tip!
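
For anyone who hasn&#39;t used it, here&#39;s the trick in isolation:


```ruby
# Everything after __END__ is exposed to the script as the IO object DATA
puts DATA.read.upcase

__END__
hello from the data section
```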


Compiling using `InstructionSequence` gives us:


```text
== disasm: #&lt;ISeq:&lt;compiled&gt;@&lt;compiled&gt;:1 (1,0)-(3,3)&gt;
0000 getglobal                    :$stdout                  (   1)[Li]
0002 putobject                    :write
0004 opt_send_without_block       &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
0006 branchunless                 14
0008 putself                                                (   2)[Li]
0009 putstring                    &#34;Did you know you can write to $stdout?&#34;
0011 opt_send_without_block       &lt;calldata!mid:puts, argc:1, FCALL|ARGS_SIMPLE&gt;
0013 leave
0014 putnil
0015 leave
```
I&#39;m going to guess at the parts I think matter most - `putobject`, `opt_send_without_block` and `branchunless`.


- `putobject` pushes the symbol `:write` onto the vm stack
- `opt_send_without_block` is given `&lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;`. I&#39;m guessing `calldata` is a format for specifying metadata about what is being called. We&#39;ve got the method name, `respond_to?`, how many args are being used, `argc:1`, and that the arguments are &#34;simple&#34;, `ARGS_SIMPLE`. `mid` stands for... &#34;method id&#34;?
- `branchunless` wouldn&#39;t be specifically related to creating a new instruction, but I just think it&#39;s informative for how the `respond_to?` result is used. I believe it means &#34;if the value on top of the stack is falsy, jump to offset 14&#34;. 14 in this case is the `putnil` near the bottom of the bytecode. Each instruction is prefixed with a zero-padded decimal offset (not hex) that identifies its location in the sequence. `putobject` is located at `0002`, `opt_send_without_block` is located at `0004`, `branchunless` is located at `0006` and `putnil` is located at `0014` (there&#39;s a quick `branchunless` demo right after this list)
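
To watch `branchunless` on something smaller, here&#39;s a one-expression conditional you can disassemble yourself, using the same `compile`/`disassemble` combo as above:


```ruby
# When $stdin.tty? is falsy, branchunless jumps past the puts call to
# a putnil, just like the :write example above
puts RubyVM::InstructionSequence.compile(&#34;puts :tty if $stdin.tty?&#34;).disassemble
```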


I think the only thing I will be changing is taking calls to `respond_to?` and compiling them to an `opt_respond_to` instruction instead of `opt_send_without_block`. We&#39;ll see!


There are actually a few variations for `respond_to?` that I wasn&#39;t aware of. The interface takes a symbol or a string as the first parameter, and then a boolean for whether to include `private` and `protected` methods:


```ruby
$stdout.respond_to?(&#34;write&#34;)
$stdout.respond_to?(&#34;write&#34;, true)
$stdout.respond_to?(:write, true)
```
Let&#39;s see the bytecode for these variations:


```text
== disasm: #&lt;ISeq:&lt;compiled&gt;@&lt;compiled&gt;:1 (1,0)-(3,33)&gt;
0000 getglobal                    :$stdout                  (   1)[Li]
0002 putstring                    &#34;write&#34;
0004 opt_send_without_block       &lt;calldata!mid:respond_to?, argc:1, ARGS_SIMPLE&gt;
0006 pop
0007 getglobal                    :$stdout                  (   2)[Li]
0009 putstring                    &#34;write&#34;
0011 putobject                    true
0013 opt_send_without_block       &lt;calldata!mid:respond_to?, argc:2, ARGS_SIMPLE&gt;
0015 pop
0016 getglobal                    :$stdout                  (   3)[Li]
0018 putobject                    :write
0020 putobject                    true
0022 opt_send_without_block       &lt;calldata!mid:respond_to?, argc:2, ARGS_SIMPLE&gt;
0024 leave
```
It doesn&#39;t make a huge difference. The string version is nearly identical, except the string is pushed with a `putstring` instruction instead of `putobject`. The boolean versions add an additional `putobject` which pushes the boolean onto the stack. Then the `opt_send_without_block` call has `argc:2` instead of `argc:1`. Good to understand, but functionally the same.
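
As an aside, here&#39;s what that boolean actually changes at runtime - by default `respond_to?` ignores private and protected methods, and passing `true` includes them:


```ruby
class Greeter
  private

  def hello
    &#34;hi&#34;
  end
end

greeter = Greeter.new
p greeter.respond_to?(:hello)       # =&gt; false - private methods are excluded by default
p greeter.respond_to?(:hello, true) # =&gt; true - private and protected methods included
```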


I&#39;m going to keep these explorations shorter, and break them into parts, so I&#39;m going to stop here. So far we&#39;ve:


- Identified that I want to create an `opt_respond_to` instruction for the Ruby VM
- Compiled a simple `respond_to?` example, and examined what the current bytecode looks like
- Identified what I _think_ are the relevant instructions that need to be converted to `opt_respond_to`


Next up, I&#39;m going to walk through the path CRuby takes from starting the program to compiling our code, starting with `main.c`. Here we go!

</source:markdown>
    </item>
    
    <item>
      <title>20 days of ruby gems: part 1</title>
      <link>https://jpcamara.com/2024/12/16/days-of-ruby.html</link>
      <pubDate>Mon, 16 Dec 2024 20:12:46 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/16/days-of-ruby.html</guid>
      <description>&lt;p&gt;Over on BlueSky, &lt;a href=&#34;https://bsky.app/profile/skillstopractice.com&#34;&gt;Gregory Brown&lt;/a&gt; suggested a &lt;a href=&#34;https://bsky.app/hashtag/20DayGemChallenge&#34;&gt;#20daygemchallenge&lt;/a&gt;. Post gems you’ve either used time and time again or that have inspired you in some way, in no particular order. &lt;a href=&#34;https://bsky.app/profile/onghu.com&#34;&gt;Mohit Sindhwani&lt;/a&gt; suggested writing about them at the end, which sounded like a great idea!&lt;/p&gt;
&lt;p&gt;I’m breaking it into two parts. Here’s a breakdown of the first 10 gems I posted:&lt;/p&gt;
&lt;p&gt;First &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lclvyj2ohs2c&#34;&gt;is HTTParty&lt;/a&gt;: &lt;a href=&#34;https://github.com/jnunemaker/httparty&#34;&gt;https://github.com/jnunemaker/httparty&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;HTTParty is the OG HTTP gem. It’s widely used and dead simple. There are a million HTTP options out there, but HTTParty still remains a simple, common option. It sits on top of &lt;code&gt;Net::HTTP&lt;/code&gt;, so it has the same &lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html#dont-use-timeout&#34;&gt;timeout&lt;/a&gt; concern as any &lt;code&gt;Net::HTTP&lt;/code&gt; usage. But &lt;em&gt;many&lt;/em&gt; Ruby HTTP gems use &lt;code&gt;Net::HTTP&lt;/code&gt;, so that’s not a particular knock against it!&lt;/p&gt;
&lt;p&gt;Second &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lcoa52ixyk2x&#34;&gt;is Async&lt;/a&gt;: &lt;a href=&#34;https://github.com/socketry/async&#34;&gt;https://github.com/socketry/async&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Async is the de facto &lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html#async-aside&#34;&gt;FiberScheduler&lt;/a&gt; for Ruby. That’s what allows Fibers to parallelize blocking operations in Ruby 3+. It’s a great gem and a great ecosystem of tools as well, particularly revolving around all things IO and web protocols.&lt;/p&gt;
&lt;p&gt;I talk about it in more detail in my &lt;a href=&#34;https://jpcamara.com/2024/12/14/my-rubyconf-talk.html&#34;&gt;In-Depth Ruby Concurrency&lt;/a&gt; talk from RubyConf and I’ll have future articles about it as well.&lt;/p&gt;
&lt;p&gt;Third &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lcqbrqimfk2z&#34;&gt;is Pitchfork&lt;/a&gt;: &lt;a href=&#34;https://github.com/Shopify/pitchfork&#34;&gt;https://github.com/Shopify/pitchfork&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Pitchfork is an evolution of the Unicorn web server. Its innovation is a forking technique called “Reforking”, where processes are forked multiple times from existing “warm” processes, getting to a point of maximally optimized Copy-on-Write performance.&lt;/p&gt;
&lt;p&gt;Like async, I talk about it in more detail in my &lt;a href=&#34;https://jpcamara.com/2024/12/14/my-rubyconf-talk.html&#34;&gt;In-Depth Ruby Concurrency&lt;/a&gt; talk from RubyConf and I’ll have future articles about it as well.&lt;/p&gt;
&lt;p&gt;Fourth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lcsemdic3k2r&#34;&gt;is Ractor TVar&lt;/a&gt;: &lt;a href=&#34;https://github.com/ko1/ractor-tvar&#34;&gt;https://github.com/ko1/ractor-tvar&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Ractor TVar is an implementation of &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Software_transactional_memory&#34;&gt;software transactional memory&lt;/a&gt; in Ruby. I learned about it from the Mooro gem, which is a later gem pick. It’s a fascinating and largely unknown library that Koichi seems to have released alongside Ractors and not updated since. I’m very curious to read the source more to better understand how it works and maybe even give it a try in some real code. It only documents examples using Ractors, but claims to also support threads.&lt;/p&gt;
&lt;p&gt;Fifth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lcusuj5pzc2o&#34;&gt;is Strong Migrations&lt;/a&gt;: &lt;a href=&#34;https://github.com/ankane/strong_migrations&#34;&gt;https://github.com/ankane/strong_migrations&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;I prefer my database migrations to be stress-free and zero-downtime. The strong migrations gem helps me sleep at night. It detects unsafe operations and blocks them from being run, offering safe alternatives.&lt;/p&gt;
&lt;p&gt;There are a few other gems that help keep migrations safe, but I prefer the explicit style of strong migrations. Most other options are a bit “magical”, and will try to rewrite things for you.&lt;/p&gt;
&lt;p&gt;Sixth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lcxfpa53hk24&#34;&gt;is ZipKit&lt;/a&gt;: &lt;a href=&#34;https://github.com/julik/zip_kit&#34;&gt;https://github.com/julik/zip_kit&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;I’ve mostly used ZipTricks in the past, and ZipKit is the successor to that gem. Being able to stream writes to a zip file is amazing for scaling and ZipKit makes it dead simple.&lt;/p&gt;
&lt;p&gt;Using it you can, for instance, stream a file to S3, zipping it on the fly as it’s being uploaded! Streaming is the only way you can reasonably manage operations on very large files, so having this option is critical.&lt;/p&gt;
&lt;p&gt;Seventh &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3lczwvdvqhk2t&#34;&gt;is Falcon&lt;/a&gt;: &lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;https://github.com/socketry/falcon&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Falcon is a web server based on the FiberScheduler (provided by the async gem). It’s a very scalable server, particularly for IO bound operations. There’s a great talk focusing on it from RailsConf 2023 called &lt;a href=&#34;https://youtu.be/27uVIIgguQg&#34;&gt;Look ma, no jobs&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I also benchmarked its WebSocket performance against a node.js implementation, and it came out very close!&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/f9240bd15e.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Here’s the code for it:&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://gist.github.com/jpcamara/8a1a09c9c67347c4e32384b9ce806b70&#34;&gt;https://gist.github.com/jpcamara/8a1a09c9c67347c4e32384b9ce806b70&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Like async and pitchfork, I talk about it in more detail in my &lt;a href=&#34;https://jpcamara.com/2024/12/14/my-rubyconf-talk.html&#34;&gt;In-Depth Ruby Concurrency&lt;/a&gt; talk from RubyConf and I’ll have future articles about it as well.&lt;/p&gt;
&lt;p&gt;Eighth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3ld5hvh6lt22u&#34;&gt;is OJ&lt;/a&gt;: &lt;a href=&#34;https://github.com/ohler55/oj&#34;&gt;https://github.com/ohler55/oj&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;OJ has historically been the fastest JSON parser in the Ruby world. Usually you can just drop it into a project as a JSON replacement and see things immediately speed up, especially if you are doing any heavy JSON processing.&lt;/p&gt;
&lt;p&gt;The &lt;a href=&#34;https://github.com/ruby/json&#34;&gt;JSON&lt;/a&gt; gem was recently taken over by the Ruby GitHub organization and &lt;a href=&#34;https://bsky.app/profile/byroot.bsky.social&#34;&gt;byroot&lt;/a&gt; has been making some big performance improvements to it - maybe we’ll see parity at some point but OJ is still a great choice.&lt;/p&gt;
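&lt;p&gt;The lowest-effort drop-in is &lt;code&gt;Oj.mimic_JSON&lt;/code&gt;, which patches the &lt;code&gt;JSON&lt;/code&gt; module to run on OJ - or you can call it directly:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &#39;oj&#39;

Oj.mimic_JSON # JSON.parse / JSON.generate now use OJ under the hood

json = Oj.dump({ &#39;id&#39; =&amp;gt; 1, &#39;tags&#39; =&amp;gt; %w[ruby json] })
puts Oj.load(json)[&#39;tags&#39;].first # ruby
&lt;/code&gt;&lt;/pre&gt;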
&lt;p&gt;Ninth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3ldaheo6qvs27&#34;&gt;is io-event&lt;/a&gt;: &lt;a href=&#34;https://github.com/socketry/io-event&#34;&gt;https://github.com/socketry/io-event&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The async gem is the public interface, but io-event is what powers the scheduling at the OS level. It provides all of the integrations with each operating system’s kernel event queue: io_uring and epoll on Linux, and kqueue on macOS. IOCP support for Windows is still in progress, so it falls back to a basic Ruby &lt;code&gt;select&lt;/code&gt; there. If you don’t know why any of that is useful: it’s an important part of keeping the “Reactor” pattern of asynchronous IO efficient.&lt;/p&gt;
&lt;p&gt;Like async, pitchfork and falcon (😅), I talk about the reactor pattern in more detail in my &lt;a href=&#34;https://jpcamara.com/2024/12/14/my-rubyconf-talk.html&#34;&gt;In-Depth Ruby Concurrency&lt;/a&gt; talk from RubyConf and I’ll have future articles about it as well. I obviously like concurrency 🙂.&lt;/p&gt;
&lt;p&gt;Tenth &lt;a href=&#34;https://bsky.app/profile/jpcamara.com/post/3ldbmqyjzss27&#34;&gt;is Glimmer&lt;/a&gt;: &lt;a href=&#34;https://github.com/AndyObtiva/glimmer&#34;&gt;https://github.com/AndyObtiva/glimmer&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;I wasn’t familiar with Glimmer but I learned about it at RubyConf. It’s a DSL for building UIs in pure Ruby, with bindings for desktop UI layers as well as the web. It’s a really cool concept and I look forward to learning more about it by watching &lt;a href=&#34;https://youtu.be/TTSqRdTVtDY&#34;&gt;How to build basic desktop applications in Ruby&lt;/a&gt;. I’ve been working on a cross-platform app using React Native and Tauri - maybe I’ll port some of it to Glimmer as an experiment.&lt;/p&gt;
&lt;p&gt;After I finish 11 through 20, I’ll post about them as well. Give these gems a try! 👋&lt;/p&gt;
</description>
      <source:markdown>Over on BlueSky, [Gregory Brown](https://bsky.app/profile/skillstopractice.com) suggested a [\#20daygemchallenge](https://bsky.app/hashtag/20DayGemChallenge). Post gems you’ve either used time and time again, or have inspired you in some way, in no particular order. [Mohit Sindhwani](https://bsky.app/profile/onghu.com) suggested writing about them at the end, which sounded like a great idea!

I’m breaking it into two parts. Here’s my breakdown of my first 10 gems posted:

First [is HTTParty](https://bsky.app/profile/jpcamara.com/post/3lclvyj2ohs2c): [https://github.com/jnunemaker/httparty](https://github.com/jnunemaker/httparty)

HTTParty is the OG http gem. It’s widely used and dead simple. There are a million http options out there, but HTTParty still remains a simple, common option. It sits on top of `Net::HTTP`, so it has the same [timeout](https://jpcamara.com/2024/08/26/the-thread-api.html#dont-use-timeout) concern as any `Net::HTTP` usage. But _many_ Ruby http gems use `Net::HTTP`, so that’s not a particular knock against it!
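
A typical call looks like this - note the explicit `timeout`, given the `Net::HTTP` caveat above (the URL is just an example):

```ruby
require 'httparty'

response = HTTParty.get(
  'https://api.github.com/repos/jnunemaker/httparty',
  timeout: 5 # always set one - Net::HTTP defaults are generous
)
puts response.code
puts response.parsed_response['description'] if response.success?
```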

Second [is Async](https://bsky.app/profile/jpcamara.com/post/3lcoa52ixyk2x): [https://github.com/socketry/async](https://github.com/socketry/async)

Async is the de facto [FiberScheduler](https://jpcamara.com/2024/07/15/ruby-methods-are.html#async-aside) for Ruby. That’s what allows Fibers to parallelize blocking operations in Ruby 3+. It’s a great gem and a great ecosystem of tools as well, particularly revolving around all things IO and web protocols.

I talk about it in more detail in my [In-Depth Ruby Concurrency](https://jpcamara.com/2024/12/14/my-rubyconf-talk.html) talk from RubyConf and I’ll have future articles about it as well.
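
If you haven’t tried it, here’s a minimal sketch of what that looks like (the numbers and timings are just illustrative) - inside an `Async` block, blocking calls like `sleep` yield to the scheduler instead of blocking the thread:

```ruby
require 'async'

Async do |task|
  # all three child tasks run concurrently on a single thread
  tasks = 3.times.map do |i|
    task.async do
      sleep 1 # non-blocking inside the FiberScheduler
      i
    end
  end
  tasks.each { |t| puts t.wait } # prints 0, 1, 2 after ~1 second total
end
```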

Third [is Pitchfork](https://bsky.app/profile/jpcamara.com/post/3lcqbrqimfk2z): [https://github.com/Shopify/pitchfork](https://github.com/Shopify/pitchfork)

Pitchfork is an evolution of the Unicorn web server. Its innovation is a forking technique called “Reforking”, where processes are forked multiple times from existing “warm” processes, getting to a point of maximally optimized Copy-on-Write performance. 
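
Configuration-wise it’s very Unicorn-like. From memory of the README (so treat this as a sketch, not gospel), the reforking knob looks something like this:

```ruby
# pitchfork.conf.rb
worker_processes 8

# promote a warm worker to be the new “mold” after these request
# counts, so later forks inherit maximally-warmed Copy-on-Write memory
refork_after [50, 100, 1000]
```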

Like async, I talk about it in more detail in my [In-Depth Ruby Concurrency](https://jpcamara.com/2024/12/14/my-rubyconf-talk.html) talk from RubyConf and I’ll have future articles about it as well.

Fourth [is Ractor TVar](https://bsky.app/profile/jpcamara.com/post/3lcsemdic3k2r): [https://github.com/ko1/ractor-tvar](https://github.com/ko1/ractor-tvar)

Ractor TVar is an implementation of [software transactional memory](https://en.m.wikipedia.org/wiki/Software_transactional_memory) in Ruby. I learned about it from the Mooro gem, which is a later gem pick. It’s a fascinating and largely unknown library that Koichi (ko1) seems to have released alongside Ractors and not updated since. I’m very curious to read the source more to better understand how it works, and maybe even give it a try in some real code. Its examples only show Ractors, but it claims to support threads as well.
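
Going by its README, usage looks roughly like this - `Ractor.atomically` retries the transaction if another one conflicts. A sketch only, since I haven’t exercised this in real code yet:

```ruby
require 'ractor/tvar'

tv = Ractor::TVar.new(0)

ractors = 2.times.map do
  Ractor.new(tv) do |counter|
    1_000.times do
      # the transaction retries on conflicting writes
      Ractor.atomically { counter.value += 1 }
    end
  end
end

ractors.each { |r| r.take }
puts Ractor.atomically { tv.value } # 2000
```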

Fifth [is Strong Migrations](https://bsky.app/profile/jpcamara.com/post/3lcusuj5pzc2o): [https://github.com/ankane/strong\_migrations](https://github.com/ankane/strong_migrations)

I prefer my database migrations to be stress-free and zero-downtime. The strong migrations gem helps me sleep at night. It detects unsafe operations and blocks them from being run, offering safe alternatives. 

There are a few other gems that help keep migrations safe, but I prefer the explicit style of strong migrations. Most other options are a bit “magical”, and will try to rewrite things for you.
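
As a (hypothetical) example: strong migrations will block a bare `remove_column` and print the safe recipe; once you’ve followed it, you mark the operation with `safety_assured`:

```ruby
class RemoveLegacyFlagFromUsers < ActiveRecord::Migration[7.1]
  def change
    # raises by default; safe once the column is ignored in the model
    safety_assured { remove_column :users, :legacy_flag }
  end
end
```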

Sixth [is ZipKit](https://bsky.app/profile/jpcamara.com/post/3lcxfpa53hk24): [https://github.com/julik/zip\_kit](https://github.com/julik/zip_kit)

I’ve mostly used ZipTricks in the past, and ZipKit is the successor to that gem. Being able to stream writes to a zip file is amazing for scaling and ZipKit makes it dead simple. 

Using it you can, for instance, stream a file to S3, zipping it on the fly as it’s being uploaded! Streaming is the only way you can reasonably manage operations on very large files, so having this option is critical.
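
Here’s a rough sketch of the streamer API from the README (the destination can be any writable IO - a file here, but just as easily an S3 multipart upload):

```ruby
require 'zip_kit'

File.open('archive.zip', 'wb') do |io|
  ZipKit::Streamer.open(io) do |zip|
    zip.write_deflated_file('report.csv') do |sink|
      # rows are compressed and written as they are produced -
      # nothing is buffered to disk first
      10_000.times { |i| sink << "row-#{i}\n" }
    end
  end
end
```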

Seventh [is Falcon](https://bsky.app/profile/jpcamara.com/post/3lczwvdvqhk2t): [https://github.com/socketry/falcon](https://github.com/socketry/falcon)

Falcon is a web server based on the FiberScheduler (provided by the async gem). It’s a very scalable server, particularly for IO bound operations. There’s a great talk focusing on it from RailsConf 2023 called [Look ma, no jobs](https://youtu.be/27uVIIgguQg).

I also benchmarked its WebSocket performance against a node.js implementation, and it came out very close!

![](https://cdn.uploads.micro.blog/98548/2024/f9240bd15e.png)

Here’s the code for it:

[https://gist.github.com/jpcamara/8a1a09c9c67347c4e32384b9ce806b70](https://gist.github.com/jpcamara/8a1a09c9c67347c4e32384b9ce806b70)

Like async and pitchfork, I talk about it in more detail in my [In-Depth Ruby Concurrency](https://jpcamara.com/2024/12/14/my-rubyconf-talk.html) talk from RubyConf and I’ll have future articles about it as well.

Eighth [is OJ](https://bsky.app/profile/jpcamara.com/post/3ld5hvh6lt22u): [https://github.com/ohler55/oj](https://github.com/ohler55/oj)

OJ has historically been the fastest JSON parser in the Ruby world. Usually you can just drop it into a project as a JSON replacement and see things immediately speed up, especially if you are doing any heavy JSON processing.

The [JSON](https://github.com/ruby/json) gem was recently taken over by the Ruby GitHub organization and [byroot](https://bsky.app/profile/byroot.bsky.social) has been making some big performance improvements to it - maybe we’ll see parity at some point but OJ is still a great choice.
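
The lowest-effort drop-in is `Oj.mimic_JSON`, which patches the `JSON` module to run on OJ - or you can call it directly:

```ruby
require 'oj'

Oj.mimic_JSON # JSON.parse / JSON.generate now use OJ under the hood

json = Oj.dump({ 'id' => 1, 'tags' => %w[ruby json] })
puts Oj.load(json)['tags'].first # ruby
```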

Ninth [is io-event](https://bsky.app/profile/jpcamara.com/post/3ldaheo6qvs27): [https://github.com/socketry/io-event](https://github.com/socketry/io-event)

The async gem is the public interface, but io-event is what powers the scheduling at the OS level. It provides all of the integrations with each operating system’s kernel event queue: io\_uring and epoll on Linux, and kqueue on macOS. IOCP support for Windows is still in progress, so it falls back to a basic Ruby `select` there. If you don’t know why any of that is useful: it’s an important part of keeping the “Reactor” pattern of asynchronous IO efficient.

Like async, pitchfork and falcon (😅), I talk about the reactor pattern in more detail in my [In-Depth Ruby Concurrency](https://jpcamara.com/2024/12/14/my-rubyconf-talk.html) talk from RubyConf and I’ll have future articles about it as well. I obviously like concurrency 🙂.

Tenth [is Glimmer](https://bsky.app/profile/jpcamara.com/post/3ldbmqyjzss27): [https://github.com/AndyObtiva/glimmer](https://github.com/AndyObtiva/glimmer)

I wasn’t familiar with Glimmer but I learned about it at RubyConf. It’s a DSL for building UIs in pure Ruby, with bindings for desktop UI layers as well as the web. It’s a really cool concept and I look forward to learning more about it by watching [How to build basic desktop applications in Ruby](https://youtu.be/TTSqRdTVtDY). I’ve been working on a cross-platform app using React Native and Tauri - maybe I’ll port some of it to Glimmer as an experiment.

After I finish 11 through 20, I’ll post about them as well. Give these gems a try! 👋 

</source:markdown>
    </item>
    
    <item>
      <title>My RubyConf talk is now on YouTube!</title>
      <link>https://jpcamara.com/2024/12/14/my-rubyconf-talk.html</link>
      <pubDate>Sat, 14 Dec 2024 22:40:45 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/14/my-rubyconf-talk.html</guid>
      <description>&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-15-at-10.28.24pm.png&#34; width=&#34;600&#34; height=&#34;200&#34; alt=&#34;&#34;&gt;
&lt;p&gt;I was honored to present a talk on Ruby concurrency at RubyConf 2024. It represents a high-level distillation of much of my writing and research over the past year. The conference itself was great, and presenting was such a fun experience.&lt;/p&gt;
&lt;p&gt;Here is the talk, titled “In-Depth Ruby Concurrency: Navigating the Ruby Concurrency Landscape”:&lt;/p&gt;
&lt;iframe width=&#34;560&#34; height=&#34;315&#34; src=&#34;https://www.youtube.com/embed/rewiLd2w0kE?si=p8DBcuUqqXnmsnlO&#34; title=&#34;YouTube video player&#34; frameborder=&#34;0&#34; allow=&#34;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&#34; referrerpolicy=&#34;strict-origin-when-cross-origin&#34; allowfullscreen&gt;&lt;/iframe&gt;
&lt;p&gt;If you want the slides, I’ve published them as a short YouTube video:&lt;/p&gt;
&lt;iframe width=&#34;560&#34; height=&#34;315&#34; src=&#34;https://www.youtube.com/embed/6HaXuQJMcvs?si=ckoz8lZKr47vQ-hy&#34; title=&#34;YouTube video player&#34; frameborder=&#34;0&#34; allow=&#34;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&#34; referrerpolicy=&#34;strict-origin-when-cross-origin&#34; allowfullscreen&gt;&lt;/iframe&gt;
&lt;p&gt;I know it’s a bit odd to share my slides as a video, but there are so many animations it was the easiest option to preserve the original flow.&lt;/p&gt;
&lt;p&gt;You can also find information about my talk and every other RubyConf 2024 talk at &lt;a href=&#34;https://www.rubyvideo.dev/talks/in-depth-ruby-concurrency-navigating-the-ruby-concurrency-landscape&#34;&gt;RubyVideo&lt;/a&gt;.&lt;/p&gt;
</description>
      <source:markdown>&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-15-at-10.28.24pm.png&#34; width=&#34;600&#34; height=&#34;200&#34; alt=&#34;&#34;&gt;

I was honored to present a talk on Ruby concurrency at RubyConf 2024. It represents a high-level distillation of much of my writing and research over the past year. The conference itself was great, and presenting was such a fun experience.

Here is the talk, titled “In-Depth Ruby Concurrency: Navigating the Ruby Concurrency Landscape”:

&lt;iframe width=&#34;560&#34; height=&#34;315&#34; src=&#34;https://www.youtube.com/embed/rewiLd2w0kE?si=p8DBcuUqqXnmsnlO&#34; title=&#34;YouTube video player&#34; frameborder=&#34;0&#34; allow=&#34;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&#34; referrerpolicy=&#34;strict-origin-when-cross-origin&#34; allowfullscreen&gt;&lt;/iframe&gt;

If you want the slides, I’ve published them as a short YouTube video:

&lt;iframe width=&#34;560&#34; height=&#34;315&#34; src=&#34;https://www.youtube.com/embed/6HaXuQJMcvs?si=ckoz8lZKr47vQ-hy&#34; title=&#34;YouTube video player&#34; frameborder=&#34;0&#34; allow=&#34;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&#34; referrerpolicy=&#34;strict-origin-when-cross-origin&#34; allowfullscreen&gt;&lt;/iframe&gt;

I know it’s a bit odd to share my slides as a video, but there are so many animations it was the easiest option to preserve the original flow.

You can also find information about my talk and every other RubyConf 2024 talk at [RubyVideo](https://www.rubyvideo.dev/talks/in-depth-ruby-concurrency-navigating-the-ruby-concurrency-landscape).

</source:markdown>
    </item>
    
    <item>
      <title>Speeding up Ruby by rewriting C… in Ruby</title>
      <link>https://jpcamara.com/2024/12/01/speeding-up-ruby.html</link>
      <pubDate>Wed, 04 Dec 2024 00:29:25 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/01/speeding-up-ruby.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/yjitvsc.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;There is a recent &lt;a href=&#34;https://github.com/bddicken/languages&#34;&gt;language comparison repo&lt;/a&gt; which has been getting shared a lot. In it, CRuby was the third slowest option, only beating out R and Python.&lt;/p&gt;
&lt;p&gt;The repo author, &lt;a href=&#34;https://x.com/BenjDicken&#34;&gt;@BenjDicken&lt;/a&gt;, &lt;a href=&#34;https://x.com/BenjDicken/status/1861072804239847914&#34;&gt;created a fun visualization&lt;/a&gt; of each language’s performance. Here’s one of the visualizations, which shows Ruby as the third slowest language benchmarked:&lt;/p&gt;
&lt;div id=&#34;original_visual&#34;&gt;&lt;/div&gt;
&lt;blockquote&gt;
&lt;p&gt;The code for this visualization is from &lt;a href=&#34;https://benjdd.com/languages/&#34;&gt;https://benjdd.com/languages/&lt;/a&gt;, with permission from &lt;a href=&#34;https://x.com/BenjDicken/status/1862623583803253149&#34;&gt;@BenjDicken&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The repository describes itself as:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A repo for collaboratively building small benchmarks to compare languages.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;It contains two different benchmarks:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;“Loops”, which “Emphasizes loop, conditional, and basic math performance”&lt;/li&gt;
&lt;li&gt;“Fibonacci”, which “Emphasizes function call overhead and recursion.”&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The loop example iterates 1 billion times, utilizing a nested loop:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;u &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ARGV&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;to_i       
r &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rand(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)                          
a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)                 
	
(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;                     
  (&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;100_000&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;j&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;               
    a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; j &lt;span style=&#34;color:#f92672&#34;&gt;%&lt;/span&gt; u                     
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; r                      
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
puts a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;r&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The Fibonacci example is a basic “naive” Fibonacci implementation&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def fibonacci(n)
  return 0 if n == 0
  return 1 if n == 1
  fibonacci(n - 1) + fibonacci(n - 2)
end

u = ARGV[0].to_i
r = 0

(1...u).each do |i|
  r += fibonacci(i)
end

puts r
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Run on &lt;a href=&#34;https://x.com/BenjDicken&#34;&gt;@BenjDicken&lt;/a&gt;’s M3 MacBook Pro, Ruby 3.3.6 takes 28 seconds to run the loop iteration example, and 12 seconds to run the Fibonacci example. For comparison, node.js takes a little over a second for both examples - it’s not a great showing for Ruby.&lt;/p&gt;
&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
      &lt;th&gt;Fibonacci&lt;/th&gt;
      &lt;th&gt;Loops&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Ruby&lt;/td&gt;
      &lt;td&gt;12.17s&lt;/td&gt;
      &lt;td&gt;28.80s&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;node.js&lt;/td&gt;
      &lt;td&gt;1.11s&lt;/td&gt;
      &lt;td&gt;1.03s&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;From this point on, I’ll use benchmark numbers from my own computer. Running the same benchmark on my M2 MacBook Air, I get 33.43 seconds for the loops and 16.33 seconds for fibonacci - even worse 🥺. Node runs in a little over 1 second for fibonacci and 2 seconds for the loop example.&lt;/p&gt;
&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
      &lt;th&gt;Fibonacci&lt;/th&gt;
      &lt;th&gt;Loops&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Ruby&lt;/td&gt;
      &lt;td&gt;16.33s&lt;/td&gt;
      &lt;td&gt;33.43s&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;node.js&lt;/td&gt;
      &lt;td&gt;1.36s&lt;/td&gt;
      &lt;td&gt;2.07s&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;
&lt;h3 id=&#34;who-cares&#34;&gt;Who cares?&lt;/h3&gt;
&lt;p&gt;In most ways, these types of benchmarks are meaningless. Python was the slowest language in the benchmark, and yet at the same time it’s the &lt;a href=&#34;https://github.blog/news-insights/octoverse/octoverse-2024/&#34;&gt;most used language on Github as of October 2024&lt;/a&gt;. Ruby runs some of the &lt;a href=&#34;https://x.com/tobi/status/1863935229620363693&#34;&gt;largest web apps in the world&lt;/a&gt;. I ran a &lt;a href=&#34;https://x.com/jpcamara/status/1849984009515966958&#34;&gt;benchmark recently of websocket performance between the Ruby Falcon web server and node.js&lt;/a&gt;, and the Ruby results were close to the node.js results. Are you doing a billion loop iterations or using web sockets?&lt;/p&gt;
&lt;p&gt;A programming language should be reasonably efficient - after that the usefulness of the language, the type of tasks you work on, and language productivity outweigh the speed at which you can run a billion iterations of a loop, or complete an intentionally inefficient implementation of a Fibonacci method.&lt;/p&gt;
&lt;p&gt;That said:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The programming world loves microbenchmarks 🤷‍♂️&lt;/li&gt;
&lt;li&gt;Having a fast benchmark may not be valuable in practice but it has meaning for people’s interest in a language. Some would claim it means you’ll have an easier time scaling performance, but that’s arguable&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;It’s disappointing if your language of choice doesn’t perform well. It’s nice to be able to say “I use and enjoy this language, and it runs fast in all benchmarks!”&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In the case of this Ruby benchmark, I had a feeling that YJIT wasn’t being applied in the Ruby code, so I checked the repo. Lo and behold, the command was as follows:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ruby ./code.rb 40
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We know my results from earlier (33 seconds and 16 seconds). What do we get with YJIT applied?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ruby --yjit ./code.rb 40
&lt;/code&gt;&lt;/pre&gt;
&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
&lt;th&gt;Fibonacci&lt;/th&gt;
&lt;th&gt;Loops&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ruby&lt;/td&gt;
&lt;td&gt;2.06s&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Nice! With YJIT, Fibonacci gets a massive boost - going from 16.33 seconds down to 2.06 seconds. It’s close to the speed of node.js at that point!&lt;/p&gt;
&lt;p&gt;YJIT makes a more modest difference for the looping example - going from 33.43 seconds down to 25.57 seconds. Why is that?&lt;/p&gt;
&lt;h3 id=&#34;a-team-effort&#34;&gt;A team effort&lt;/h3&gt;
&lt;p&gt;I wasn’t alone in trying out these code samples with YJIT. On twitter, &lt;a href=&#34;https://x.com/bsilva96&#34;&gt;@bsilva96&lt;/a&gt; had asked the same questions:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-01-at-8.38.46pm.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://x.com/bsilva96/status/1861136096689606708&#34;&gt;https://x.com/bsilva96/status/1861136096689606708&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://bsky.app/profile/k0kubun.com&#34;&gt;@k0kubun&lt;/a&gt; came through with insights into why things were slow and ways of improving the performance:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-01-at-8.41.03pm.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://x.com/k0kubun/status/1861149512640979260&#34;&gt;https://x.com/k0kubun/status/1861149512640979260&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Let’s unpack his response. There are three parts to it:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;Range#each&lt;/code&gt; is still written in C as of Ruby 3.4&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Integer#times&lt;/code&gt; was converted from C to Ruby in Ruby 3.3&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Array#each&lt;/code&gt; was converted from C to Ruby in Ruby 3.4&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id=&#34;1-rangeeach-is-still-written-in-c-which-yjit-cant-optimize&#34;&gt;1. &lt;code&gt;Range#each&lt;/code&gt; is still written in C, which YJIT can’t optimize&lt;/h3&gt;
&lt;p&gt;Looking back at our Ruby code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;(0...10_000).each do |i|                     
  (0...100_000).each do |j|               
    a[i] += j % u                     
  end
  a[i] += r                      
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’s written as a range, and &lt;code&gt;Range&lt;/code&gt; has its own &lt;code&gt;each&lt;/code&gt; implementation, which is apparently written in C. The CRuby codebase is pretty easy to navigate - let’s find that implementation 🕵️‍♂️.&lt;/p&gt;
&lt;p&gt;Most core classes in Ruby have top-level C files named after them - in this case we’ve got &lt;code&gt;range.c&lt;/code&gt; at the root of the project. CRuby has a pretty readable interface for exposing C functions as classes and methods - there is an &lt;code&gt;Init&lt;/code&gt; function, usually at the bottom of the file. Inside that &lt;code&gt;Init&lt;/code&gt; our classes, modules and methods are exposed from C to Ruby. Here are the relevant pieces of &lt;code&gt;Init_Range&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;void
Init_Range(void)
{
  //...
  rb_cRange = rb_struct_define_without_accessor(
    &amp;quot;Range&amp;quot;, rb_cObject, range_alloc,
    &amp;quot;begin&amp;quot;, &amp;quot;end&amp;quot;, &amp;quot;excl&amp;quot;, NULL);

  rb_include_module(rb_cRange, rb_mEnumerable);
  // ...
  rb_define_method(rb_cRange, &amp;quot;each&amp;quot;, range_each, 0);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;First, we define our &lt;code&gt;Range&lt;/code&gt; class using &lt;code&gt;rb_struct_define...&lt;/code&gt;. We name it &lt;code&gt;“Range”&lt;/code&gt;, with a superclass of &lt;code&gt;Object&lt;/code&gt; (&lt;code&gt;rb_cObject&lt;/code&gt;), and some initialization parameters (&lt;code&gt;“begin”&lt;/code&gt;, &lt;code&gt;“end”&lt;/code&gt; and whether to exclude the last value, i.e. the &lt;code&gt;..&lt;/code&gt; vs &lt;code&gt;...&lt;/code&gt; range syntax).&lt;/p&gt;
&lt;p&gt;Second, we include &lt;code&gt;Enumerable&lt;/code&gt; using &lt;code&gt;rb_include_module&lt;/code&gt;. That gives us all the great Ruby enumeration methods like &lt;code&gt;map&lt;/code&gt;, &lt;code&gt;select&lt;/code&gt;, &lt;code&gt;include?&lt;/code&gt; and &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Enumerable.html&#34;&gt;a bajillion others&lt;/a&gt;. All you have to do is provide an &lt;code&gt;each&lt;/code&gt; implementation and it handles the rest.&lt;/p&gt;
&lt;p&gt;Third, we define our &lt;code&gt;“each”&lt;/code&gt; method. It’s implemented by the &lt;code&gt;range_each&lt;/code&gt; function in C, and takes zero explicit arguments (blocks are not considered in this count).&lt;/p&gt;
&lt;p&gt;&lt;code&gt;range_each&lt;/code&gt; is hefty. It’s almost 100 lines long, and specializes into several versions of itself. I’ll highlight a few, collapsed all together:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;static VALUE
range_each(VALUE range)
{
  //...
  range_each_fixnum_endless(beg);
  range_each_fixnum_loop(beg, end, range);
  range_each_bignum_endless(beg);
  rb_str_upto_endless_each(beg, sym_each_i, 0);
  // and even more...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;These C functions handle all the variations of ranges you might use in your own code:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each
(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;100&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each
(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;a&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;z&amp;#34;&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each
&lt;span style=&#34;color:#75715e&#34;&gt;# and on...&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Why does it matter that &lt;code&gt;Range#each&lt;/code&gt; is written in C? It means YJIT can’t inspect it - optimizations stop at the function call and resume when it returns. C functions are fast, but YJIT can take things further by creating specializations for hot paths of code. There is a great article from Aaron Patterson called &lt;a href=&#34;https://railsatscale.com/2023-08-29-ruby-outperforms-c/&#34;&gt;Ruby Outperforms C&lt;/a&gt; where you can learn more about some of those specialized optimizations.&lt;/p&gt;
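&lt;p&gt;As an aside, you can poke at what YJIT is doing from Ruby itself (assuming Ruby 3.3+ for &lt;code&gt;RubyVM::YJIT.enable&lt;/code&gt;). &lt;code&gt;RubyVM::YJIT.runtime_stats&lt;/code&gt; returns a hash of counters - a much richer one when Ruby runs with &lt;code&gt;--yjit-stats&lt;/code&gt;. A minimal sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;RubyVM::YJIT.enable
puts RubyVM::YJIT.enabled? # true

# warm up some Ruby-level iteration so YJIT has something to compile
a = Array.new(10_000, 0)
10_000.times { |i| a[i] += i % 7 }

pp RubyVM::YJIT.runtime_stats
&lt;/code&gt;&lt;/pre&gt;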
&lt;h3 id=&#34;2-optimizing-our-loop-integertimes-was-converted-from-c-to-ruby-in-ruby-33&#34;&gt;2. Optimizing our loop: &lt;code&gt;Integer#times&lt;/code&gt; was converted from C to Ruby in Ruby 3.3&lt;/h3&gt;
&lt;p&gt;The hot path (&lt;em&gt;where most of our CPU time is spent&lt;/em&gt;) is &lt;code&gt;Range#each&lt;/code&gt;, which is a C function. YJIT can’t optimize C functions - they’re a black box. So what can we do?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;We converted Integer#times to Ruby in 3.3&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Interesting! In Ruby 3.3, &lt;code&gt;Integer#times&lt;/code&gt; was &lt;a href=&#34;https://github.com/ruby/ruby/pull/8388&#34;&gt;converted from a C function to a Ruby method&lt;/a&gt;! Here’s the 3.3+ version - it’s pretty simple:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def times
  #... a little C interop code
  i = 0
  while i &amp;lt; self
    yield i
    i = i.succ
  end
  self
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’s just a basic while loop. Most importantly, it’s all Ruby code, which means YJIT should be able to introspect and optimize it!&lt;/p&gt;
&lt;h3 id=&#34;an-aside-on-integersucc&#34;&gt;An aside on &lt;code&gt;Integer#succ&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;The slightly odd part of that code is &lt;code&gt;i.succ&lt;/code&gt;. I’d never heard of &lt;code&gt;Integer#succ&lt;/code&gt;, which apparently gives you the “successor” to an integer.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/0c8bd56f64.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I’ve never seen this show, and yet it’s the first thing I thought of when I learned about this method. Thanks, advertising.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;There was a PR to improve the performance of &lt;code&gt;Integer#succ&lt;/code&gt; in early 2024, which helped me understand why anyone would ever use it:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;We use Integer#succ when we rewrite loop methods in Ruby (e.g. Integer#times and Array#each) because opt_succ (i = i.succ) is faster to dispatch on the interpreter than putobject 1; opt_plus (i += 1).&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://github.com/ruby/ruby/pull/9519&#34;&gt;https://github.com/ruby/ruby/pull/9519&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;code&gt;Integer#succ&lt;/code&gt; is like a virtual machine cheat code. It takes a common operation (adding 1 to an integer) and turns it from two virtual machine operations into one. We can call &lt;code&gt;disasm&lt;/code&gt; on the &lt;code&gt;times&lt;/code&gt; method to see that in action:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;puts RubyVM::InstructionSequence.disasm(1.method(:times))
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The &lt;code&gt;Integer#times&lt;/code&gt; method gets broken down into a lot of Ruby VM bytecode, but we only care about a few lines:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;...
0025 getlocal_WC_0   i@0
0027 opt_succ        &amp;lt;calldata!mid:succ, ARGS_SIMPLE&amp;gt;[CcCr]
0029 setlocal_WC_0   i@0
...
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;getlocal_WC_0&lt;/code&gt; gets our &lt;code&gt;i&lt;/code&gt; variable from the current scope. That’s the &lt;code&gt;i&lt;/code&gt; in &lt;code&gt;i.succ&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;opt_succ&lt;/code&gt; performs the &lt;code&gt;succ&lt;/code&gt; call in our &lt;code&gt;i.succ&lt;/code&gt;. It will either call the actual &lt;code&gt;Integer#succ&lt;/code&gt; method, or an optimized C function for small numbers&lt;/li&gt;
&lt;li&gt;In Ruby 3.4 with YJIT enabled, small numbers get optimized even further into machine code (just a note - this isn’t shown in the VM bytecode above)&lt;/li&gt;
&lt;li&gt;&lt;code&gt;setlocal_WC_0&lt;/code&gt; sets the result of &lt;code&gt;opt_succ&lt;/code&gt; to our local variable &lt;code&gt;i&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If we change from &lt;code&gt;i = i.succ&lt;/code&gt; to &lt;code&gt;i += 1&lt;/code&gt;, we now have two VM operations taking the place of &lt;code&gt;opt_succ&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;...
0025 getlocal_WC_0        i@0
0027 putobject_INT2FIX_1_
0028 opt_plus             &amp;lt;calldata!mid:+, argc:1, ARGS_SIMPLE&amp;gt;
0029 setlocal_WC_0        i@0
...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Everything is essentially the same as before, except now we have two steps to go through instead of one:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;putobject_INT2FIX_1_&lt;/code&gt; pushes the integer &lt;code&gt;1&lt;/code&gt; onto the virtual machine stack&lt;/li&gt;
&lt;li&gt;&lt;code&gt;opt_plus&lt;/code&gt; is the &lt;code&gt;+&lt;/code&gt; in our &lt;code&gt;+= 1&lt;/code&gt;, and calls either the Ruby &lt;code&gt;+&lt;/code&gt; method or an optimized C function for small numbers&lt;/li&gt;
&lt;li&gt;There is probably a YJIT optimization for &lt;code&gt;opt_plus&lt;/code&gt; as well&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If there is nothing else to learn from this code, it’s this: the kinds of optimizations you do at the VM and JIT level are &lt;em&gt;deep&lt;/em&gt;. When writing general Ruby programs we typically don’t and &lt;em&gt;shouldn’t&lt;/em&gt; consider the impact of one versus two &lt;em&gt;virtual machine instructions&lt;/em&gt;. But at the JIT level, on the scale of millions and billions of operations, it matters!&lt;/p&gt;
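&lt;p&gt;You can reproduce that comparison yourself with a couple of one-liners - no digging through CRuby required:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# dump the VM bytecode for each style of increment
puts RubyVM::InstructionSequence.compile(&#39;i = 0; i = i.succ&#39;).disasm
puts RubyVM::InstructionSequence.compile(&#39;i = 0; i += 1&#39;).disasm
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The first dump contains the single &lt;code&gt;opt_succ&lt;/code&gt;, the second the &lt;code&gt;putobject_INT2FIX_1_&lt;/code&gt; / &lt;code&gt;opt_plus&lt;/code&gt; pair.&lt;/p&gt;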
&lt;h3 id=&#34;back-to-integertimes&#34;&gt;Back to &lt;code&gt;Integer#times&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;Let’s try running our benchmark code again, using &lt;code&gt;times&lt;/code&gt;! Instead of iterating over ranges, we simply iterate &lt;code&gt;10_000&lt;/code&gt; and &lt;code&gt;100_000&lt;/code&gt; &lt;code&gt;times&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;u &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ARGV&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;to_i        
r &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rand(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)        
a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
	
&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
  &lt;span style=&#34;color:#ae81ff&#34;&gt;100_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;j&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
    a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; j &lt;span style=&#34;color:#f92672&#34;&gt;%&lt;/span&gt; u
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; r
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
puts a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;r&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Loops&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Integer#times&lt;/td&gt;
&lt;td&gt;13.66s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Nice! YJIT makes a much larger impact using &lt;code&gt;Integer#times&lt;/code&gt;, trimming things down significantly - 13.66 seconds on my machine. On &lt;a href=&#34;https://bsky.app/profile/k0kubun.com&#34;&gt;@k0kubun&lt;/a&gt;’s machine it actually goes down to 9 seconds (and 8 seconds on Ruby 3.4).&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It’s probably Ruby 3.5’s job to make it faster than 8s though.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We might look forward to even faster performance in Ruby 3.5. We’ll see!&lt;/p&gt;
&lt;h3 id=&#34;3-arrayeach-was-converted-from-c-to-ruby-in-ruby-34&#34;&gt;3. &lt;code&gt;Array#each&lt;/code&gt; was converted from C to Ruby in Ruby 3.4&lt;/h3&gt;
&lt;p&gt;CRuby continues to see C code rewritten in Ruby, and in Ruby 3.4 &lt;code&gt;Array#each&lt;/code&gt; was one of those changes. Here is an &lt;a href=&#34;https://github.com/ruby/ruby/pull/6687/files&#34;&gt;example of the first attempt at implementing it&lt;/a&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def each
  unless block_given?
    return to_enum(:each) { self.length }
  end
  i = 0
  while i &amp;lt; self.length
    yield self[i]
    i = i.succ
  end
  self
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Super simple and readable! And YJIT optimizable!&lt;/p&gt;
&lt;p&gt;Unfortunately, due to something related to CRuby internals, it contained &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#race-conditions&#34;&gt;race conditions&lt;/a&gt;. A later implementation &lt;a href=&#34;https://github.com/ruby/ruby/pull/11955&#34;&gt;landed in Ruby 3.4&lt;/a&gt;.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def each
  Primitive.attr! :inline_block, :c_trace

  unless defined?(yield)
    return Primitive.cexpr! &#39;SIZED_ENUMERATOR(self, 0, 0, ary_enum_length)&#39;
  end
  _i = 0
  value = nil
  while Primitive.cexpr!(%q{ ary_fetch_next(self, LOCAL_PTR(_i), LOCAL_PTR(value)) })
    yield value
  end
  self
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Unlike the first implementation, and unlike &lt;code&gt;Integer#times&lt;/code&gt;, things are a bit more cryptic this time. This is definitely not pure Ruby code that anyone could be expected to write. Somehow, the &lt;code&gt;Primitive&lt;/code&gt; module seems to allow evaluating C code from Ruby, and in doing so avoids the race conditions present in the pure Ruby solution&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;By fetching indexes and values using C code, I think it results in a more atomic operation. I have no idea why the &lt;code&gt;Primitive.cexpr!&lt;/code&gt; is used to return the enumerator, or what value &lt;code&gt;Primitive.attr! :inline_block&lt;/code&gt; provides. Please comment if you have insights there!&lt;/p&gt;
&lt;p&gt;I was a little loose with my earlier &lt;code&gt;Integer#times&lt;/code&gt; source code, too. That actually had a bit of this &lt;code&gt;Primitive&lt;/code&gt; syntax. The core of the method is what we looked at, and it’s all Ruby, but the start of the method contains the same &lt;code&gt;Primitive&lt;/code&gt; calls for &lt;code&gt;:inline_block&lt;/code&gt; and returning the enumerator:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def times
  Primitive.attr! :inline_block
  unless defined?(yield)
    return Primitive.cexpr! &#39;SIZED_ENUMERATOR(self, 0, 0, int_dotimes_size)&#39;
  end
  #...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Ok - it’s more cryptic than &lt;code&gt;Integer#times&lt;/code&gt; was, but &lt;code&gt;Array#each&lt;/code&gt; is mostly Ruby (on Ruby 3.4+). Let’s give it a try using arrays instead of ranges or &lt;code&gt;times&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;u &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ARGV&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;to_i
r &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rand(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)
a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
	
outer &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;to_a&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;freeze
inner &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;...&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;100_000&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;to_a&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;freeze
outer&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
  inner&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;j&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
    a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; j &lt;span style=&#34;color:#f92672&#34;&gt;%&lt;/span&gt; u
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; r
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
puts a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;r&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Despite the embedded C code, YJIT still seems capable of making some hefty performance optimizations. It’s within the same range as &lt;code&gt;Integer#times&lt;/code&gt;!&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Loops&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Integer#times&lt;/td&gt;
&lt;td&gt;13.66s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Array#each&lt;/td&gt;
&lt;td&gt;13.96s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;h3 id=&#34;microbenchmarking-ruby-performance&#34;&gt;Microbenchmarking Ruby performance&lt;/h3&gt;
&lt;p&gt;I’ve forked the original language implementation repo, and created my own repository called “Ruby Microbench”. It takes all of the examples discussed, as well as several other forms of doing the iteration in Ruby: &lt;a href=&#34;https://github.com/jpcamara/ruby_microbench&#34;&gt;https://github.com/jpcamara/ruby_microbench&lt;/a&gt;&lt;/p&gt;
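&lt;p&gt;(Conceptually the harness is nothing fancy - it just times one child process per variant, something like this sketch. The file names here are made up; the repo has the real ones:)&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &#39;benchmark&#39;

%w[range_each times array_each while_loop].each do |variant|
  elapsed = Benchmark.realtime do
    system(&#39;ruby&#39;, &#39;--yjit&#39;, &amp;quot;#{variant}.rb&amp;quot;, &#39;40&#39;)
  end
  puts format(&#39;%-12s %.2fs&#39;, variant, elapsed)
end
&lt;/code&gt;&lt;/pre&gt;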
&lt;p&gt;Here is the output of just running those using Ruby 3.4 with and without YJIT:&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;fibonacci&lt;/th&gt;&lt;th&gt;array#each&lt;/th&gt;
&lt;th&gt;range#each&lt;/th&gt;
&lt;th&gt;times&lt;/th&gt;
&lt;th&gt;for&lt;/th&gt;
&lt;th&gt;while&lt;/th&gt;
&lt;th&gt;loop do&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4 YJIT&lt;/td&gt;
&lt;td&gt;2.19s&lt;/td&gt;
&lt;td&gt;14.02s&lt;/td&gt;
&lt;td&gt;26.61s&lt;/td&gt;
&lt;td&gt;13.12s&lt;/td&gt;
&lt;td&gt;14.91s&lt;/td&gt;
&lt;td&gt;37.10s&lt;/td&gt;
&lt;td&gt;13.95s&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4&lt;/td&gt;
&lt;td&gt;16.49s&lt;/td&gt;
&lt;td&gt;34.29s&lt;/td&gt;
&lt;td&gt;33.88s&lt;/td&gt;
&lt;td&gt;33.18s&lt;/td&gt;
&lt;td&gt;36.32s&lt;/td&gt;
&lt;td&gt;37.14s&lt;/td&gt;
&lt;td&gt;50.65s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;I have no idea why the &lt;code&gt;while&lt;/code&gt; loop example I wrote seems to be so slow. I’d expected it to run much faster. Maybe there’s an issue with how I wrote it - feel free to open an issue or PR if you see something wrong with my implementation. The &lt;code&gt;loop do&lt;/code&gt; (taken from &lt;a href=&#34;https://bsky.app/profile/timtilberg.bsky.social&#34;&gt;@timtilberg&lt;/a&gt;’s &lt;a href=&#34;https://x.com/timtilberg/status/1861194052516864004&#34;&gt;example&lt;/a&gt;) runs around the same speed as &lt;code&gt;Integer#times&lt;/code&gt; - although its performance is &lt;em&gt;awful&lt;/em&gt; with YJIT turned off.&lt;/p&gt;
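&lt;p&gt;(For reference, the &lt;code&gt;while&lt;/code&gt; variant is shaped roughly like this - not the repo’s literal file, but close enough to follow along:)&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;u = ARGV[0].to_i
r = rand(10_000)
a = Array.new(10_000, 0)

i = 0
while i &amp;lt; 10_000
  j = 0
  while j &amp;lt; 100_000
    a[i] += j % u
    j += 1
  end
  a[i] += r
  i += 1
end

puts a[r]
&lt;/code&gt;&lt;/pre&gt;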
&lt;blockquote&gt;
&lt;p&gt;📝 The &lt;code&gt;for in&lt;/code&gt; loop and &lt;code&gt;array#each&lt;/code&gt; have very similar performance, and that&amp;rsquo;s because at the Ruby VM bytecode level they are almost identical. &lt;code&gt;for in&lt;/code&gt; is mostly syntactic sugar that transforms into an &lt;code&gt;#each&lt;/code&gt; call in the VM. Thanks to &lt;a href=&#34;https://ruby.social/@dodecadaniel&#34;&gt;Daniel Colson&lt;/a&gt; for pointing this out, and you can read his &lt;a href=&#34;https://danieljamescolson.com/blog/for-loops&#34;&gt;for loops in Ruby&lt;/a&gt; article for some additional information and nuance around &lt;code&gt;for in&lt;/code&gt;!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In addition to running Ruby 3.4, for fun I have it using &lt;code&gt;rbenv&lt;/code&gt; to run:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ruby 3.3&lt;/li&gt;
&lt;li&gt;Ruby 3.3 YJIT&lt;/li&gt;
&lt;li&gt;Ruby 3.2&lt;/li&gt;
&lt;li&gt;Ruby 3.2 YJIT&lt;/li&gt;
&lt;li&gt;TruffleRuby 24.1&lt;/li&gt;
&lt;li&gt;Ruby Artichoke&lt;/li&gt;
&lt;li&gt;MRuby&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A few of the test runs are listed here:&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;fibonacci&lt;/th&gt;
&lt;th&gt;
array#each
&lt;/th&gt;
&lt;th&gt;range#each&lt;/th&gt;
&lt;th&gt;times&lt;/th&gt;
&lt;th&gt;for&lt;/th&gt;
&lt;th&gt;while&lt;/th&gt;
&lt;th&gt;loop do&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4 YJIT&lt;/td&gt;&lt;td&gt;2.19s&lt;/td&gt;
&lt;td&gt;14.02s&lt;/td&gt;
&lt;td&gt;26.61s&lt;/td&gt;
&lt;td&gt;13.12s&lt;/td&gt;
&lt;td&gt;14.91s&lt;/td&gt;
&lt;td&gt;37.10s&lt;/td&gt;
&lt;td&gt;13.95s&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4&lt;/td&gt;
&lt;td&gt;16.49s&lt;/td&gt;
&lt;td&gt;34.29s&lt;/td&gt;&lt;td&gt;33.88s&lt;/td&gt;
&lt;td&gt;33.18s&lt;/td&gt;
&lt;td&gt;36.32s&lt;/td&gt;
&lt;td&gt;37.14s&lt;/td&gt;
&lt;td&gt;50.65s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;TruffleRuby 24.1&lt;/td&gt;&lt;td&gt;0.92s&lt;/td&gt;
&lt;td&gt;0.97s&lt;/td&gt;
&lt;td&gt;0.92s&lt;/td&gt;&lt;td&gt;2.39s&lt;/td&gt;
&lt;td&gt;2.06s&lt;/td&gt;&lt;td&gt;3.90s&lt;/td&gt;
&lt;td&gt;0.77s&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;
&lt;td&gt;MRuby 3.3&lt;/td&gt;
&lt;td&gt;28.83s&lt;/td&gt;
&lt;td&gt;144.65s&lt;/td&gt;
&lt;td&gt;126.40s&lt;/td&gt;
&lt;td&gt;128.22s&lt;/td&gt;
&lt;td&gt;133.58s&lt;/td&gt;&lt;td&gt;91.55s&lt;/td&gt;
&lt;td&gt;144.93s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Artichoke&lt;/td&gt;
&lt;td&gt;19.71s&lt;/td&gt;
&lt;td&gt;236.10s&lt;/td&gt;&lt;td&gt;214.55s&lt;/td&gt;
&lt;td&gt;214.51s&lt;/td&gt;
&lt;td&gt;215.95s&lt;/td&gt;
&lt;td&gt;174.70s&lt;/td&gt;
&lt;td&gt;264.67s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Based on that, I’ve taken the original visualization and made a Ruby-specific one here, just for the &lt;code&gt;fibonacci&lt;/code&gt; run:&lt;/p&gt;
&lt;div id=&#34;ruby_visual&#34;&gt;&lt;/div&gt;
&lt;h3 id=&#34;speeding-up-rangeeach&#34;&gt;Speeding up &lt;code&gt;range#each&lt;/code&gt;&lt;/h3&gt;
&lt;p&gt;Can we, the non-&lt;a href=&#34;https://bsky.app/profile/k0kubun.com&#34;&gt;@k0kubun&lt;/a&gt;’s of the world, make &lt;code&gt;Range#each&lt;/code&gt; faster? If I monkey patch the &lt;code&gt;Range&lt;/code&gt; class with a pure-Ruby implementation, things &lt;em&gt;do&lt;/em&gt; get much faster! Here’s my implementation:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Range
  def each
    beginning = self.begin
    ending = self.end
    i = beginning
    loop do
      break if i == ending
      yield i
      i = i.succ
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;And here is the change in performance - about 3 seconds slower than &lt;code&gt;times&lt;/code&gt; - not bad!&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;				
&lt;/th&gt;
&lt;th&gt;
Time spent
&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each in C&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Range#each in Ruby&lt;/td&gt;
&lt;td&gt;16.64s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;This is obviously over-simplified. I don’t handle all of the different cases of &lt;code&gt;Range&lt;/code&gt;, and there may be nuances I am missing. Also, most of the methods rewritten in Ruby that I’ve seen invoke a &lt;code&gt;Primitive&lt;/code&gt; class for certain operations. I’d love to learn more about when and why it’s needed.&lt;/p&gt;
&lt;p&gt;But! It goes to show the power of moving things &lt;em&gt;out&lt;/em&gt; of C and letting YJIT optimize our code. It can improve performance in ways that would be difficult or impossible to replicate in regular C code.&lt;/p&gt;
&lt;h3 id=&#34;yjit-standard-library&#34;&gt;YJIT standard library&lt;/h3&gt;
&lt;p&gt;Last year Aaron Patterson wrote an article called &lt;a href=&#34;https://railsatscale.com/2023-08-29-ruby-outperforms-c/&#34;&gt;Ruby Outperforms C&lt;/a&gt;, in which he rewrote a C extension in Ruby for some GraphQL parsing. The Ruby code outperformed C thanks to YJIT optimizations.&lt;/p&gt;
&lt;p&gt;This got me thinking that it would be interesting to see a kind of “YJIT standard library” emerge, where core Ruby functionality written in C could be swapped out for Ruby implementations for anyone running YJIT.&lt;/p&gt;
&lt;p&gt;As it turns out, this is almost exactly what the core YJIT team has been doing. In many cases they’ve completely removed C code, but more recently they’ve created a &lt;code&gt;with_yjit&lt;/code&gt; block. The code will only take effect if YJIT is enabled; otherwise the C code will run. For example, this is how &lt;code&gt;Array#each&lt;/code&gt; is implemented:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;with_yjit do
  if Primitive.rb_builtin_basic_definition_p(:each)
    undef :each

    def each # :nodoc:
      # ... we examined this code earlier ...
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;As of Ruby 3.3, YJIT can be lazily initialized. Thankfully the &lt;code&gt;with_yjit&lt;/code&gt; code handles this - the appropriate &lt;code&gt;with_yjit&lt;/code&gt; versions of methods will be run once YJIT is enabled:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Uses C-builtin
[1, 2, 3].each do |i|
  puts i
end

RubyVM::YJIT.enable

# Uses Ruby version, which can be YJIT optimized
[1, 2, 3].each do |i|
  puts i
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is because &lt;code&gt;with_yjit&lt;/code&gt; is a YJIT “hook”, which is called the moment YJIT is enabled. After being called, it is removed from the runtime using &lt;code&gt;undef :with_yjit&lt;/code&gt;.&lt;/p&gt;
&lt;h3 id=&#34;investigating-yjit-optimizations&#34;&gt;Investigating YJIT optimizations&lt;/h3&gt;
&lt;p&gt;We’ve looked at Ruby code. We’ve looked at C code. We’ve looked at Ruby VM bytecode. Why not take it one step deeper and look at some &lt;em&gt;machine code&lt;/em&gt;? And maybe some Rust code? Hey - where are you going! Don’t walk away while I’m talking to you!&lt;/p&gt;
&lt;p&gt;If you &lt;em&gt;haven’t&lt;/em&gt; walked away, or skipped to the next section, let’s take a look at a small sliver of YJIT while we’re here!&lt;/p&gt;
&lt;p&gt;We can see the machine code YJIT generates 😱. You can do it by building CRuby from source with YJIT debug flags. If you’re on a Mac you can see &lt;a href=&#34;https://jpcamara.com/2024/12/02/my-macos-setup.html&#34;&gt;my MacOS setup for hacking on CRuby&lt;/a&gt; or &lt;a href=&#34;https://jpcamara.com/2024/11/27/my-docker-setup.html&#34;&gt;my docker setup for hacking on CRuby&lt;/a&gt; for more elaborate instructions on building Ruby. But the short version: when you &lt;code&gt;./configure&lt;/code&gt; Ruby, hand in the &lt;code&gt;--enable-yjit=dev&lt;/code&gt; option:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;./configure --enable-yjit=dev
make install
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Let’s use our &lt;code&gt;Integer#times&lt;/code&gt; version from earlier as the example Ruby code:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;u &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ARGV&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;to_i
r &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; rand(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;)
a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)
	
&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
  &lt;span style=&#34;color:#ae81ff&#34;&gt;100_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;j&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
    a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; j &lt;span style=&#34;color:#f92672&#34;&gt;%&lt;/span&gt; u
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;i&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; r
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
puts a&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;r&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Because you’ve built Ruby with YJIT in dev mode, you can pass the &lt;code&gt;--yjit-dump-disasm&lt;/code&gt; flag when running your Ruby program:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;./ruby --yjit --yjit-dump-disasm test.rb 40
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Using this, we can see the machine code created. We’ll just focus on one tiny part - the machine code equivalent of the Ruby VM bytecode we read earlier. Here is the original VM bytecode for &lt;code&gt;opt_succ&lt;/code&gt;, which is generated when you call &lt;code&gt;i.succ&lt;/code&gt;, the &lt;code&gt;Integer#succ&lt;/code&gt; method:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;...
0027 opt_succ        &amp;lt;calldata!mid:succ, ARGS_SIMPLE&amp;gt;[CcCr]
...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;And here is the machine code YJIT generates in this scenario, on my Mac M2 arm64 architecture:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Block: times@&amp;lt;internal:numeric&amp;gt;:259 
# reg_mapping: [Some(Stack(0)), None, None, None, None]
# Insn: 0027 opt_succ (stack_size: 1)
# call to Integer#succ
# guard object is fixnum
0x1096808c4: tst x1, #1
0x1096808c8: b.eq #0x109683014
0x1096808cc: nop 
0x1096808d0: nop 
0x1096808d4: nop 
0x1096808d8: nop 
0x1096808dc: nop 
# Integer#succ
0x1096808e0: adds x11, x1, #2
0x1096808e4: b.vs #0x109683048
0x1096808e8: mov x1, x11
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;To be honest, I understand about 25% of this; the other 75% is me combining my own logic with AI to learn it 🤫. Feel free to yell at me if I get it a little wrong - I’d love to learn more. But here’s how I break it down.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Block: times@&amp;lt;internal:numeric&amp;gt;:259
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;👆🏼This roughly corresponds to the line &lt;code&gt;i = i.succ&lt;/code&gt; in the &lt;code&gt;Integer#times&lt;/code&gt; method in &lt;code&gt;numeric.rb&lt;/code&gt;. I say roughly because in my current code that’s line 258 - maybe it reports the end of the block it runs in, since YJIT compiles “blocks” of code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;256: while i &amp;lt; self
257:   yield i
258:   i = i.succ
259: end

# reg_mapping: [Some(Stack(0)), None, None, None, None]
# Insn: 0027 opt_succ (stack_size: 1)
# call to Integer#succ
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;👆🏼I have no idea what &lt;code&gt;reg_mapping&lt;/code&gt; means - probably mapping how it uses a CPU register? &lt;code&gt;Insn: 0027 opt_succ&lt;/code&gt; looks very familiar! That’s our VM bytecode! &lt;code&gt;call to Integer#succ&lt;/code&gt; is just a helpful comment added. YJIT is capable of adding comments to the machine code. We still haven’t even left the safety of the comments 😅.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# guard object is fixnum
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;👆🏼This is interesting. I can find a corresponding bit of Rust code that maps directly to this. Let’s take a look at it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;fn jit_rb_int_succ(
  //...
  asm: &amp;amp;mut Assembler,
  //...
) -&amp;gt; bool {
  // Guard the receiver is fixnum
  let recv_type = asm.ctx.get_opnd_type(StackOpnd(0));
  let recv = asm.stack_pop(1);
  if recv_type != Type::Fixnum {
    asm_comment!(asm, &amp;quot;guard object is fixnum&amp;quot;);
    asm.test(recv, Opnd::Imm(RUBY_FIXNUM_FLAG as i64));
    asm.jz(Target::side_exit(Counter::opt_succ_not_fixnum));
  }

  asm_comment!(asm, &amp;quot;Integer#succ&amp;quot;);
  let out_val = asm.add(recv, Opnd::Imm(2)); // 2 is untagged Fixnum 1
  asm.jo(Target::side_exit(Counter::opt_succ_overflow));

  // Push the output onto the stack
  let dst = asm.stack_push(Type::Fixnum);
  asm.mov(dst, out_val);

  true
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Oh nice! This is the actual YJIT Rust implementation of the &lt;code&gt;opt_succ&lt;/code&gt; call. This is the optimization &lt;a href=&#34;https://bsky.app/profile/k0kubun.com&#34;&gt;@k0kubun&lt;/a&gt; made to further improve &lt;code&gt;opt_succ&lt;/code&gt; performance beyond the bytecode C function calls. We’re in the section that checks whether what we’re operating on is a Fixnum, which is the way CRuby stores small integers internally:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if recv_type != Type::Fixnum {
  asm_comment!(asm, &amp;quot;guard object is fixnum&amp;quot;);
  asm.test(recv, Opnd::Imm(RUBY_FIXNUM_FLAG as i64));
  asm.jz(Target::side_exit(Counter::opt_succ_not_fixnum));
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;That becomes this machine code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# guard object is fixnum
0x1096808c4: tst x1, #1
0x1096808c8: b.eq #0x109683014
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;code&gt;asm.test&lt;/code&gt; generates &lt;code&gt;tst x1, #1&lt;/code&gt;, which (according to an AI bot I asked) checks the least significant bit - the Fixnum “tag” that marks this value as a Fixnum. If it’s a Fixnum, the AND result is 1, so the zero flag is clear and &lt;code&gt;b.eq&lt;/code&gt; doesn’t branch. If it’s not a Fixnum, the result is &lt;code&gt;0&lt;/code&gt;, the zero flag is set, and &lt;code&gt;b.eq&lt;/code&gt; branches away from this code.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;0x1096808cc: nop 
0x1096808d0: nop 
0x1096808d4: nop 
0x1096808d8: nop 
0x1096808dc: nop 
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;🤖 “NOPs for alignment/padding”. Thanks AI. I don’t know why it is needed, but at least I know what it probably is.&lt;/p&gt;
&lt;p&gt;Finally, we &lt;em&gt;actually&lt;/em&gt; add 1 to the number.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;asm_comment!(asm, &amp;quot;Integer#succ&amp;quot;);
let out_val = asm.add(recv, Opnd::Imm(2)); // 2 is untagged Fixnum 1
asm.jo(Target::side_exit(Counter::opt_succ_overflow));

// Push the output onto the stack
let dst = asm.stack_push(Type::Fixnum);
asm.mov(dst, out_val);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The Rust code generates our &lt;code&gt;Integer#succ&lt;/code&gt; comment. Then comes the actual addition: because of the “Fixnum tag” bit embedded within our integer, adding 1 actually means adding 2, using &lt;code&gt;adds x11, x1, #2&lt;/code&gt; 😵‍💫. If we overflow the space available, it exits to a different code path - &lt;code&gt;b.vs&lt;/code&gt; is a branch on overflow. Otherwise, it stores the result with &lt;code&gt;mov x1, x11&lt;/code&gt;!&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Integer#succ
0x1096808e0: adds x11, x1, #2
0x1096808e4: b.vs #0x109683048
0x1096808e8: mov x1, x11
&lt;/code&gt;&lt;/pre&gt;
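&lt;p&gt;To make that tag-bit arithmetic concrete, here’s a plain-Ruby model of the encoding (an illustration of the idea, not CRuby’s actual C representation):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# A Fixnum n is stored as (n &amp;lt;&amp;lt; 1) | 1, so the low bit is always 1
def to_fixnum(n)   = (n &amp;lt;&amp;lt; 1) | 1
def from_fixnum(v) = v &amp;gt;&amp;gt; 1

tagged = to_fixnum(41)  # =&amp;gt; 83
tagged &amp;amp; 1            # =&amp;gt; 1, the tag bit that tst checks
from_fixnum(tagged + 2) # =&amp;gt; 42, adding 2 tagged == adding 1 untagged
&lt;/code&gt;&lt;/pre&gt;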
&lt;p&gt;😮‍💨. That was a lot. And it seems like &lt;em&gt;a lot&lt;/em&gt; of work is being done, but because it’s such low-level machine code it’s presumably super fast. We examined a teensy tiny portion of what YJIT is capable of generating - JITs are complicated!&lt;/p&gt;
&lt;p&gt;Thanks to &lt;a href=&#34;https://bsky.app/profile/k0kubun.com&#34;&gt;@k0kubun&lt;/a&gt; for providing me with the commands and pointing me at the &lt;a href=&#34;https://github.com/ruby/ruby/blob/master/doc/yjit/yjit.md&#34;&gt;YJIT docs&lt;/a&gt;, which contain tons of additional options as well.&lt;/p&gt;
&lt;h3 id=&#34;the-future-of-cruby-optimizations&#34;&gt;The future of CRuby optimizations&lt;/h3&gt;
&lt;p&gt;The irony of language implementation is that you often work less in the language you’re implementing than you do in something lower-level - in Ruby’s case, that’s mostly C and some Rust.&lt;/p&gt;
&lt;p&gt;With a layer like YJIT, it potentially opens up a future where more of the language becomes plain Ruby, and Ruby developer contribution is easier. Many languages have a smaller low level core, and the majority of the language is written in itself (like Java, for instance). Maybe that’s a future for CRuby, someday! Until then, keep the YJIT optimizations coming, YJIT team!&lt;/p&gt;
&lt;script src=&#34;https://jpcamara.com/d3.v7.min.js&#34;&gt;&lt;/script&gt;
&lt;script src=&#34;https://jpcamara.com/latency_visual.js&#34;&gt;&lt;/script&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Naive in this case meaning that there are more efficient ways to calculate Fibonacci numbers in a program&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;MJIT, the precursor to YJIT, made Ruby much faster on certain benchmarks. But on large realistic Rails applications it actually &lt;a href=&#34;https://bugs.ruby-lang.org/issues/14490&#34;&gt;made things &lt;em&gt;slower&lt;/em&gt;&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;When C code is running, it has to opt-in to releasing the GVL, so it’s more difficult for threads to corrupt or modify data mid-operation. The original Ruby version could yield the GVL at points that would invalidate the array. That’s my understanding of the situation anyways.&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/yjitvsc.drawio.png)

There is a recent [language comparison repo](https://github.com/bddicken/languages) which has been getting shared a lot. In it, CRuby was the third slowest option, only beating out R and Python. 

The repo author, [@BenjDicken](https://x.com/BenjDicken), [created a fun visualization](https://x.com/BenjDicken/status/1861072804239847914) of each language’s performance. Here’s one of the visualizations, which shows Ruby as the third slowest language benchmarked:

&lt;div id=&#34;original_visual&#34;&gt;&lt;/div&gt;

&gt; The code for this visualization is from [https://benjdd.com/languages/](https://benjdd.com/languages/), with permission from [@BenjDicken](https://x.com/BenjDicken/status/1862623583803253149)

The repository describes itself as:

&gt; A repo for collaboratively building small benchmarks to compare languages.

It contains two different benchmarks:

1. “Loops”, which “Emphasizes loop, conditional, and basic math performance”
2. “Fibonacci”, which “Emphasizes function call overhead and recursion.”

The loop example iterates 1 billion times, utilizing a nested loop:

```ruby
u = ARGV[0].to_i       
r = rand(10_000)                          
a = Array.new(10_000, 0)                 
	
(0...10_000).each do |i|                     
  (0...100_000).each do |j|               
    a[i] += j % u                     
  end
  a[i] += r                      
end
	
puts a[r]
```
The Fibonacci example is a basic “naive” Fibonacci implementation[^1]:

	def fibonacci(n)
	  return 0 if n == 0
	  return 1 if n == 1
	  fibonacci(n - 1) + fibonacci(n - 2)
	end
	
	u = ARGV[0].to_i
	r = 0
	
	(1...u).each do |i|
	  r += fibonacci(i)
	end
	
	puts r

Run on [@BenjDicken](https://x.com/BenjDicken)’s M3 MacBook Pro, Ruby 3.3.6 takes 28 seconds to run the loop iteration example, and 12 seconds to run the Fibonacci example. For comparison, node.js takes a little over a second for both examples - it’s not a great showing for Ruby. 

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
      &lt;th&gt;Fibonacci&lt;/th&gt;
      &lt;th&gt;Loops&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Ruby&lt;/td&gt;
      &lt;td&gt;12.17s&lt;/td&gt;
      &lt;td&gt;28.80s&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;node.js&lt;/td&gt;
      &lt;td&gt;1.11s&lt;/td&gt;
      &lt;td&gt;1.03s&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

From this point on, I’ll use benchmarks relative to my own computer. Running the same benchmark on my M2 MacBook Air, I get 33.43 seconds for the loops and 16.33 seconds for Fibonacci - even worse 🥺. Node takes a little over 1 second for Fibonacci and about 2 seconds for the loop example.

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
      &lt;th&gt;Fibonacci&lt;/th&gt;
      &lt;th&gt;Loops&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Ruby&lt;/td&gt;
      &lt;td&gt;16.33s&lt;/td&gt;
      &lt;td&gt;33.43s&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;node.js&lt;/td&gt;
      &lt;td&gt;1.36s&lt;/td&gt;
      &lt;td&gt;2.07s&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

### Who cares?
In most ways, these types of benchmarks are meaningless. Python was the slowest language in the benchmark, and yet at the same time it’s the [most used language on Github as of October 2024](https://github.blog/news-insights/octoverse/octoverse-2024/). Ruby runs some of the [largest web apps in the world](https://x.com/tobi/status/1863935229620363693). I ran a [benchmark recently of websocket performance between the Ruby Falcon web server and node.js](https://x.com/jpcamara/status/1849984009515966958), and the Ruby results were close to the node.js results. Are you doing a billion loop iterations or using web sockets?

A programming language should be reasonably efficient - after that the usefulness of the language, the type of tasks you work on, and language productivity outweigh the speed at which you can run a billion iterations of a loop, or complete an intentionally inefficient implementation of a Fibonacci method.

That said:

1. The programming world loves microbenchmarks 🤷‍♂️
2. A fast benchmark may not be valuable in practice, but it shapes people’s interest in a language. Some would claim it means you’ll have an easier time scaling performance, but that’s arguable[^2]
3. It’s disappointing if your language of choice doesn’t perform well. It’s nice to be able to say “I use and enjoy this language, and it runs fast in all benchmarks!”

In the case of this Ruby benchmark, I had a feeling that YJIT wasn’t being applied in the Ruby code, so I checked the repo. Lo and behold, the command was as follows:

	ruby ./code.rb 40

We know my results from earlier (33 seconds and 16 seconds). What do we get with YJIT applied?

	ruby --yjit ./code.rb 40
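
Before trusting any numbers, it’s worth confirming YJIT is actually on. A quick runtime check:

```ruby
# Prints true when started with --yjit (or after calling RubyVM::YJIT.enable)
puts RubyVM::YJIT.enabled?
```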

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;/th&gt;
      &lt;th&gt;Fibonacci&lt;/th&gt;
      &lt;th&gt;Loops&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Ruby&lt;/td&gt;
      &lt;td&gt;2.06s&lt;/td&gt;
      &lt;td&gt;25.57s&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

Nice! With YJIT, Fibonacci gets a massive boost - going from 16.33 seconds down to 2.06 seconds. It’s close to the speed of node.js at that point!

YJIT makes a more modest difference for the looping example - going from 33.43 seconds down to 25.57 seconds. Why is that?

### A team effort
I wasn’t alone in trying out these code samples with YJIT. On Twitter, [@bsilva96](https://x.com/bsilva96) had asked the same questions:

![](https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-01-at-8.38.46pm.png)

&gt; [https://x.com/bsilva96/status/1861136096689606708](https://x.com/bsilva96/status/1861136096689606708)

[@k0kubun](https://bsky.app/profile/k0kubun.com) came through with insights into why things were slow and ways of improving the performance:

![](https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-01-at-8.41.03pm.png)

&gt; [https://x.com/k0kubun/status/1861149512640979260](https://x.com/k0kubun/status/1861149512640979260)

Let’s unpack his response. There are three parts to it:

1. `Range#each` is still written in C as of Ruby 3.4
2. `Integer#times` was converted from C to Ruby in Ruby 3.3
3. `Array#each` was converted from C to Ruby in Ruby 3.4

### 1. `Range#each` is still written in C, which YJIT can’t optimize
Looking back at our Ruby code:

	(0...10_000).each do |i|                     
	  (0...100_000).each do |j|               
	    a[i] += j % u                     
	  end
	  a[i] += r                      
	end

It’s written as a range, and range has its own `each` implementation, which is apparently written in C. The CRuby codebase is pretty easy to navigate - let’s find that implementation 🕵️‍♂️.

Most core classes in Ruby have top-level C files named after them - in this case we’ve got `range.c` at the root of the project. CRuby has a pretty readable interface for exposing C functions as classes and methods - there is an `Init` function, usually at the bottom of the file. Inside that `Init` our classes, modules and methods are exposed from C to Ruby. Here are the relevant pieces of `Init_Range`:

	void
	Init_Range(void)
	{
	  //...
	  rb_cRange = rb_struct_define_without_accessor(
	    &#34;Range&#34;, rb_cObject, range_alloc,
	    &#34;begin&#34;, &#34;end&#34;, &#34;excl&#34;, NULL);
	
	  rb_include_module(rb_cRange, rb_mEnumerable);
	  // ...
	  rb_define_method(rb_cRange, &#34;each&#34;, range_each, 0);

First, we define our `Range` class using `rb_struct_define...`. We name it `“Range”`, with a superclass of `Object` (`rb_cObject`), and some initialization parameters (`“begin”`, `“end”` and whether to exclude the last value, i.e. the `..` vs `...` range syntax).

Second, we include `Enumerable` using `rb_include_module`. That gives us all the great Ruby enumeration methods like `map`, `select`, `include?` and [a bajillion others](https://docs.ruby-lang.org/en/3.3/Enumerable.html). All you have to do is provide an `each` implementation and it handles the rest.
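
If you’ve never built one yourself, here’s a minimal sketch of that contract (my own illustration, not CRuby code):

```ruby
class Countdown
  include Enumerable

  def initialize(from)
    @from = from
  end

  # The one method Enumerable asks of us
  def each
    return to_enum(:each) unless block_given?
    @from.downto(1) { |i| yield i }
  end
end

Countdown.new(3).map { |i| i * 10 } # => [30, 20, 10]
Countdown.new(3).include?(2)        # => true
```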

Third, we define our `“each”` method. It’s implemented by the `range_each` function in C, and takes zero explicit arguments (blocks are not considered in this count).

`range_each` is hefty. It’s almost 100 lines long, and specializes into several versions of itself. I’ll highlight a few, collapsed all together:

	static VALUE
	range_each(VALUE range)
	{
	  //...
	  range_each_fixnum_endless(beg);
	  range_each_fixnum_loop(beg, end, range);
	  range_each_bignum_endless(beg);
	  rb_str_upto_endless_each(beg, sym_each_i, 0);
	  // and even more...

These C functions handle all the variations of ranges you might use in your own code:

```ruby
(0...).each
(0...100).each
(&#34;a&#34;...&#34;z&#34;).each
# and on...
```
Why does it matter that `Range#each` is written in C? It means YJIT can’t inspect it - optimizations stop at the function call and resume when the function call returns. C functions are fast, but YJIT can take things further by creating specializations for hot paths of code. There is a great article from Aaron Patterson called [Ruby Outperforms C](https://railsatscale.com/2023-08-29-ruby-outperforms-c/) where you can learn more about some of those specialized optimizations.

### 2. Optimizing our loop: `Integer#times` was converted from C to Ruby in Ruby 3.3
The hot path (_where most of our CPU time is spent_) is `Range#each`, which is a C function. YJIT can’t optimize C functions - they’re a black box. So what can we do?

&gt; We converted Integer#times to Ruby in 3.3

Interesting! In Ruby 3.3, `Integer#times` was [converted from a C function to a Ruby method](https://github.com/ruby/ruby/pull/8388)! Here’s the 3.3+ version - it’s pretty simple:

	def times
	  #... a little C interop code
	  i = 0
	  while i &lt; self
	    yield i
	    i = i.succ
	  end
	  self
	end

It’s just a basic while loop. Most importantly, it’s all Ruby code, which means YJIT should be able to introspect and optimize it!

### An aside on `Integer#succ`
The slightly odd part of that code is `i.succ`. I’d never heard of `Integer#succ`, which apparently gives you the “successor” to an integer.

![](https://cdn.uploads.micro.blog/98548/2024/0c8bd56f64.png)

&gt; I’ve never seen this show, and yet it’s the first thing I thought of when I learned about this method. Thanks, advertising.

There was a PR to improve the performance of `Integer#succ` in early 2024, which helped me understand why anyone would ever use it:

&gt; We use Integer#succ when we rewrite loop methods in Ruby (e.g. Integer#times and Array#each) because opt\_succ (i = i.succ) is faster to dispatch on the interpreter than putobject 1; opt\_plus (i += 1).
&gt; 
&gt; [https://github.com/ruby/ruby/pull/9519](https://github.com/ruby/ruby/pull/9519)

`Integer#succ` is like a virtual machine cheat code. It takes a common operation (adding 1 to an integer) and turns it from two virtual machine operations into one. We can call `disasm` on the `times` method to see that in action:

	puts RubyVM::InstructionSequence.disasm(1.method(:times))

The `Integer#times` method gets broken down into a lot of Ruby VM bytecode, but we only care about a few lines:

	...
	0025 getlocal_WC_0   i@0
	0027 opt_succ        &lt;calldata!mid:succ, ARGS_SIMPLE&gt;[CcCr]
	0029 setlocal_WC_0   i@0
	...

- `getlocal_WC_0` gets our `i` variable from the current scope. That’s the `i` in `i.succ`
- `opt_succ` performs the `succ` call in our `i.succ`. It will either call the actual `Integer#succ` method, or an optimized C function for small numbers 
- In Ruby 3.4 with YJIT enabled, small numbers get optimized even further into machine code (just a note - this isn’t shown in the VM bytecode)
- `setlocal_WC_0` sets the result of `opt_succ` to our local variable `i`

If we change from `i = i.succ` to `i += 1`, we now have two VM operations taking the place of `opt_succ`:

	...
	0025 getlocal_WC_0        i@0
	0027 putobject_INT2FIX_1_
	0028 opt_plus             &lt;calldata!mid:+, argc:1, ARGS_SIMPLE&gt;
	0029 setlocal_WC_0        i@0
	...

Everything is essentially the same as before, except now we have two steps to go through instead of one:

- `putobject_INT2FIX_1_` pushes the integer `1` onto the virtual machine stack
- `opt_plus` is the `+` in our `+= 1`, and calls either the Ruby `+` method or an optimized C function for small numbers 
- There is probably a YJIT optimization for `opt_plus` as well

If there is nothing else to learn from this code, it’s this: the kinds of optimizations you do at the VM and JIT level are _deep_. When writing general Ruby programs we typically don’t and _shouldn’t_ consider the impact of one versus two _machine code instructions_. But at the JIT level, on the scale of millions and billions of operations, it matters!

### Back to `Integer#times`
Let’s try running our benchmark code again, using `times`! Instead of iterating over ranges, we simply iterate `10_000` and `100_000` times:

```ruby
u = ARGV[0].to_i        
r = rand(10_000)        
a = Array.new(10_000, 0)
	
10_000.times do |i|
  100_000.times do |j|
    a[i] += j % u
  end
  a[i] += r
end
	
puts a[r]
```
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Loops&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Integer#times&lt;/td&gt;
&lt;td&gt;13.66s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

Nice! YJIT makes a much larger impact using `Integer#times` - down to 13.66 seconds on my machine. On [@k0kubun](https://bsky.app/profile/k0kubun.com)’s machine it actually goes down to 9 seconds (and 8 seconds on Ruby 3.4).

&gt; It’s probably Ruby 3.5’s job to make it faster than 8s though.

We might look forward to even faster performance in Ruby 3.5. We’ll see!

### 3. `Array#each` was converted from C to Ruby in Ruby 3.4
CRuby continues to see C code rewritten in Ruby, and in Ruby 3.4 `Array#each` was one of those changes. Here is an [example of the first attempt at implementing it](https://github.com/ruby/ruby/pull/6687/files):

	def each
	  unless block_given?
	    return to_enum(:each) { self.length }
	  end
	  i = 0
	  while i &lt; self.length
	    yield self[i]
	    i = i.succ
	  end
	  self
	end

Super simple and readable! And YJIT optimizable!

Unfortunately, due to something related to CRuby internals, it contained [race conditions](https://jpcamara.com/2024/06/23/your-ruby-programs.html#race-conditions). A later implementation [landed in Ruby 3.4](https://github.com/ruby/ruby/pull/11955). 

	def each
	  Primitive.attr! :inline_block, :c_trace
	
	  unless defined?(yield)
	    return Primitive.cexpr! &#39;SIZED_ENUMERATOR(self, 0, 0, ary_enum_length)&#39;
	  end
	  _i = 0
	  value = nil
	  while Primitive.cexpr!(%q{ ary_fetch_next(self, LOCAL_PTR(_i), LOCAL_PTR(value)) })
	    yield value
	  end
	  self
	end

Unlike the first implementation, and unlike `Integer#times`, things are a bit more cryptic this time. This is definitely not pure Ruby code that anyone could be expected to write. Somehow, the `Primitive` module seems to allow evaluating C code from Ruby, and in doing so avoids the race conditions present in the pure Ruby solution[^3].

By fetching indexes and values using C code, I think it results in a more atomic operation. I have no idea why the `Primitive.cexpr!` is used to return the enumerator, or what value `Primitive.attr! :inline_block` provides. Please comment if you have insights there!

I was a little loose with my earlier `Integer#times` source code, too. It actually had a bit of this `Primitive` syntax as well. The core of the method is what we looked at, and it’s all Ruby, but the start of the method contains the same `Primitive` calls for `:inline_block` and returning the enumerator:

	def times
	  Primitive.attr! :inline_block
	  unless defined?(yield)
	    return Primitive.cexpr! &#39;SIZED_ENUMERATOR(self, 0, 0, int_dotimes_size)&#39;
	  end
	  #...

Ok - it’s more cryptic than `Integer#times` was, but `Array#each` is mostly Ruby (on Ruby 3.4+). Let’s give it a try using arrays instead of ranges or `times`:

```ruby
u = ARGV[0].to_i
r = rand(10_000)
a = Array.new(10_000, 0)
	
outer = (0...10_000).to_a.freeze
inner = (0...100_000).to_a.freeze
outer.each do |i|
  inner.each do |j|
    a[i] += j % u
  end
  a[i] += r
end
	
puts a[r]
```
Despite the embedded C code, YJIT still seems capable of making some hefty performance optimizations. It’s within the same range as `Integer#times`!

&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Loops&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Integer#times&lt;/td&gt;
&lt;td&gt;13.66s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Array#each&lt;/td&gt;
&lt;td&gt;13.96s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

### Microbenchmarking Ruby performance
I’ve forked the original language implementation repo, and created my own repository called “Ruby Microbench”. It includes all of the examples discussed, as well as several other ways of doing the iteration in Ruby: [https://github.com/jpcamara/ruby\_microbench](https://github.com/jpcamara/ruby_microbench)

Here is the output of just running those using Ruby 3.4 with and without YJIT:

&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;fibonacci&lt;/th&gt;
&lt;th&gt;array#each&lt;/th&gt;
&lt;th&gt;range#each&lt;/th&gt;
&lt;th&gt;times&lt;/th&gt;
&lt;th&gt;for&lt;/th&gt;
&lt;th&gt;while&lt;/th&gt;
&lt;th&gt;loop do&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4 YJIT&lt;/td&gt;
&lt;td&gt;2.19s&lt;/td&gt;
&lt;td&gt;14.02s&lt;/td&gt;
&lt;td&gt;26.61s&lt;/td&gt;
&lt;td&gt;13.12s&lt;/td&gt;
&lt;td&gt;14.91s&lt;/td&gt;
&lt;td&gt;37.10s&lt;/td&gt;
&lt;td&gt;13.95s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4&lt;/td&gt;
&lt;td&gt;16.49s&lt;/td&gt;
&lt;td&gt;34.29s&lt;/td&gt;
&lt;td&gt;33.88s&lt;/td&gt;
&lt;td&gt;33.18s&lt;/td&gt;
&lt;td&gt;36.32s&lt;/td&gt;
&lt;td&gt;37.14s&lt;/td&gt;
&lt;td&gt;50.65s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

I have no idea why the `while` loop example I wrote seems to be so slow. I’d expected it to run much faster. Maybe there’s an issue with how I wrote it - feel free to open an issue or PR if you see something wrong with my implementation. The `loop do` (taken from [@timtilberg](https://bsky.app/profile/timtilberg.bsky.social)’s [example](https://x.com/timtilberg/status/1861194052516864004)) runs around the same speed as `Integer#times` - although its performance is _awful_ with YJIT turned off.
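
For reference, a `while`-based version of the benchmark looks roughly like this (a sketch of the shape - the repo version may differ slightly):

```ruby
u = ARGV[0].to_i
r = rand(10_000)
a = Array.new(10_000, 0)

i = 0
while i < 10_000
  j = 0
  while j < 100_000
    a[i] += j % u
    j += 1
  end
  a[i] += r
  i += 1
end

puts a[r]
```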

&gt; 📝 The `for in` loop and `array#each` have very similar performance, and that&#39;s because at the Ruby VM bytecode level they are almost identical. `for in` is mostly syntactic sugar that transforms into an `#each` call in the VM. Thanks to [Daniel Colson](https://ruby.social/@dodecadaniel) for pointing this out, and you can read his [for loops in Ruby](https://danieljamescolson.com/blog/for-loops) article for some additional information and nuance around `for in`!
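
You can see that desugaring yourself by disassembling a tiny `for` loop and looking for the `each` call:

```ruby
puts RubyVM::InstructionSequence.compile("for i in [1, 2, 3]; puts i; end").disasm
# Somewhere in the output you should see the each dispatch, something like:
#   send <calldata!mid:each, argc:0>
```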

In addition to running Ruby 3.4, for fun I have it using `rbenv` to run:

- Ruby 3.3
- Ruby 3.3 YJIT
- Ruby 3.2
- Ruby 3.2 YJIT
- TruffleRuby 24.1
- Ruby Artichoke
- MRuby

A few of the test runs are listed here:

&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;fibonacci&lt;/th&gt;
&lt;th&gt;array#each&lt;/th&gt;
&lt;th&gt;range#each&lt;/th&gt;
&lt;th&gt;times&lt;/th&gt;
&lt;th&gt;for&lt;/th&gt;
&lt;th&gt;while&lt;/th&gt;
&lt;th&gt;loop do&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4 YJIT&lt;/td&gt;
&lt;td&gt;2.19s&lt;/td&gt;
&lt;td&gt;14.02s&lt;/td&gt;
&lt;td&gt;26.61s&lt;/td&gt;
&lt;td&gt;13.12s&lt;/td&gt;
&lt;td&gt;14.91s&lt;/td&gt;
&lt;td&gt;37.10s&lt;/td&gt;
&lt;td&gt;13.95s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ruby 3.4&lt;/td&gt;
&lt;td&gt;16.49s&lt;/td&gt;
&lt;td&gt;34.29s&lt;/td&gt;
&lt;td&gt;33.88s&lt;/td&gt;
&lt;td&gt;33.18s&lt;/td&gt;
&lt;td&gt;36.32s&lt;/td&gt;
&lt;td&gt;37.14s&lt;/td&gt;
&lt;td&gt;50.65s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TruffleRuby 24.1&lt;/td&gt;
&lt;td&gt;0.92s&lt;/td&gt;
&lt;td&gt;0.97s&lt;/td&gt;
&lt;td&gt;0.92s&lt;/td&gt;
&lt;td&gt;2.39s&lt;/td&gt;
&lt;td&gt;2.06s&lt;/td&gt;
&lt;td&gt;3.90s&lt;/td&gt;
&lt;td&gt;0.77s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;MRuby 3.3&lt;/td&gt;
&lt;td&gt;28.83s&lt;/td&gt;
&lt;td&gt;144.65s&lt;/td&gt;
&lt;td&gt;126.40s&lt;/td&gt;
&lt;td&gt;128.22s&lt;/td&gt;
&lt;td&gt;133.58s&lt;/td&gt;
&lt;td&gt;91.55s&lt;/td&gt;
&lt;td&gt;144.93s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Artichoke&lt;/td&gt;
&lt;td&gt;19.71s&lt;/td&gt;
&lt;td&gt;236.10s&lt;/td&gt;
&lt;td&gt;214.55s&lt;/td&gt;
&lt;td&gt;214.51s&lt;/td&gt;
&lt;td&gt;215.95s&lt;/td&gt;
&lt;td&gt;174.70s&lt;/td&gt;
&lt;td&gt;264.67s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

Based on that, I’ve taken the original visualization and made a Ruby-specific one here, just for the `fibonacci` run:

&lt;div id=&#34;ruby_visual&#34;&gt;&lt;/div&gt;

### Speeding up `range#each`
Can we, the non-[@k0kubun](https://bsky.app/profile/k0kubun.com)s of the world, make `range#each` faster? If I monkey patch the `Range` class with a pure-Ruby implementation, things _do_ get much faster! Here’s my implementation:

	class Range
	  def each
	    beginning = self.begin
	    ending = self.end
	    i = beginning
	    loop do
	      break if i == ending
	      yield i
	      i = i.succ
	    end
	  end
	end

And here is the change in performance - about 3 seconds slower than `times` - not bad!

&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Time spent&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Range#each in C&lt;/td&gt;
&lt;td&gt;25.57s&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Range#each in Ruby&lt;/td&gt;
&lt;td&gt;16.64s&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

This is obviously over-simplified. I don’t handle all of the different cases of `Range`, and there may be nuances I am missing. Also, most of the Ruby-rewritten methods I’ve seen invoke the `Primitive` module for certain operations. I’d love to learn more about when and why it’s needed.

But! It goes to show the power of moving things _out_ of C and letting YJIT optimize our code. It can improve performance in ways that would be difficult or impossible to replicate in regular C code.

### YJIT standard library
In the [Ruby Outperforms C](https://railsatscale.com/2023-08-29-ruby-outperforms-c/) article mentioned earlier, Aaron Patterson rewrote a C extension in Ruby for some GraphQL parsing. The Ruby code outperformed C thanks to YJIT optimizations.

This got me thinking that it would be interesting to see a kind of “YJIT standard library” emerge, where core Ruby functionality written in C could be swapped out for Ruby implementations when running under YJIT.

As it turns out, this is almost exactly what the core YJIT team has been doing. In many cases they’ve completely removed C code, but more recently they’ve created a `with_yjit` block. The code only takes effect if YJIT is enabled; otherwise the C code runs. For example, this is how `Array#each` is implemented:

	with_yjit do
	  if Primitive.rb_builtin_basic_definition_p(:each)
	    undef :each
	
	    def each # :nodoc:
	      # ... we examined this code earlier ...
	    end
	  end
	end

As of Ruby 3.3, YJIT can be lazily initialized. Thankfully the `with_yjit` code handles this - the appropriate `with_yjit` versions of methods will be run once YJIT is enabled:

	# Uses C-builtin
	[1, 2, 3].each do |i|
	  puts i
	end
	
	RubyVM::YJIT.enable
	
	# Uses Ruby version, which can be YJIT optimized
	[1, 2, 3].each do |i|
	  puts i
	end

This is because `with_yjit` is a YJIT “hook”, which is called the moment YJIT is enabled. After being called, it is removed from the runtime using `undef :with_yjit`. 
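
As a rough mental model (my own toy sketch, not CRuby’s actual implementation), the hook pattern looks something like this:

```ruby
# Toy sketch of a "run these blocks when the JIT turns on" hook.
# Illustrative only - CRuby wires with_yjit up internally.
module ToyJIT
  HOOKS = []

  def self.with_jit(&block)
    HOOKS << block
  end

  def self.enable
    HOOKS.each(&:call) # swap in the Ruby implementations
    HOOKS.clear        # the hook only ever fires once
  end
end

ToyJIT.with_jit { puts "defining Ruby versions of C methods!" }
ToyJIT.enable
```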

### Investigating YJIT optimizations
We’ve looked at Ruby code. We’ve looked at C code. We’ve looked at Ruby VM bytecode. Why not take it one step deeper and look at some _machine code_? And maybe some Rust code? Hey - where are you going! Don’t walk away while I’m talking to you!

If you _haven’t_ walked away, or skipped to the next section, let’s take a look at a small sliver of YJIT while we’re here!

We can see the machine code YJIT generates 😱. All it takes is building CRuby from source with YJIT debug flags. If you’re on a Mac you can see [my MacOS setup for hacking on CRuby](https://jpcamara.com/2024/12/02/my-macos-setup.html) or [my docker setup for hacking on CRuby](https://jpcamara.com/2024/11/27/my-docker-setup.html) for more elaborate instructions on building Ruby. But the short version is: when you `./configure` Ruby, pass the `--enable-yjit=dev` option:

	./configure --enable-yjit=dev
	make install

Let’s use our `Integer#times` benchmark from earlier as the example Ruby code:

```ruby
u = ARGV[0].to_i
r = rand(10_000)
a = Array.new(10_000, 0)
	
10_000.times do |i|
  100_000.times do |j|
    a[i] += j % u
  end
  a[i] += r
end
	
puts a[r]
```
Because you’ve built Ruby with YJIT in dev mode, you can pass the `--yjit-dump-disasm` flag when running your Ruby program:

	./ruby --yjit --yjit-dump-disasm test.rb 40

Using this, we can see the machine code created. We’ll just focus on one tiny part - the machine code equivalent of the Ruby VM bytecode we read earlier. Here is the original VM bytecode for `opt_succ`, which is generated when you call `i.succ`, the `Integer#succ` method:

	...
	0027 opt_succ        &lt;calldata!mid:succ, ARGS_SIMPLE&gt;[CcCr]
	...

And here is the machine code YJIT generates in this scenario, on my Mac M2 arm64 architecture:

	# Block: times@&lt;internal:numeric&gt;:259 
	# reg_mapping: [Some(Stack(0)), None, None, None, None]
	# Insn: 0027 opt_succ (stack_size: 1)
	# call to Integer#succ
	# guard object is fixnum
	0x1096808c4: tst x1, #1
	0x1096808c8: b.eq #0x109683014
	0x1096808cc: nop 
	0x1096808d0: nop 
	0x1096808d4: nop 
	0x1096808d8: nop 
	0x1096808dc: nop 
	# Integer#succ
	0x1096808e0: adds x11, x1, #2
	0x1096808e4: b.vs #0x109683048
	0x1096808e8: mov x1, x11

To be honest, I understand about 25% of this; the other 75% is me combining my own logic with AI to learn it 🤫. Feel free to yell at me if I get it a little wrong - I’d love to learn more. But here’s how I break it down.

	# Block: times@&lt;internal:numeric&gt;:259

👆🏼This roughly corresponds to the line `i = i.succ` in the `Integer#times` method in `numeric.rb`. I say roughly because in my current code that’s line 258 - maybe it reports the end of the block it runs in, since YJIT compiles “blocks” of code:

	256: while i &lt; self
	257:   yield i
	258:   i = i.succ
	259: end

	# reg_mapping: [Some(Stack(0)), None, None, None, None]
	# Insn: 0027 opt_succ (stack_size: 1)
	# call to Integer#succ

👆🏼I have no idea what `reg_mapping` means - probably mapping how it uses a CPU register? `Insn: 0027 opt_succ` looks very familiar! That’s our VM bytecode! `call to Integer#succ` is just a helpful comment added. YJIT is capable of adding comments to the machine code. We still haven’t even left the safety of the comments 😅.

	# guard object is fixnum

👆🏼This is interesting. I can find a corresponding bit of Rust code that maps directly to this. Let’s take a look at it:

	fn jit_rb_int_succ(
	  //...
	  asm: &amp;mut Assembler,
	  //...
	) -&gt; bool {
	  // Guard the receiver is fixnum
	  let recv_type = asm.ctx.get_opnd_type(StackOpnd(0));
	  let recv = asm.stack_pop(1);
	  if recv_type != Type::Fixnum {
	    asm_comment!(asm, &#34;guard object is fixnum&#34;);
	    asm.test(recv, Opnd::Imm(RUBY_FIXNUM_FLAG as i64));
	    asm.jz(Target::side_exit(Counter::opt_succ_not_fixnum));
	  }
	
	  asm_comment!(asm, &#34;Integer#succ&#34;);
	  let out_val = asm.add(recv, Opnd::Imm(2)); // 2 is untagged Fixnum 1
	  asm.jo(Target::side_exit(Counter::opt_succ_overflow));
	
	  // Push the output onto the stack
	  let dst = asm.stack_push(Type::Fixnum);
	  asm.mov(dst, out_val);
	
	  true
	}

Oh nice! This is the actual YJIT Rust implementation of the `opt_succ` call. This is the optimization [@k0kubun](https://bsky.app/profile/k0kubun.com) made to further improve `opt_succ` performance beyond the bytecode C function calls. We’re in the section that checks whether what we’re operating on is a Fixnum, which is the way CRuby stores small integers internally:

	if recv_type != Type::Fixnum {
	  asm_comment!(asm, &#34;guard object is fixnum&#34;);
	  asm.test(recv, Opnd::Imm(RUBY_FIXNUM_FLAG as i64));
	  asm.jz(Target::side_exit(Counter::opt_succ_not_fixnum));
	}

That becomes this machine code:

	# guard object is fixnum
	0x1096808c4: tst x1, #1
	0x1096808c8: b.eq #0x109683014

`asm.test` generates `tst x1, #1`, which (according to an AI bot I asked) checks the least significant bit - the Fixnum “tag” that marks this value as a Fixnum. If it’s a Fixnum, the AND result is 1, so the zero flag is clear and `b.eq` doesn’t branch. If it’s not a Fixnum, the result is `0`, the zero flag is set, and `b.eq` branches away from this code.

	0x1096808cc: nop 
	0x1096808d0: nop 
	0x1096808d4: nop 
	0x1096808d8: nop 
	0x1096808dc: nop 

🤖 “NOPs for alignment/padding”. Thanks AI. I don’t know why it is needed, but at least I know what it probably is.

Finally, we _actually_ add 1 to the number. 

	asm_comment!(asm, &#34;Integer#succ&#34;);
	let out_val = asm.add(recv, Opnd::Imm(2)); // 2 is untagged Fixnum 1
	asm.jo(Target::side_exit(Counter::opt_succ_overflow));
	
	// Push the output onto the stack
	let dst = asm.stack_push(Type::Fixnum);
	asm.mov(dst, out_val);

The Rust code generates our `Integer#succ` comment. Then comes the actual addition: because of the “Fixnum tag” bit embedded within our integer, adding 1 actually means adding 2, using `adds x11, x1, #2` 😵‍💫. If we overflow the space available, it exits to a different code path - `b.vs` is a branch on overflow. Otherwise, it stores the result with `mov x1, x11`!

	# Integer#succ
	0x1096808e0: adds x11, x1, #2
	0x1096808e4: b.vs #0x109683048
	0x1096808e8: mov x1, x11
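
To make that tag-bit arithmetic concrete, here’s a plain-Ruby model of the encoding (an illustration of the idea, not CRuby’s actual C representation):

```ruby
# A Fixnum n is stored as (n << 1) | 1, so the low bit is always 1
def to_fixnum(n)   = (n << 1) | 1
def from_fixnum(v) = v >> 1

tagged = to_fixnum(41)  # => 83
tagged & 1              # => 1, the tag bit that tst checks
from_fixnum(tagged + 2) # => 42, adding 2 tagged == adding 1 untagged
```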

😮‍💨. That was a lot. And it seems like _a lot_ of work is being done, but because it’s such low-level machine code it’s presumably super fast. We examined a teensy tiny portion of what YJIT is capable of generating - JITs are complicated!

Thanks to [@k0kubun](https://bsky.app/profile/k0kubun.com) for providing me with the commands and pointing me at the [YJIT docs](https://github.com/ruby/ruby/blob/master/doc/yjit/yjit.md), which contain tons of additional options as well.

### The future of CRuby optimizations 
The irony of language implementation is that you often work less in the language you’re implementing than you do in something lower-level - in Ruby’s case, that’s mostly C and some Rust.

With a layer like YJIT, it potentially opens up a future where more of the language becomes plain Ruby, and Ruby developer contribution is easier. Many languages have a smaller low level core, and the majority of the language is written in itself (like Java, for instance). Maybe that’s a future for CRuby, someday! Until then, keep the YJIT optimizations coming, YJIT team!

[^1]:	Naive in this case meaning that there are more efficient ways to calculate Fibonacci numbers in a program

[^2]:	MJIT, the precursor to YJIT, made Ruby much faster on certain benchmarks. But on large realistic Rails applications it actually [made things _slower_](https://bugs.ruby-lang.org/issues/14490)

[^3]:	When C code is running, it has to opt-in to releasing the GVL, so it’s more difficult for threads to corrupt or modify data mid-operation. The original Ruby version could yield the GVL at points that would invalidate the array. That’s my understanding of the situation anyways.

&lt;script src=&#34;https://jpcamara.com/d3.v7.min.js&#34;&gt;&lt;/script&gt;

&lt;script src=&#34;https://jpcamara.com/latency_visual.js&#34;&gt;&lt;/script&gt;
</source:markdown>
    </item>
    
    <item>
      <title>My MacOS setup for hacking on CRuby</title>
      <link>https://jpcamara.com/2024/12/02/my-macos-setup.html</link>
      <pubDate>Mon, 02 Dec 2024 22:59:05 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/12/02/my-macos-setup.html</guid>
      <description>&lt;p&gt;I recently posted &lt;a href=&#34;https://jpcamara.com/2024/11/27/my-docker-setup.html&#34;&gt;my docker setup for hacking on CRuby&lt;/a&gt;, which showed how I test Linux features when working on CRuby. But most of the time, I just build CRuby directly on MacOS.&lt;/p&gt;
&lt;p&gt;The &lt;a href=&#34;https://docs.ruby-lang.org/en/master/contributing/building_ruby_md.html&#34;&gt;Building Ruby&lt;/a&gt; guide from ruby-lang.org is the most up-to-date guide on doing this, but I like to spell it out exactly in order of how I do it day-to-day. So this is for me more than anything, but you may find it helpful!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-bash&#34; data-lang=&#34;bash&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# /bin/bash -c &amp;#34;$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)&amp;#34;&lt;/span&gt;
xcode-select --install
brew update
brew install openssl@3
brew install autoconf
brew install gperf
brew install libffi
brew install libyaml
brew install zlib
brew install gmp
curl --proto &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;=https&amp;#39;&lt;/span&gt; --tlsv1.2 -sSf https://sh.rustup.rs | sh

export CONFIGURE_ARGS&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; ext in openssl readline libyaml zlib; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  CONFIGURE_ARGS&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;CONFIGURE_ARGS&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt; --with-&lt;/span&gt;$ext&lt;span style=&#34;color:#e6db74&#34;&gt;-dir=&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;$(&lt;/span&gt;brew --prefix $ext&lt;span style=&#34;color:#66d9ef&#34;&gt;)&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;done&lt;/span&gt;

./autogen.sh
mkdir build &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; cd build
mkdir ./.rubies

../configure --prefix&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;/path/to/ruby/build/.rubies/ruby-master&amp;#34;&lt;/span&gt; --disable-install-doc --config-cache --enable-debug-env optflags&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;-O0 -fno-omit-frame-pointer&amp;#34;&lt;/span&gt; CFLAGS&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;-DRUBY_DEBUG -O0&amp;#34;&lt;/span&gt; --with-opt-dir&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;$(&lt;/span&gt;brew --prefix gmp&lt;span style=&#34;color:#66d9ef&#34;&gt;)&lt;/span&gt;:&lt;span style=&#34;color:#66d9ef&#34;&gt;$(&lt;/span&gt;brew --prefix jemalloc&lt;span style=&#34;color:#66d9ef&#34;&gt;)&lt;/span&gt;

make install
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;That&amp;rsquo;s pretty much everything I do when setting things up!&lt;/p&gt;
&lt;p&gt;If you want to run YJIT in &amp;ldquo;dev&amp;rdquo; mode, you add &lt;code&gt;--enable-yjit=dev&lt;/code&gt; to the &lt;code&gt;configure&lt;/code&gt; call:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-bash&#34; data-lang=&#34;bash&#34;&gt;../configure --prefix&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;/path/to/ruby/build/.rubies/ruby-master&amp;#34;&lt;/span&gt; --disable-install-doc --config-cache --enable-debug-env optflags&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;-O0 -fno-omit-frame-pointer&amp;#34;&lt;/span&gt; CFLAGS&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;-DRUBY_DEBUG -O0&amp;#34;&lt;/span&gt; --with-opt-dir&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;$(&lt;/span&gt;brew --prefix gmp&lt;span style=&#34;color:#66d9ef&#34;&gt;)&lt;/span&gt;:&lt;span style=&#34;color:#66d9ef&#34;&gt;$(&lt;/span&gt;brew --prefix jemalloc&lt;span style=&#34;color:#66d9ef&#34;&gt;)&lt;/span&gt; --enable-yjit&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;dev
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;From here, the simplest way to run some code is to place a &lt;code&gt;test.rb&lt;/code&gt; file in the root of the project and run it using &lt;code&gt;make runruby&lt;/code&gt;. To run it in a debug mode, you can run &lt;code&gt;make lldb-ruby&lt;/code&gt;.&lt;/p&gt;
</description>
      <source:markdown>I recently posted [my docker setup for hacking on CRuby](https://jpcamara.com/2024/11/27/my-docker-setup.html), which showed how I test Linux features when working on CRuby. But most of the time, I just build CRuby directly on MacOS. 

The [Building Ruby](https://docs.ruby-lang.org/en/master/contributing/building_ruby_md.html) guide from ruby-lang.org is the most up-to-date guide on doing this, but I like to spell it out exactly in order of how I do it day-to-day. So this is for me more than anything, but you may find it helpful!

```bash
# /bin/bash -c &#34;$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)&#34;
xcode-select --install
brew update
brew install openssl@3
brew install autoconf
brew install gperf
brew install libffi
brew install libyaml
brew install zlib
brew install gmp
curl --proto &#39;=https&#39; --tlsv1.2 -sSf https://sh.rustup.rs | sh

export CONFIGURE_ARGS=&#34;&#34;
for ext in openssl readline libyaml zlib; do
  CONFIGURE_ARGS=&#34;${CONFIGURE_ARGS} --with-$ext-dir=$(brew --prefix $ext)&#34;
done

./autogen.sh
mkdir build &amp;&amp; cd build
mkdir ./.rubies

../configure --prefix=&#34;/path/to/ruby/build/.rubies/ruby-master&#34; --disable-install-doc --config-cache --enable-debug-env optflags=&#34;-O0 -fno-omit-frame-pointer&#34; CFLAGS=&#34;-DRUBY_DEBUG -O0&#34; --with-opt-dir=$(brew --prefix gmp):$(brew --prefix jemalloc)

make install
```
That&#39;s pretty much everything I do when setting things up!

If you want to run YJIT in &#34;dev&#34; mode, you add `--enable-yjit=dev` to the `configure` call:

```bash
../configure --prefix=&#34;/path/to/ruby/build/.rubies/ruby-master&#34; --disable-install-doc --config-cache --enable-debug-env optflags=&#34;-O0 -fno-omit-frame-pointer&#34; CFLAGS=&#34;-DRUBY_DEBUG -O0&#34; --with-opt-dir=$(brew --prefix gmp):$(brew --prefix jemalloc) --enable-yjit=dev
```
From here, the simplest way to run some code is to place a `test.rb` file in the root of the project and run it using `make runruby`. To run it in a debug mode, you can run `make lldb-ruby`.
</source:markdown>
    </item>
    
    <item>
      <title>Counting C method calls in CRuby</title>
      <link>https://jpcamara.com/2024/11/28/counting-c-method.html</link>
      <pubDate>Thu, 28 Nov 2024 06:30:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/11/28/counting-c-method.html</guid>
      <description>&lt;p&gt;There is a central macro in CRuby, &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt;, which is a very hot path for the Ruby runtime. It&amp;rsquo;s an important part of how threads are managed, and it&amp;rsquo;s called &lt;em&gt;constantly&lt;/em&gt;. I was curious just how often it was called, and it turned out CRuby comes with some handy debugging functionality for just this scenario.&lt;/p&gt;
&lt;p&gt;Inside of &lt;code&gt;debug_counter.h&lt;/code&gt;, I changed &lt;code&gt;#define USE_DEBUG_COUNTER 0&lt;/code&gt; to &lt;code&gt;#define USE_DEBUG_COUNTER 1&lt;/code&gt; and added this line later in that file:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;RB_DEBUG_COUNTER(rb_vm_check_ints)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Then inside &lt;code&gt;vm_core.h&lt;/code&gt; I updated &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt; to add a debug increment:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;#define RUBY_VM_CHECK_INTS(ec) rb_vm_check_ints(ec)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;static&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;inline&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;void&lt;/span&gt;
&lt;span style=&#34;color:#a6e22e&#34;&gt;rb_vm_check_ints&lt;/span&gt;(rb_execution_context_t &lt;span style=&#34;color:#f92672&#34;&gt;*&lt;/span&gt;ec)
{
    RB_DEBUG_COUNTER_INC(rb_vm_check_ints); &lt;span style=&#34;color:#75715e&#34;&gt;// increment!
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;After that I ran the following simple Ruby program:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;10_000&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times {}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And this was printed after it ran:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;[RUBY_DEBUG_COUNTER]    rb_vm_check_ints    21,055
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Iterating a loop ten thousand times results in over twenty thousand calls to &lt;code&gt;RUBY_VM_CHECK_INTS&lt;/code&gt;, exactly what I was looking to measure!&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;d like to know the proper configuration to compile without having to manually modify &lt;code&gt;USE_DEBUG_COUNTER&lt;/code&gt; in the header file. Maybe someone can comment and let me know how? It has something to do with &lt;code&gt;CFLAGS&lt;/code&gt;, I think.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Update 12/5/24&lt;/em&gt; Thanks to &lt;a href=&#34;https://ruby.social/@onghu/113581774915874979&#34;&gt;Mohit Sindhwani for some advice on how to add the CFLAGS!&lt;/a&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-05-at-7.13.06pm.png&#34; width=&#34;600&#34; height=&#34;357&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
</description>
      <source:markdown>There is a central macro in CRuby, `RUBY_VM_CHECK_INTS`, which is a very hot path for the Ruby runtime. It&#39;s an important part of how threads are managed, and it&#39;s called _constantly_. I was curious just how often it was called, and it turned out CRuby comes with some handy debugging functionality for just this scenario.

Inside of `debug_counter.h`, I changed `#define USE_DEBUG_COUNTER 0` to `#define USE_DEBUG_COUNTER 1` and added this line later in that file:

```c
RB_DEBUG_COUNTER(rb_vm_check_ints)
```
Then inside `vm_core.h` I updated `RUBY_VM_CHECK_INTS` to add a debug increment:

```c
#define RUBY_VM_CHECK_INTS(ec) rb_vm_check_ints(ec)
static inline void
rb_vm_check_ints(rb_execution_context_t *ec)
{
    RB_DEBUG_COUNTER_INC(rb_vm_check_ints); // increment!
```
After that I ran the following simple Ruby program:

```ruby
10_000.times {}
```
And this was printed after it ran:

```text
[RUBY_DEBUG_COUNTER]    rb_vm_check_ints    21,055
```
Iterating a loop ten thousand times results in over twenty thousand calls to `RUBY_VM_CHECK_INTS`, exactly what I was looking to measure!

I&#39;d like to know the proper configuration to compile without having to manually modify `USE_DEBUG_COUNTER` in the header file. Maybe someone can comment and let me know how? It has something to do with `CFLAGS`, I think.

*Update 12/5/24* Thanks to [Mohit Sindhwani for some advice on how to add the CFLAGS!](https://ruby.social/@onghu/113581774915874979)
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/screenshot-2024-12-05-at-7.13.06pm.png&#34; width=&#34;600&#34; height=&#34;357&#34; alt=&#34;&#34;&gt;
</source:markdown>
    </item>
    
    <item>
      <title>My docker setup for hacking on CRuby</title>
      <link>https://jpcamara.com/2024/11/27/my-docker-setup.html</link>
      <pubDate>Wed, 27 Nov 2024 06:43:15 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/11/27/my-docker-setup.html</guid>
      <description>&lt;p&gt;I run on MacOS, but I often want to test Linux behaviors when working on the CRuby implementation.&lt;/p&gt;
&lt;p&gt;Here&amp;rsquo;s the &lt;code&gt;Dockerfile&lt;/code&gt; I use:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;FROM ubuntu:24.04

# Preventing dialog prompts when installing packages
ENV DEBIAN_FRONTEND=noninteractive

# Update and install basic build dependencies and Rust
RUN apt-get update &amp;amp;&amp;amp; apt-get install -y \
    git \
    curl \
    build-essential \
    autoconf \
    libreadline-dev \
    libssl-dev \
    libyaml-dev \
    libncurses5-dev \
    zlib1g-dev \
    libffi-dev \
    bison \
    libgdbm-dev \
    libgdbm-compat-dev \
    libreadline6-dev \
    libssl-dev \
    libgmp-dev \
    liburing-dev \
    &amp;amp;&amp;amp; apt-get clean &amp;amp;&amp;amp; rm -rf /var/lib/apt/lists/*

# Install Rust via rustup
RUN curl --proto &amp;#39;=https&amp;#39; --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y

RUN apt-get update &amp;amp;&amp;amp; apt-get install -y ruby
RUN apt-get update &amp;amp;&amp;amp; apt-get install -y gdb

# Add Rust to the PATH so the cargo and rustc commands are available
ENV PATH=&amp;#34;/root/.cargo/bin:${PATH}&amp;#34;

# Create a directory for the Ruby source code
WORKDIR /usr/src/ruby

# Copy Ruby source code from your local directory
COPY . .

# This will be the default command when you run the container.
CMD [ &amp;#34;/bin/bash&amp;#34; ]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;To run the &lt;code&gt;Dockerfile&lt;/code&gt;, you can use the following two commands:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;docker build -t ruby-source-build-env .
docker run -it --mount type=bind,src=.,target=/usr/src/ruby ruby-source-build-env
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Based on our &lt;code&gt;Dockerfile&lt;/code&gt;, &lt;code&gt;docker run&lt;/code&gt; will open up a &lt;code&gt;bash&lt;/code&gt; shell for you. From there, I run the following commands to build CRuby:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;./autogen.sh
mkdir build &amp;amp;&amp;amp; cd build
mkdir ./.rubies
../configure --prefix=&amp;quot;/usr/src/ruby/build/.rubies/ruby-master&amp;quot; --disable-install-doc --config-cache --enable-debug-env optflags=&amp;quot;-O0 -fno-omit-frame-pointer&amp;quot;
make install
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We now have CRuby operating under Ubuntu Linux! From here, the simplest way to run some code is to place a &lt;code&gt;test.rb&lt;/code&gt; file in the root of the project and run it using &lt;code&gt;make runruby&lt;/code&gt;.&lt;/p&gt;
</description>
      <source:markdown>I run on macOS, but I often want to test Linux behaviors when working on the CRuby implementation.


Here&#39;s the `Dockerfile` I use:


```text
FROM ubuntu:24.04

# Preventing dialog prompts when installing packages
ENV DEBIAN_FRONTEND=noninteractive

# Update and install basic build dependencies and Rust
RUN apt-get update &amp;&amp; apt-get install -y \
    git \
    curl \
    build-essential \
    autoconf \
    libreadline-dev \
    libssl-dev \
    libyaml-dev \
    libncurses5-dev \
    zlib1g-dev \
    libffi-dev \
    bison \
    libgdbm-dev \
    libgdbm-compat-dev \
    libreadline6-dev \
    libssl-dev \
    libgmp-dev \
    liburing-dev \
    &amp;&amp; apt-get clean &amp;&amp; rm -rf /var/lib/apt/lists/*

# Install Rust via rustup
RUN curl --proto &#39;=https&#39; --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y

RUN apt-get update &amp;&amp; apt-get install -y ruby
RUN apt-get update &amp;&amp; apt-get install -y gdb

# Add Rust to the PATH so the cargo and rustc commands are available
ENV PATH=&#34;/root/.cargo/bin:${PATH}&#34;

# Create a directory for the Ruby source code
WORKDIR /usr/src/ruby

# Copy Ruby source code from your local directory
COPY . .

# This will be the default command when you run the container.
CMD [ &#34;/bin/bash&#34; ]
```
To run the `Dockerfile`, you can use the following two commands:




    docker build -t ruby-source-build-env .
    docker run -it --mount type=bind,src=.,target=/usr/src/ruby ruby-source-build-env




Based on our `Dockerfile`, `docker run` will open up a `bash` shell for you. From there, I run the following commands to build CRuby:




    ./autogen.sh
    mkdir build &amp;&amp; cd build
    mkdir ./.rubies
    ../configure --prefix=&#34;/usr/src/ruby/build/.rubies/ruby-master&#34; --disable-install-doc --config-cache --enable-debug-env optflags=&#34;-O0 -fno-omit-frame-pointer&#34;
    make install




We now have CRuby operating under Ubuntu Linux! From here, the simplest way to run some code is to place a `test.rb` file in the root of the project and run it using `make runruby`.
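
For example, still inside the `build` directory, a minimal `test.rb` run might look like this (any Ruby code works; `RUBY_DESCRIPTION` just confirms which build you are running):

```text
echo &#39;puts RUBY_DESCRIPTION&#39; &gt; ../test.rb
make runruby
```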
</source:markdown>
    </item>
    
    <item>
      <title>Calculating the largest known prime in Ruby</title>
      <link>https://jpcamara.com/2024/11/26/looking-to-impress.html</link>
      <pubDate>Tue, 26 Nov 2024 21:53:43 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/11/26/looking-to-impress.html</guid>
      <description>&lt;p&gt;Looking to impress your Ruby friends by calculating the largest known prime, &lt;code&gt;2 ** 136_279_841-1&lt;/code&gt;?&lt;/p&gt;
&lt;p&gt;On Ruby 3.4.0-preview2 and earlier, &lt;code&gt;2 ** 136_279_841-1&lt;/code&gt; logs a warning and returns Infinity 😔:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;**&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;136_279_841&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# warning: in a**b, b may be too big&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# =&amp;gt; Infinity&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Thanks to &lt;a href=&#34;http://mametter.bsky.social&#34;&gt;@mametter&lt;/a&gt;, Ruby 3.4 will handle this calculation just fine! See &lt;a href=&#34;https://github.com/ruby/ruby/pull/12033&#34;&gt;Do not round &lt;code&gt;a**b&lt;/code&gt; to infinity&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Knowing this, you excitedly use your ruby manager of choice to pull down ruby master:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;rvm install ruby-head
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You run &lt;code&gt;ruby -e &amp;quot;puts 2 ** 136_279_841-1&amp;quot;&lt;/code&gt;, and your excitement is slowly eroded. An hour into calculating, you terminate the command in frustration 😫.&lt;/p&gt;
&lt;p&gt;Is &lt;a href=&#34;http://mametter.bsky.social&#34;&gt;@mametter&lt;/a&gt; a &lt;em&gt;liar&lt;/em&gt;?!&lt;/p&gt;
&lt;p&gt;As it turns out, there is a critically important library you need for accelerating &amp;ldquo;Bignum&amp;rdquo; calculations: &lt;a href=&#34;https://gmplib.org/&#34;&gt;GMP, the GNU Multiple Precision Arithmetic Library&lt;/a&gt;. It&amp;rsquo;s even specifically mentioned in the &lt;a href=&#34;https://docs.ruby-lang.org/en/master/contributing/building_ruby_md.html&#34;&gt;CRuby guide to building ruby&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Without it, you can kiss your largest prime calculating dreams goodbye 👋.&lt;/p&gt;
&lt;p&gt;You reinstall ruby head, making sure &lt;code&gt;gmp&lt;/code&gt; is available:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;brew install gmp
rvm reinstall ruby-head --with-gmp-dir=$(brew --prefix gmp)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;With a bit of hope in your heart, you try again:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;ruby -e &amp;#34;puts 2 ** 136_279_841-1&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Success! &lt;a href=&#34;http://mametter.bsky.social&#34;&gt;@mametter&lt;/a&gt; was telling the truth!&lt;/p&gt;
&lt;p&gt;Within around 5 seconds, your terminal is filled with a beautiful output of 41,024,320 digits. Your Ruby friends cheer and carry you off on their shoulders.&lt;/p&gt;
&lt;p&gt;This was all inspired by &lt;a href=&#34;https://matz.bsky.social&#34;&gt;Matz&lt;/a&gt;&amp;rsquo;s keynote at RubyConf 2024 - where he mentioned that Ruby 3.4 can now calculate the largest known prime. For fun, I tried it on my mac and just let it keep running - 2 hours later, it was still running! I&amp;rsquo;d never heard of &lt;a href=&#34;https://gmplib.org/&#34;&gt;GMP&lt;/a&gt;, but now I know!&lt;/p&gt;
</description>
      <source:markdown>Looking to impress your Ruby friends by calculating the largest known prime, `2 ** 136_279_841-1`?


On Ruby 3.4.0-preview2 and earlier, `2 ** 136_279_841-1` logs a warning and returns Infinity 😔:

```ruby
2 ** 136_279_841-1
# warning: in a**b, b may be too big
# =&gt; Infinity
```
Thanks to [@mametter](http://mametter.bsky.social), Ruby 3.4 will handle this calculation just fine! See [Do not round `a**b` to infinity](https://github.com/ruby/ruby/pull/12033).

Knowing this, you excitedly use your ruby manager of choice to pull down ruby master:

```text
rvm install ruby-head
```
You run `ruby -e &#34;puts 2 ** 136_279_841-1&#34;`, and your excitement is slowly eroded. An hour into calculating, you terminate the command in frustration 😫. 


Is [@mametter](http://mametter.bsky.social) a _liar_?!

As it turns out, there is a critically important library you need for accelerating &#34;Bignum&#34; calculations: [GMP, the GNU Multiple Precision Arithmetic Library](https://gmplib.org/). It&#39;s even specifically mentioned in the [CRuby guide to building ruby](https://docs.ruby-lang.org/en/master/contributing/building_ruby_md.html).
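
If you want to double-check whether your ruby was actually built against GMP, one low-tech option is to look at the configure arguments recorded in `RbConfig` - if `--with-gmp-dir` was passed, it should show up there:

```text
ruby -rrbconfig -e &#39;puts RbConfig::CONFIG[&#34;configure_args&#34;]&#39;
```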

Without it, you can kiss your largest prime calculating dreams goodbye 👋.

You reinstall ruby head, making sure `gmp` is available:

```text
brew install gmp
rvm reinstall ruby-head --with-gmp-dir=$(brew --prefix gmp)
```
With a bit of hope in your heart, you try again:

```text
ruby -e &#34;puts 2 ** 136_279_841-1&#34;
```
Success! [@mametter](http://mametter.bsky.social) was telling the truth!

Within around 5 seconds, your terminal is filled with a beautiful output of 41,024,320 digits. Your Ruby friends cheer and carry you off on their shoulders.

This was all inspired by [Matz](https://matz.bsky.social)&#39;s keynote at RubyConf 2024 - where he mentioned that Ruby 3.4 can now calculate the largest known prime. For fun, I tried it on my mac and just let it keep running - 2 hours later, it was still running! I&#39;d never heard of [GMP](https://gmplib.org/), but now I know!
</source:markdown>
    </item>
    
    <item>
      <title>The Thread API : Concurrent, colorless Ruby</title>
      <link>https://jpcamara.com/2024/08/26/the-thread-api-concurrent-colorless.html</link>
      <pubDate>Mon, 26 Aug 2024 17:59:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/08/26/the-thread-api-concurrent-colorless.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/group-74.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is part of a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;The Thread API&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my!&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “The Thread API: Concurrent, colorless Ruby”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#the-thread-api&#34;&gt;The Thread API&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#the-cute-mascot&#34;&gt;Don’t let the cute mascot fool you&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-api-primer&#34;&gt;A thread api primer (with examples!)&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#waiting-to-complete&#34;&gt;Waiting for a thread to complete&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-join&#34;&gt;&lt;code&gt;join&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-value&#34;&gt;&lt;code&gt;value&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-join-timeout&#34;&gt;&lt;code&gt;join(timeout_in_seconds)&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-error-reporting&#34;&gt;Error reporting&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-exceptions&#34;&gt;&lt;code&gt;report_on_exception&lt;/code&gt; and &lt;code&gt;abort_on_exception&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-tracking&#34;&gt;Tracking threads&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-list&#34;&gt;&lt;code&gt;Thread.list&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-name&#34;&gt;&lt;code&gt;name&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-statuses&#34;&gt;&lt;code&gt;status&lt;/code&gt;, &lt;code&gt;alive?&lt;/code&gt; and &lt;code&gt;stop?&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-scheduling&#34;&gt;Thread scheduling&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-schedule-methods&#34;&gt;&lt;code&gt;Thread.pass&lt;/code&gt;, &lt;code&gt;wakeup&lt;/code&gt;, &lt;code&gt;Thread.stop&lt;/code&gt;, &lt;code&gt;run&lt;/code&gt;, and &lt;code&gt;priority&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-shutdown&#34;&gt;Thread shutdown&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#thread-raise-kill&#34;&gt;&lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-handle-interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#process-fork&#34;&gt;&lt;code&gt;Process._fork&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-coordination&#34;&gt;Coordinating threads&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#mutex&#34;&gt;&lt;code&gt;Mutex&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#condition-variable&#34;&gt;&lt;code&gt;ConditionVariable&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#monitor&#34;&gt;&lt;code&gt;Monitor&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#queue&#34;&gt;&lt;code&gt;Queue&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#sized-queue&#34;&gt;&lt;code&gt;SizedQueue&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#thread-group&#34;&gt;&lt;code&gt;ThreadGroup&lt;/code&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#memory-visibility&#34;&gt;Memory visibility&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;the-thread-api&#34;&gt;The Thread API 🧵&lt;/h2&gt;
&lt;p&gt;We&amp;rsquo;re going to break down threads into three parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The Thread API - All the tools available to you in the Ruby runtime to manage threads, and how they work&lt;/li&gt;
&lt;li&gt;Interrupting Threads - How threads get stuck, and how to shut them down safely&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends - Thread architecture and the GVL&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This post covers the Thread API. We&amp;rsquo;ll go over every method available to you, why they matter, how to call them, and often how popular open source projects use them.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ab73de82d9.png&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;the-cute-mascot&#34;&gt;Don’t let the cute mascot fool you&lt;/h3&gt;
&lt;p&gt;Before we start digging into threads, I’d like to make a small disclaimer: &lt;strong&gt;writing safe, deterministic, bug-free threaded code is hard&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;I think understanding threads is valuable knowledge. After all, whether you explicitly use threads or not, &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;your Ruby programs are always multi-threaded&lt;/a&gt;. If you’re always in the context of a thread, it’s helpful to know how they work behind the scenes. Even if the most you ever do is set a thread count on a server, it still helps to know how they work - it better informs what changing those numbers can and can’t do for you.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;seems easy enough? 🤷🏻‍♂️&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Behind the simplicity of the thread interface lives a lot of complexity. An OS thread gives you the literal ability to perform tasks in &lt;em&gt;parallel&lt;/em&gt;. And once things can run in parallel, our sequential thinking starts to fail us. It’s difficult to correctly read through code step by step if at any point that step by step code can swap out with another separate piece of code. No warning, no ability to determine exactly where your program will switch to next.&lt;/p&gt;
&lt;p&gt;It’s the same when multiple people work on some tasks in parallel - you encounter communication breakdown. There can be contention over a shared resource. You can undo someone else’s work, or leave them with inconsistent information that causes them to finish their task, but incorrectly.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/90bc5383-008d-42e3-8532-29cf8c6ad832.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;There’s a cognitive bias known as the &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect&#34;&gt;Dunning Kruger Effect&lt;/a&gt;. Essentially, someone with limited experience in something can overestimate their abilities, or underestimate the complexity of the task. &lt;code&gt;Thread.new {}&lt;/code&gt; feels pretty simple - it’s easy to underestimate what goes into it. Diving into threads together helps us realize they wield a lot of power!&lt;/p&gt;
&lt;p&gt;There’s a reason that most threaded code in gems like Rails, SolidQueue and GoodJob uses the &lt;code&gt;concurrent-ruby&lt;/code&gt; gem. Abstractions are your friend. We’ll dig into abstractions later in the series - for now we’ll learn about threads directly. But don’t stop here! Learn the foundation, and when you need it yourself, learn the abstractions and tools that make it easier.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/24c9850ebd.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: Eli and JP Camara, &lt;a href=&#34;https://x.com/logicalcomic&#34;&gt;https://x.com/logicalcomic&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This post will dig into all the options available in Ruby out of the box. You’ll learn each thread method and what they’re for, and we’ll discuss how to coordinate your threads once you create them. Let’s go!&lt;/p&gt;
&lt;h3 id=&#34;thread-api-primer&#34;&gt;A &lt;a href=&#34;https://rubyapi.org/3.3/o/thread&#34;&gt;thread&lt;/a&gt; api primer (with examples!)&lt;/h3&gt;
&lt;p&gt;We&amp;rsquo;ll start off with some details on how you interact with threads. Let&amp;rsquo;s create a thread!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;finished!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Run that code&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt; 👆🏼… hmmm, nothing happens. Nothing prints and the program exits silently. What happened?&lt;/p&gt;
&lt;p&gt;Everything starts off running in the “main thread”, which is accessible by calling &lt;code&gt;Thread.main&lt;/code&gt;. When a new thread is created it’s like a branch off of the main thread. Those branches exist independently and the main thread doesn’t wait for them to finish by default.&lt;/p&gt;
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-without-join.drawio.png&#34; width=&#34;453&#34; height=&#34;409&#34; alt=&#34;&#34;&gt;
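&lt;p&gt;A crude way to prove the thread really ran is to keep the main thread alive longer than its child (&lt;code&gt;join&lt;/code&gt;, covered next, is the proper tool for this):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Thread.new do
  sleep 10
  puts &amp;quot;finished!&amp;quot;
end
sleep 11 # crude: outlive the child thread
# finished!
&lt;/code&gt;&lt;/pre&gt;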
&lt;h3 id=&#34;waiting-to-complete&#34;&gt;Waiting for a thread to complete&lt;/h3&gt;
&lt;h4 id=&#34;thread-join&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-join&#34;&gt;&lt;code&gt;join&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;To wait on a specific thread, you &lt;code&gt;join&lt;/code&gt; that thread and the current thread together. Here we have the main thread join with &lt;code&gt;t&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;finished!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# We&amp;#39;re at the top level in `Thread.main`, which &amp;#34;joins&amp;#34; with `t` until it finishes&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# finished!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-with-join.drawio.png&#34; width=&#34;529&#34; height=&#34;448&#34; alt=&#34;&#34;&gt;
&lt;p&gt;When the thread finishes, the thread object is returned from &lt;code&gt;join&lt;/code&gt;.&lt;/p&gt;
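&lt;p&gt;A quick way to convince yourself of that:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new {}
t.join == t # =&amp;gt; true
&lt;/code&gt;&lt;/pre&gt;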
&lt;p&gt;———&lt;/p&gt;
&lt;h4 id=&#34;thread-value&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-value&#34;&gt;&lt;code&gt;value&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;You can also join on a thread and ask it for the last value returned using &lt;code&gt;value&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;File&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read(a_file_path)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value &lt;span style=&#34;color:#75715e&#34;&gt;# contents of `a_file_path`&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The simplest way to run some work concurrently is to start up several threads and then iterate over each one calling &lt;code&gt;join&lt;/code&gt; or &lt;code&gt;value&lt;/code&gt;. We’ll make 4 HTTP requests here, and the whole batch will only take as long as the longest request:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# frozen_string_literal: true&lt;/span&gt;
	
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;net/http&amp;#34;&lt;/span&gt;
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;json&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;generate_uuid_thread&lt;/span&gt;
  url &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/uuid&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    response &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Net&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;HTTP&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;get(&lt;span style=&#34;color:#66d9ef&#34;&gt;URI&lt;/span&gt;(url))
    &lt;span style=&#34;color:#66d9ef&#34;&gt;JSON&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;parse(response)&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;uuid&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
uuids &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;times&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  generate_uuid_thread
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:value&lt;/span&gt;)
puts uuids
&lt;span style=&#34;color:#75715e&#34;&gt;# 6b76167d-9bac-45dc-8d0b-5b7af865b843&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# a025a125-51a5-4773-b78d-ab93fba02eb3&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 98befb95-adf7-4c89-9fd0-d10f6b2a3d7a&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 95f486fd-5bc6-46fe-b65a-c52a87140dfb&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-parallel-io.drawio.png&#34; width=&#34;600&#34; height=&#34;449&#34; alt=&#34;&#34;&gt;
&lt;p&gt;Make sure you let each thread start first! If you join too early, you end up running them sequentially:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;uuids &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[]&lt;/span&gt;
uuids &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; generate_uuid_thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value
uuids &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; generate_uuid_thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value
uuids &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; generate_uuid_thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value
uuids &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; generate_uuid_thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value
puts uuids
&lt;span style=&#34;color:#75715e&#34;&gt;# 6b76167d-9bac-45dc-8d0b-5b7af865b843&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# a025a125-51a5-4773-b78d-ab93fba02eb3&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 98befb95-adf7-4c89-9fd0-d10f6b2a3d7a&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 95f486fd-5bc6-46fe-b65a-c52a87140dfb&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The end result looks identical - the &lt;em&gt;output&lt;/em&gt; is the same - but the &lt;em&gt;execution&lt;/em&gt; is totally different (there’s a small timing sketch after these lists if you want to measure it yourself). When we create all the threads up front, certain operations can run in parallel:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Threads 1 through 4 get created and are “runnable”&lt;/li&gt;
&lt;li&gt;The main thread blocks on the first iteration to call &lt;code&gt;map(&amp;amp;:value)&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;As each thread blocks on the HTTP call, they run in parallel&lt;/li&gt;
&lt;li&gt;The loop takes as long as the longest thread, so if it takes us 50ms for 3 requests, and the 4th is 100ms, we spend around 100ms total&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Calling &lt;code&gt;generate_uuid_thread.value&lt;/code&gt; one at a time, we’re just running the code sequentially:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Thread 1 gets created and run, returning its &lt;code&gt;value&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Thread 2 gets created and run, returning its &lt;code&gt;value&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Same for threads 3 and 4&lt;/li&gt;
&lt;li&gt;We’re running sequentially, so the threads provided no value. It would take roughly 250ms. If anything, the threads likely &lt;em&gt;added&lt;/em&gt; overhead.&lt;/li&gt;
&lt;/ul&gt;
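&lt;p&gt;Here’s that timing sketch - it reuses &lt;code&gt;generate_uuid_thread&lt;/code&gt; from above, and the exact numbers will vary with network latency:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def timed
  start = Process.clock_gettime(Process::CLOCK_MONOTONIC)
  yield
  Process.clock_gettime(Process::CLOCK_MONOTONIC) - start
end

# all four threads first, then wait: roughly one request of wall time
puts timed { 4.times.map { generate_uuid_thread }.map(&amp;amp;:value) }
# one at a time: roughly the sum of all four requests
puts timed { 4.times.map { generate_uuid_thread.value } }
&lt;/code&gt;&lt;/pre&gt;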
&lt;blockquote&gt;
&lt;p&gt;⚠️ Never &lt;em&gt;actually&lt;/em&gt; generate a uuid using a web service, that’s crazy 🤣&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;———&lt;/p&gt;
&lt;h4 id=&#34;thread-join-timeout&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-join&#34;&gt;&lt;code&gt;join(timeout_in_seconds)&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;You can also limit the amount of time you &lt;code&gt;join&lt;/code&gt;. If you’re calling &lt;code&gt;join&lt;/code&gt;, typically you just want to wait however long it takes. But using &lt;code&gt;join(seconds)&lt;/code&gt; could be useful to periodically pop into some other work or alert that you’ve been running too long. &lt;code&gt;join(seconds)&lt;/code&gt; will return &lt;code&gt;nil&lt;/code&gt; while the thread is still running, and return the thread once it finishes.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;20&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;alive?
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;wait a bit more... &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;done!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... #&amp;lt;Thread:0x0... main.rb:116 dead&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# done!&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;fib&lt;/span&gt;(n)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; n &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; n &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
  fib(n &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; fib(n &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  fib(&lt;span style=&#34;color:#ae81ff&#34;&gt;40&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;alive?
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;wait a bit more... &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;done! &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# the value is still returned even after a join&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# wait a bit more... &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# done! 9227465&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You could use it to intermittently check for a thread finishing without wasting too many CPU cycles:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;until&lt;/span&gt; t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;still waiting...&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;done!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# done!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In the &lt;a href=&#34;https://github.com/puma/puma&#34;&gt;Puma web server&lt;/a&gt;, it uses &lt;code&gt;join(seconds)&lt;/code&gt; to manage &lt;a href=&#34;https://github.com/puma/puma/blob/master/lib/puma/thread_pool.rb&#34;&gt;shutting down its thread pool&lt;/a&gt;. It iterates over each thread, adjusting the &lt;code&gt;join&lt;/code&gt; timeout based on how much time has elapsed since starting the method:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;join &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&amp;gt;&lt;/span&gt;(inner_timeout) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  start &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;clock_gettime(
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;CLOCK_MONOTONIC&lt;/span&gt;
  )
  threads&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;reject! &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;t&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
    elapsed &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;clock_gettime(
      &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;CLOCK_MONOTONIC&lt;/span&gt;
    ) &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; start
    t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join inner_timeout &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; elapsed
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Wait +timeout+ seconds for threads to finish.&lt;/span&gt;
join&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call(timeout)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It uses &lt;code&gt;join(timeout)&lt;/code&gt; to try and let each thread finish before forcing a shutdown with &lt;code&gt;raise&lt;/code&gt; or &lt;code&gt;kill&lt;/code&gt;. The &lt;code&gt;reject!&lt;/code&gt; removes any threads that finish during that time - &lt;code&gt;nil&lt;/code&gt; returned from &lt;code&gt;join&lt;/code&gt; means the thread is still running, otherwise the thread object is returned (removing it from the array).&lt;/p&gt;
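&lt;p&gt;The return value contract is easy to check in isolation:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;t = Thread.new { sleep 2 }
p t.join(1) # =&amp;gt; nil - timed out, t is still running
p t.join    # =&amp;gt; #&amp;lt;Thread:0x0... dead&amp;gt; - t finished
&lt;/code&gt;&lt;/pre&gt;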
&lt;blockquote&gt;
&lt;p&gt;📝 A “thread pool” is a reusable, typically fixed size set of threads. Rather than create brand new threads every time they are needed, a thread pool saves on thread creation by reusing threads to perform operations. They are generally used to save on thread creation cost and limit the number of threads running at a time. &lt;code&gt;concurrent-ruby&lt;/code&gt; comes with several types of thread pools.&lt;/p&gt;
&lt;/blockquote&gt;
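&lt;p&gt;To make the idea concrete, here’s a toy fixed-size pool - a sketch, nothing like the real implementations in Puma or &lt;code&gt;concurrent-ruby&lt;/code&gt; - built on Ruby’s &lt;code&gt;Queue&lt;/code&gt;, covered later in this post:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;queue = Queue.new
workers = 4.times.map do
  Thread.new do
    # pop blocks until a job arrives; a nil job means shut down
    while (job = queue.pop)
      job.call
    end
  end
end
10.times { |i| queue &amp;lt;&amp;lt; -&amp;gt; { puts i } }
4.times { queue &amp;lt;&amp;lt; nil } # one shutdown signal per worker
workers.each(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;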
&lt;h3 id=&#34;thread-error-reporting&#34;&gt;Error reporting&lt;/h3&gt;
&lt;h4 id=&#34;thread-exceptions&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-report_on_exception&#34;&gt;&lt;code&gt;report_on_exception&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-abort_on_exception&#34;&gt;&lt;code&gt;abort_on_exception&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;What happens if something fails in your thread?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;oops!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
sleep &lt;span style=&#34;color:#75715e&#34;&gt;# does the program sleep forever, or raise an error?&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Running that 👆🏼, it just runs forever. Even though our thread has failed, it never impacts the program. We will see an error printed in our console, however:&lt;/p&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code&gt;terminated with exception (report_on_exception is true):
main.rb:2:in `block in &amp;lt;main&amp;gt;`: oops! (RuntimeError)
&lt;/code&gt;&lt;/pre&gt;&lt;p&gt;You can control whether it fails silently, but the default is to report the error. Keep that default - silent errors are not your friend. But if you &lt;em&gt;really&lt;/em&gt; needed to, you can change it globally or with a per thread setting:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# No threads log their errors ☠️ &lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;report_on_exception &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# This individual thread does not log&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;report_on_exception &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;What if you want to report it in the current thread? Similar to &lt;code&gt;join&lt;/code&gt;ing execution with a thread to wait for it to finish, you need to join with it to raise its error:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;oops!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join &lt;span style=&#34;color:#75715e&#34;&gt;# or .value, raises &amp;#34;oops!&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You can also force the thread to raise, even when running independently:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;listen to me!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;abort_on_exception &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# t blows up the program!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This can be set globally as well:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;abort_on_exception &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;listen to me!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# t blows up the program!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Rack timeout uses it in &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb&#34;&gt;its Scheduler thread&lt;/a&gt;, applying it using &lt;code&gt;Thread.current&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;run_loop!&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;abort_on_exception &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In general, you don’t see this much in a production setting. But it could be useful if you’re writing a one-off script and you want any thread that fails to kill the program.&lt;/p&gt;
&lt;h3 id=&#34;thread-tracking&#34;&gt;Tracking threads&lt;/h3&gt;
&lt;p&gt;Can we find out what threads are running?&lt;/p&gt;
&lt;h4 id=&#34;thread-list&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-list&#34;&gt;&lt;code&gt;Thread.list&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;Thread.list&lt;/code&gt; will give you every thread that hasn&amp;rsquo;t already finished (a thread can still finish between the &lt;code&gt;Thread.list&lt;/code&gt; call and inspecting it, which is why the last thread below prints as dead):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new {}
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
	
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... run&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... main.rb:1 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... main.rb:3 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... main.rb:4 dead&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h4 id=&#34;thread-name&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-name&#34;&gt;&lt;code&gt;name&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Ok… but how can you differentiate them? There are a few ways you can achieve that:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;File information&lt;/li&gt;
&lt;li&gt;Setting a name&lt;/li&gt;
&lt;li&gt;Inheritance&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;File information gets printed by default, and shows you the name of the file and what line the thread was started on:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
t3 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
t4 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
	
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:1 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:3 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:4 sleep&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The file info is useful, but often your threads are all started from the same place, so it doesn’t tell you much:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;start_thread&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; start_thread
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; start_thread
t3 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; start_thread
t4 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; start_thread
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
	
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In that case, you can set a &lt;code&gt;name&lt;/code&gt; on each thread to differentiate them:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;main&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;main&amp;#34;&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;first!&amp;#34;&lt;/span&gt;
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;second!&amp;#34;&lt;/span&gt;
t3&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;third!&amp;#34;&lt;/span&gt;
t4&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;fourth!&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
	
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@first!  main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@second! main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@third!  main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@fourth! main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The &lt;a href=&#34;https://github.com/ruby-concurrency/concurrent-ruby&#34;&gt;concurrent-ruby&lt;/a&gt; gem uses this to help differentiate threads created in its &lt;a href=&#34;https://github.com/ruby-concurrency/concurrent-ruby/blob/master/lib/concurrent-ruby/concurrent/executor/ruby_thread_pool_executor.rb&#34;&gt;ThreadPoolExecutor&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# concurrent/executor/ruby_thread_pool_executor.rb&lt;/span&gt;
@thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;pool&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;worker&amp;#39;&lt;/span&gt;, id&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;compact&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;-&amp;#39;&lt;/span&gt;)
&lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
	
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;concurrent&amp;#34;&lt;/span&gt;
	
pool &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Concurrent&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;ThreadPoolExecutor&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(
  name: &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;🤖&amp;#34;&lt;/span&gt;
)
pool&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;post { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
pool&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;post { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
pool&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;post { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
pool&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;post { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@🤖-worker-1...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@🤖-worker-2...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@🤖-worker-3...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...@🤖-worker-4...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The &lt;a href=&#34;https://github.com/honeybadger-io/honeybadger-ruby&#34;&gt;Honeybadger gem&lt;/a&gt; runs a &lt;a href=&#34;https://github.com/honeybadger-io/honeybadger-ruby/blob/master/lib/honeybadger/worker.rb&#34;&gt;background thread&lt;/a&gt; when sending errors to their error reporting service. To differentiate their thread, they use inheritance. Each thread’s inspect output includes its class, which makes it easy to identify:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;module&lt;/span&gt; Honeybadger
  &lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Worker&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Thread&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;; &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Honeybadger&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Worker&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0... run&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Honeybadger::Worker::Thread:0x0... main.rb:119 sleep&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;———&lt;/p&gt;
&lt;h4 id=&#34;thread-statuses&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-status&#34;&gt;&lt;code&gt;status&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-alive-3F&#34;&gt;&lt;code&gt;alive?&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-stop-3F&#34;&gt;&lt;code&gt;stop?&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Can you keep track of the status of a running thread? Yep! Threads operate in one of 5 states - they’re not the &lt;em&gt;most&lt;/em&gt; intuitive, but they’re what you’ve got:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;“run”&lt;/code&gt; - the thread is running&lt;/li&gt;
&lt;li&gt;&lt;code&gt;“sleep”&lt;/code&gt; - the thread is “sleeping”: a blocking operation is in progress, the thread put itself to sleep, or the thread scheduler put it to sleep&lt;/li&gt;
&lt;li&gt;&lt;code&gt;“aborting”&lt;/code&gt; - the thread has failed but hasn’t finished running yet&lt;/li&gt;
&lt;li&gt;&lt;code&gt;nil&lt;/code&gt; - an error was raised and the thread is dead&lt;/li&gt;
&lt;li&gt;&lt;code&gt;false&lt;/code&gt; - the thread finished normally&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Let’s demonstrate some statuses:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;bye bye&amp;#34;&lt;/span&gt;) }
b &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop }
c &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new {}
d &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { 
  &lt;span style=&#34;color:#66d9ef&#34;&gt;IO&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;select(&lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;)
}
d&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) &lt;span style=&#34;color:#75715e&#34;&gt;# wait on d for 1 second&lt;/span&gt;
puts a&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;class          &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; NilClass&lt;/span&gt;
puts b&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status                &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; &amp;#34;sleep&amp;#34;&lt;/span&gt;
puts c&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status                &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; false&lt;/span&gt;
puts d&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status                &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; &amp;#34;sleep&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status   &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; &amp;#34;run&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You’ll notice a few things about the above:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;I didn’t show “aborting”. That’s because I’m not sure regular code could ever see that status. Catching a failed thread that hasn’t finished yet seems pretty hard to do. Internally the CRuby thread needs to be in a &lt;code&gt;to_kill&lt;/code&gt; state. I would love it if someone knows a way to demonstrate it, though! It would perhaps require the assistance of a core CRuby wizard 🧙‍♀️.&lt;/li&gt;
&lt;li&gt;“sleep” is not specific to the &lt;code&gt;sleep&lt;/code&gt; method. In thread &lt;code&gt;d&lt;/code&gt; we are making an &lt;code&gt;IO.select&lt;/code&gt; call that takes three seconds. So the thread blocks while waiting, hence it is “sleep”ing.&lt;/li&gt;
&lt;li&gt;Since you can’t run Ruby code in multiple threads in parallel, you’d &lt;em&gt;pretty&lt;/em&gt; much only ever see “run” on the current, active thread.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;It would be nice if the information were a bit more readable. We can put together a little helper to make the output clearer. We’ll also add in one more internal status not directly exposed by &lt;code&gt;status&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;ThreadStatus&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Data&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;define(
  &lt;span style=&#34;color:#e6db74&#34;&gt;:status&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;:error&lt;/span&gt;
)
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;thread_status&lt;/span&gt;(thread)
  error &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;
  status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;case&lt;/span&gt; thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status
    &lt;span style=&#34;color:#66d9ef&#34;&gt;when&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;NilClass&lt;/span&gt;
      error &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;begin&lt;/span&gt;
        thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
      &lt;span style=&#34;color:#66d9ef&#34;&gt;rescue&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; e
        e
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
      &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;failed w/ error: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;error&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;when&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;FalseClass&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;then&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;finished&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;when&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;run&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;then&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;running&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;when&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;sleep&amp;#34;&lt;/span&gt;
      parse_thread_sleep_status(thread)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;status
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ThreadStatus&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;status&lt;/span&gt;:, &lt;span style=&#34;color:#e6db74&#34;&gt;error&lt;/span&gt;:)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;parse_thread_sleep_status&lt;/span&gt;(thread)
  status &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;to_s
  status&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;status&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;index(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;sleep&amp;#34;&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;..-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;sub(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;sleep&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;sleeping&amp;#34;&lt;/span&gt;
  )
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# our previous thread code... then...&lt;/span&gt;
puts thread_status(a)
puts thread_status(b)
puts thread_status(c)
puts thread_status(d)
puts thread_status(&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current)
	
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: bye bye&amp;#34;, error=#&amp;lt;RuntimeError: bye bye&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;sleeping_forever&amp;#34;, error=nil&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;finished&amp;#34;, error=nil&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;sleeping&amp;#34;, error=nil&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;running&amp;#34;, error=nil&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Using our new helper, we get a bit more readability and depth into our thread statuses.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Using &lt;code&gt;join&lt;/code&gt; we are able to return more information about the failed thread - “failed” instead of nil, and the actual error it failed with&lt;/li&gt;
&lt;li&gt;We return “finished” instead of false for a successful finish&lt;/li&gt;
&lt;li&gt;&lt;code&gt;sleep_forever&lt;/code&gt; lets us differentiate an actively blocked thread (like one doing &lt;code&gt;IO.select&lt;/code&gt;) from a thread that is actually stopped and won’t run again without intervention. We’ll talk more about &lt;code&gt;Thread.stop&lt;/code&gt; in the next section&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In addition to &lt;code&gt;status&lt;/code&gt;, we can also use &lt;code&gt;alive?&lt;/code&gt; and &lt;code&gt;stop?&lt;/code&gt; to check on a thread’s status:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new {}
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; {} }
t3 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop }
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;quick&amp;#34;&lt;/span&gt;
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;slow&amp;#34;&lt;/span&gt;
t3&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;on ice&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;t, t2, t3&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;thread&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# make sure it gets a chance to run&lt;/span&gt;
  thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;01&lt;/span&gt;)
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;name&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;: &amp;#34;&lt;/span&gt; \
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;alive? &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;alive?&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;, &amp;#34;&lt;/span&gt; \
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;stopped? &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop?&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# quick:  alive? false, stopped? true&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# slow:   alive? true,  stopped? false&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# on ice: alive? true,  stopped? true&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;thread-scheduling&#34;&gt;Thread Scheduling&lt;/h3&gt;
&lt;p&gt;There are several methods for either taking direct action, or suggesting action to the thread scheduler. Before using them, keep in mind that &lt;strong&gt;you’re probably not smarter than the Ruby thread scheduler&lt;/strong&gt;. It tries to do what makes the most sense for the runtime, and it’s been tuned extensively. But these tools exist, and they get used, so let’s discuss them a bit.&lt;/p&gt;
&lt;h4 id=&#34;thread-schedule-methods&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-pass&#34;&gt;&lt;code&gt;Thread.pass&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-wakeup&#34;&gt;&lt;code&gt;wakeup&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-stop&#34;&gt;&lt;code&gt;Thread.stop&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-run&#34;&gt;&lt;code&gt;run&lt;/code&gt;&lt;/a&gt;, and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-priority&#34;&gt;&lt;code&gt;priority&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;pass&lt;/code&gt; and &lt;code&gt;wakeup&lt;/code&gt; are kind of like nudges to the runtime. They request a particular action, but the scheduler does not have to honor them. &lt;code&gt;Thread.pass&lt;/code&gt; tells the thread scheduler it can “pass” control to another thread:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pass
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;hi!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;bye!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# Most of the time you will see:&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# but it&amp;#39;s not guaranteed&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;wakeup&lt;/code&gt; marks a thread as eligible for scheduling. It’s up to the thread scheduler whether that happens:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;hi!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wakeup
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;Thread.stop&lt;/code&gt; and &lt;code&gt;run&lt;/code&gt; are more direct commands to the thread scheduler:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  now &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Seconds slept: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# Seconds slept: 1.000076481&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Only one second has passed, but &lt;code&gt;run&lt;/code&gt; caused the &lt;code&gt;sleep&lt;/code&gt; to finish early.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;done!&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
puts thread_status(t)
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;alive? &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;alive?&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;stopped? &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop?&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;sleeping_forever&amp;#34;&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# alive? true&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# stopped? true&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# done!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;priority&lt;/code&gt; gives the scheduler a hint about which thread should be given more runtime. The thread docs have a good example of this:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;count1 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; count2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
a &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; { count1 &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
a&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;priority &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
b &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; { count2 &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt; }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
b&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;priority &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
puts count1 &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; 21472634&lt;/span&gt;
puts count2 &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; 14256235&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The threads run forever, but thread &lt;code&gt;a&lt;/code&gt; gets higher priority so it adds to the counter more often.&lt;/p&gt;
&lt;p&gt;For an open source example of &lt;code&gt;run&lt;/code&gt;, you can check the &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb&#34;&gt;&lt;code&gt;rack-timeout&lt;/code&gt; scheduler&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;schedule&lt;/span&gt;(event)
  @mx_events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { 
    @events &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; event 
  }
  runner&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run  &lt;span style=&#34;color:#75715e&#34;&gt;# wakes up the runner thread so it can recalculate sleep length taking this new event into consideration&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; event
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;Thread.pass&lt;/code&gt; is what Mike Perham recommends if you have &lt;a href=&#34;https://github.com/sidekiq/sidekiq/discussions/5039&#34;&gt;jobs hogging CPU&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ExpensiveJob&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;include&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Sidekiq&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Job&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;perform&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# expensive stuff&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pass &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; occasional_condition
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;thread-shutdown&#34;&gt;Thread Shutdown&lt;/h3&gt;
&lt;h4 id=&#34;thread-raise-kill&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-raise&#34;&gt;&lt;code&gt;raise&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-kill&#34;&gt;&lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ab73de82d9.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;You want to kill… me? 🥺&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;⚠️ &lt;strong&gt;TL;DR&lt;/strong&gt; You shouldn’t use these methods unless you &lt;em&gt;really&lt;/em&gt; know what you’re doing. Instead, &lt;a href=&#34;#interrupt-safely&#34;&gt;interrupt your thread safely&lt;/a&gt;. Incidentally, you should also &lt;a href=&#34;#dont-use-timeout&#34;&gt;avoid the timeout module&lt;/a&gt;. We’ll dig deep into &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt; in the next post on “Interrupting Threads”.&lt;/p&gt;
&lt;p&gt;&lt;strong id=&#34;interrupt-safely&#34;&gt;Interrupt your thread safely&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Instead of killing your thread, set it up to be interruptible. Most mature, threaded frameworks operate this way.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;still_kickin &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Concurrent&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;AtomicBoolean&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; still_kickin&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;true?
    &lt;span style=&#34;color:#75715e&#34;&gt;# more work!&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
still_kickin&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;make_false
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;strong id=&#34;dont-use-timeout&#34;&gt;Don&amp;rsquo;t use &lt;code&gt;timeout&lt;/code&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If you see this in code, be concerned:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;timeout&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Timeout&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;timeout(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# 😱 &lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;For some reason, the &lt;a href=&#34;https://github.com/ruby/timeout&#34;&gt;&lt;code&gt;timeout&lt;/code&gt;&lt;/a&gt; gem itself doesn’t warn about any issues. But &lt;a href=&#34;https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/&#34;&gt;Mike Perham summarizes it best&lt;/a&gt;:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/727697a3-b461-4705-a1f1-8d0ed257857e.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Nothing exactly matches what timeout offers: a blanket way of timing out &lt;em&gt;any&lt;/em&gt; operation after a specified time limit. But instead of reaching for the &lt;code&gt;timeout&lt;/code&gt; gem, check out a repository called &lt;a href=&#34;https://github.com/ankane/the-ultimate-guide-to-ruby-timeouts&#34;&gt;The Ultimate Guide to Ruby Timeouts&lt;/a&gt;. It shows you how to safely set timeouts for basically &lt;em&gt;every&lt;/em&gt; blocking operation you could care about timing out. For instance, how to properly handle timeouts using the &lt;a href=&#34;https://github.com/redis/redis-rb&#34;&gt;&lt;code&gt;redis&lt;/code&gt;&lt;/a&gt; gem:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Redis&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(
  &lt;span style=&#34;color:#e6db74&#34;&gt;connect_timeout&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, 
  &lt;span style=&#34;color:#e6db74&#34;&gt;timeout&lt;/span&gt;: &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There’s one piece mentioned in that repository you should leave alone: &lt;code&gt;Net::HTTP&lt;/code&gt;’s &lt;code&gt;open_timeout&lt;/code&gt;. Behind the scenes it uses the &lt;code&gt;timeout&lt;/code&gt; module 🙅‍♂️. Leave the 60 second default alone; it should almost never impact you, and you’re probably worse off lowering it.&lt;/p&gt;
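&lt;p&gt;For &lt;code&gt;Net::HTTP&lt;/code&gt; specifically, that means using its own &lt;code&gt;read_timeout&lt;/code&gt; and &lt;code&gt;write_timeout&lt;/code&gt;, neither of which relies on the &lt;code&gt;timeout&lt;/code&gt; module, while leaving &lt;code&gt;open_timeout&lt;/code&gt; at its default. A minimal sketch (my own, against a placeholder host):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;net/http&amp;#34;&lt;/span&gt;
	
http = Net::HTTP.new(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;example.com&amp;#34;&lt;/span&gt;, 443)
http.use_ssl = true
http.read_timeout = 5  &lt;span style=&#34;color:#75715e&#34;&gt;# raises Net::ReadTimeout if a read blocks too long&lt;/span&gt;
http.write_timeout = 5 &lt;span style=&#34;color:#75715e&#34;&gt;# raises Net::WriteTimeout (Ruby 2.6+)&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# http.open_timeout stays at its 60 second default&lt;/span&gt;
	
http.start &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; |conn|
  puts conn.get(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;/&amp;#34;&lt;/span&gt;).code
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;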
&lt;p&gt;———&lt;/p&gt;
&lt;h4 id=&#34;thread-handle-interrupt&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-handle_interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;A thread can be externally “interrupted” by a few things:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;Thread#kill&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Thread#raise&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Your program being exited&lt;/li&gt;
&lt;li&gt;A signal, like Ctrl+C&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;code&gt;handle_interrupt&lt;/code&gt; gives you the ability to control how your program reacts to 1-3&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Because it primarily matters in the context of &lt;code&gt;raise&lt;/code&gt; and &lt;code&gt;kill&lt;/code&gt;, we’ll discuss it in the next post on “Interrupting Threads”.&lt;/p&gt;
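&lt;p&gt;As a tiny preview, here’s a minimal sketch (my own, modeled on the deferral pattern from the docs) where a &lt;code&gt;Thread#raise&lt;/code&gt; is masked during a critical section, then allowed to fire:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t = Thread.new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  Thread.handle_interrupt(RuntimeError =&amp;gt; :never) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    sleep 1 &lt;span style=&#34;color:#75715e&#34;&gt;# pretend this is critical work; the raise below is deferred&lt;/span&gt;
    Thread.handle_interrupt(RuntimeError =&amp;gt; :immediate) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      sleep &lt;span style=&#34;color:#75715e&#34;&gt;# the pending interrupt fires once allowed&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;rescue&lt;/span&gt; RuntimeError =&amp;gt; e
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;interrupted: #{e.message}&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t.join(0.1) &lt;span style=&#34;color:#75715e&#34;&gt;# let the thread enter the :never block&lt;/span&gt;
t.raise(RuntimeError, &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;stop!&amp;#34;&lt;/span&gt;)
t.join
&lt;span style=&#34;color:#75715e&#34;&gt;# interrupted: stop!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;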
&lt;p&gt;———&lt;/p&gt;
&lt;h4 id=&#34;process-fork&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/process#method-c-_fork&#34;&gt;&lt;code&gt;Process._fork&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;Process._fork&lt;/code&gt; isn’t a thread API, but it’s good to be aware of for your threaded code.&lt;/p&gt;
&lt;p&gt;What happens to a thread when a process forks?&lt;/p&gt;
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-fork.drawio.png&#34; width=&#34;519&#34; height=&#34;220&#34; alt=&#34;&#34;&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep }
fork &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;inside fork: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;thread_status(t)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;outside fork: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;thread_status(t)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# outside fork: #&amp;lt;ThreadStatus status=&amp;#34;running&amp;#34;&amp;gt;,  pid: 362&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# inside fork:  #&amp;lt;ThreadStatus status=&amp;#34;finished&amp;#34;&amp;gt;, pid: 367&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You can’t bring your threads with you when you fork. But you can recreate them, using &lt;code&gt;_fork&lt;/code&gt;!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;module&lt;/span&gt; OnFork
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;_fork&lt;/span&gt;
    pid &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;super&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; pid &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# your code to restart threads&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    pid
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;singleton_class&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;prepend(&lt;span style=&#34;color:#66d9ef&#34;&gt;OnFork&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It’s a bit of a strange setup. You’re hooking into the inheritance chain for &lt;code&gt;Process._fork&lt;/code&gt;, so you need to call &lt;code&gt;super&lt;/code&gt; directly. No one calls &lt;code&gt;_fork&lt;/code&gt; directly; &lt;code&gt;super&lt;/code&gt; ultimately returns the result of &lt;code&gt;fork&lt;/code&gt; itself. If the result is &lt;code&gt;0&lt;/code&gt;, we’re in the forked child process, which means we can perform any kind of post-fork action. In the case of managing threads, that would involve recreating them.&lt;/p&gt;
&lt;p&gt;The &lt;code&gt;connection_pool&lt;/code&gt; gem uses this hook to run an &lt;code&gt;after_fork&lt;/code&gt; method, which closes out connections.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;module&lt;/span&gt; ForkTracker
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;_fork&lt;/span&gt;
    pid &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;super&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; pid &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;ConnectionPool&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;after_fork
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    pid
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;singleton_class&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;prepend(
  &lt;span style=&#34;color:#66d9ef&#34;&gt;ForkTracker&lt;/span&gt;
)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The &lt;a href=&#34;https://github.com/redis-rb/redis-client/blob/master/lib/redis_client.rb&#34;&gt;&lt;code&gt;redis-client&lt;/code&gt; gem&lt;/a&gt; uses &lt;code&gt;_fork&lt;/code&gt; to track the &lt;code&gt;pid&lt;/code&gt; in &lt;code&gt;PIDCache&lt;/code&gt;, so it can determine whether it needs to close the inherited socket (threads are not inherited when forking, but file descriptors are).&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ensure_connected&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;retryable&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;)
  close &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;config&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;inherit_socket &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&amp;amp;&lt;/span&gt; @pid &lt;span style=&#34;color:#f92672&#34;&gt;!=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;PIDCache&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pid
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;thread-coordination&#34;&gt;Coordinating Threads&lt;/h3&gt;
&lt;p&gt;Now that we know the different methods of interacting with a thread directly, how can we coordinate threads together safely?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 If you can avoid it, don’t coordinate at all! Immutable structures and isolated work are your friends (a quick sketch follows below).&lt;/p&gt;
&lt;/blockquote&gt;
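&lt;p&gt;As a minimal sketch of isolated work: give each thread its own inputs, let it hand its result back through &lt;code&gt;value&lt;/code&gt;, and combine the results afterwards. Nothing is shared, so there’s nothing to coordinate:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# each thread squares its own number; no shared state to protect&lt;/span&gt;
threads = [1, 2, 3, 4].map &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; |n|
  Thread.new { n * n }
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts threads.sum(&amp;amp;:value) &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; 30&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;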
&lt;h4 id=&#34;mutex&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/mutex&#34;&gt;&lt;code&gt;Mutex&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;Mutex&lt;/code&gt; is the core thread coordination primitive in Ruby. It stands for &lt;strong&gt;mut&lt;/strong&gt;ual &lt;strong&gt;ex&lt;/strong&gt;clusion, and it allows you to restrict access to a particular resource to a single thread at a time. A thread or fiber acquires a lock on the mutex, and it is the only thing that can unlock that mutex.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 If you know database locks, a &lt;code&gt;Mutex&lt;/code&gt; basically operates like an exclusive lock.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 You’re better off not sharing objects, it keeps things simpler. I’ll reference my own note from “Your Ruby programs are always multi-threaded: Part 2”:&lt;/p&gt;
&lt;p&gt;My personal metric is that the right amount of mutexes in my code is zero. If I am using a mutex, I think hard to figure out a way to avoid it because it means I’m opening up myself and future devs to a lot of cognitive overhead: you need to think critically anytime you make a change relating to mutex code.&lt;/p&gt;
&lt;p&gt;If you’re a library or framework author they may be unavoidable at some point to do interesting or useful things. In my own application code, I can pretty much &lt;em&gt;always&lt;/em&gt; avoid them.&lt;/p&gt;
&lt;/blockquote&gt;
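&lt;p&gt;First, a minimal sketch (my own) of the problem a &lt;code&gt;Mutex&lt;/code&gt; solves: several threads bumping a shared counter. &lt;code&gt;synchronize&lt;/code&gt; turns the read-increment-write into one exclusive step:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;counter = 0
mutex = Mutex.new
threads = 10.times.map &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  Thread.new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    100_000.times &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# without the lock, two threads can read the same value&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# and one of the increments gets lost&lt;/span&gt;
      mutex.synchronize { counter += 1 }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
threads.each(&amp;amp;:join)
puts counter &lt;span style=&#34;color:#75715e&#34;&gt;#=&amp;gt; 1000000&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;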
&lt;p&gt;Earlier we used a class from &lt;code&gt;concurrent-ruby&lt;/code&gt; called &lt;code&gt;AtomicBoolean&lt;/code&gt; to implement an interruptible thread. What if we wanted to implement it ourselves?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;AtomicBoolean&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(default &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;)
    @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; default
    @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;true?&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;false?&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;make_true&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt; } 
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;make_false&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;To make sure our data stays consistent, we &lt;code&gt;synchronize&lt;/code&gt; every access. That way we know we can’t corrupt anything and we have a consistent view of &lt;code&gt;@value&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;still_kickin &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;AtomicBoolean&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; still_kickin&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;true?
    &lt;span style=&#34;color:#75715e&#34;&gt;# more work!&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
still_kickin&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;make_false
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It doesn’t guarantee things happen in any expected &lt;em&gt;order&lt;/em&gt;, but it’s a foolproof way of getting safe access and a consistent view of the value.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 Truthfully, CRuby doesn’t need this kind of corruption guarantee. But true parallel Ruby runtimes like JRuby and TruffleRuby do.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;code&gt;synchronize&lt;/code&gt; is probably the only method you’ll find yourself using on a &lt;code&gt;Mutex&lt;/code&gt;. Look in different projects and that’s 95% of all &lt;code&gt;Mutex&lt;/code&gt; usage. But there are more methods available you may see on occasion:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;lock&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;unlock&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;try_lock&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;owned?&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;locked?&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;sleep&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;code&gt;lock&lt;/code&gt; and &lt;code&gt;unlock&lt;/code&gt; can be used to recreate what &lt;code&gt;synchronize&lt;/code&gt; does:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;synchronize&lt;/span&gt;(mutex)
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;lock
  &lt;span style=&#34;color:#66d9ef&#34;&gt;yield&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;unlock
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
synchronize(mutex) &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# locked work&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-mutex.drawio.png&#34; width=&#34;435&#34; height=&#34;105&#34; alt=&#34;&#34;&gt;
&lt;p&gt;&lt;code&gt;try_lock&lt;/code&gt; lets you attempt a lock without blocking. When you call &lt;code&gt;lock&lt;/code&gt; or &lt;code&gt;synchronize&lt;/code&gt;, your code will block until you are able to acquire a lock:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;lock
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; {} &lt;span style=&#34;color:#75715e&#34;&gt;# runs forever, never releasing&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;unlock
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#75715e&#34;&gt;# do some work&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join &lt;span style=&#34;color:#75715e&#34;&gt;# t never releases the lock, so t2 blocks forever&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;But &lt;code&gt;try_lock&lt;/code&gt; returns immediately with a boolean indicating whether the lock was acquired:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;lock; &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; {} }
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;try_lock
    &lt;span style=&#34;color:#75715e&#34;&gt;# do some work&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;raise&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Couldn&amp;#39;t acquire the lock!&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;unlock &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;owned?
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join &lt;span style=&#34;color:#75715e&#34;&gt;# tries to acquire the lock, and raises an error because it can&amp;#39;t&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Notice the &lt;code&gt;owned?&lt;/code&gt; call in the &lt;code&gt;ensure&lt;/code&gt;? We don’t know if the lock was successfully acquired, so we only call &lt;code&gt;unlock&lt;/code&gt; if the lock is &lt;code&gt;owned?&lt;/code&gt;. Being &lt;code&gt;owned?&lt;/code&gt; means the current thread successfully acquired the lock, and is the current “owner”. If you call &lt;code&gt;unlock&lt;/code&gt; from a thread that isn’t the owner, an error is raised.&lt;/p&gt;
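&lt;p&gt;A quick sketch of that failure mode - calling &lt;code&gt;unlock&lt;/code&gt; from a thread that doesn’t own the lock raises a &lt;code&gt;ThreadError&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex = Mutex.new
mutex.lock # the main thread is now the owner
Thread.new { mutex.unlock }.join
# `unlock&amp;#39;: Attempt to unlock a mutex which is locked by another thread/fiber (ThreadError)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;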
&lt;p&gt;&lt;code&gt;locked?&lt;/code&gt; allows you to check if the lock is owned by &lt;em&gt;some&lt;/em&gt; thread. It’s inherently racy - the answer can change the moment you receive it - but you can use it to decide whether an action is needed. The &lt;a href=&#34;https://github.com/aws/aws-sdk-ruby/blob/8a53163418da7273eb990740b78174c2480b5eef/gems/aws-sdk-core/lib/aws-sdk-core/refreshing_credentials.rb#L73&#34;&gt;&lt;code&gt;aws-sdk-core&lt;/code&gt;&lt;/a&gt; gem uses it to determine whether to create a thread for an “async refresh”. If a thread has already started and is refreshing, &lt;code&gt;locked?&lt;/code&gt; will be &lt;code&gt;true&lt;/code&gt; and no new thread will be created:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;unless&lt;/span&gt; @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;locked?
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# refresh async&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Last we have &lt;code&gt;sleep(timeout = nil)&lt;/code&gt;, which releases the lock and sleeps for &lt;code&gt;timeout&lt;/code&gt; seconds, or sleeps indefinitely if given &lt;code&gt;nil&lt;/code&gt;. This is exactly what the &lt;a href=&#34;https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb#L90&#34;&gt;&lt;code&gt;rack-timeout&lt;/code&gt; gem uses&lt;/a&gt; internally in its &lt;code&gt;Scheduler&lt;/code&gt; class to schedule request timeouts:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;
  @mx_events &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
  &lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;run_loop!&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# begin event reader loop&lt;/span&gt;
    @mx_events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize {
      @events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;reject!(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:cancelled?&lt;/span&gt;)
      sleep_for &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; @events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:monotime&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;min
      @mx_events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;sleep sleep_for
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It acquires a lock using &lt;code&gt;synchronize&lt;/code&gt; to safely operate on the &lt;code&gt;@events&lt;/code&gt; array. It then finds the event with the shortest wait time, and &lt;code&gt;sleep&lt;/code&gt;s the mutex for that period. That way other events can be added to the &lt;code&gt;@events&lt;/code&gt; array using the appropriate lock, even while waiting. This supports the scheduler interface:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;rack-timeout&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Rack&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Timeout&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run_in(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;I did a thing last!&amp;#34;&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run_in(&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;whoop whoop, i&amp;#39;m second&amp;#34;&lt;/span&gt; }
&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run_in(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;yowza, i&amp;#39;m first&amp;#34;&lt;/span&gt; }
&lt;span style=&#34;color:#75715e&#34;&gt;# yowza, i&amp;#39;m first&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# whoop whoop, i&amp;#39;m second&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# I did a thing last!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;When you call &lt;code&gt;run_in&lt;/code&gt;, a new event is appended to the &lt;code&gt;@events&lt;/code&gt; array. &lt;code&gt;run&lt;/code&gt; is called on the thread with the &lt;code&gt;sleep&lt;/code&gt;ing mutex, which causes &lt;code&gt;@mx_events.sleep&lt;/code&gt; to wake up and &lt;code&gt;run_loop!&lt;/code&gt; to iterate again, checking for any events to fire and scheduling the shortest event duration to wait again using &lt;code&gt;@mx_events.sleep&lt;/code&gt;.&lt;/p&gt;
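&lt;p&gt;To see that wake-up mechanism in isolation, here’s a minimal sketch (my own, not from &lt;code&gt;rack-timeout&lt;/code&gt;) of &lt;code&gt;Thread#run&lt;/code&gt; cutting a &lt;code&gt;Mutex#sleep&lt;/code&gt; short:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex = Mutex.new
t = Thread.new do
  mutex.synchronize do
    slept = mutex.sleep(10) # releases the lock, sleeps up to 10 seconds
    puts &amp;#34;slept for #{slept}s&amp;#34;
  end
end
sleep 1
t.run # wakes the sleeping thread early; it re-acquires the lock and continues
t.join
# slept for 1s
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And here’s the &lt;code&gt;schedule&lt;/code&gt; method that performs that wake-up:&lt;/p&gt;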
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;schedule&lt;/span&gt;(event)
  @mx_events&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @events &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; event }
  runner&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run  &lt;span style=&#34;color:#75715e&#34;&gt;# wakes up the runner thread so it can recalculate sleep length taking this new event into consideration&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; event
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;As with a database lock, the shorter you hold a mutex lock, the better for performance.&lt;/p&gt;
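&lt;p&gt;As a hypothetical sketch (&lt;code&gt;expensive_work&lt;/code&gt; standing in for any slow computation), prefer doing slow work outside the lock and only &lt;code&gt;synchronize&lt;/code&gt; the shared-state update:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex = Mutex.new
results = []

# Slower: the lock is held for the entire expensive call
mutex.synchronize { results &amp;lt;&amp;lt; expensive_work }

# Faster: compute first, then lock only long enough to append
value = expensive_work
mutex.synchronize { results &amp;lt;&amp;lt; value }
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;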
&lt;h4 id=&#34;condition-variable&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/conditionvariable&#34;&gt;&lt;code&gt;ConditionVariable&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Similar to &lt;code&gt;Mutex#sleep&lt;/code&gt;, a &lt;code&gt;ConditionVariable&lt;/code&gt;’s purpose is to let you release a lock and sleep - you do that using the &lt;code&gt;wait&lt;/code&gt; method. The difference is that it provides a direct communication mechanism for waking up: &lt;code&gt;signal&lt;/code&gt; and &lt;code&gt;broadcast&lt;/code&gt;. Let’s look at a small example - in it, the &lt;code&gt;wait&lt;/code&gt; won’t re-acquire the lock until we call &lt;code&gt;signal&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;hi!&amp;#34;&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;bye!&amp;#34;&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;)
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;how are you?&amp;#34;&lt;/span&gt;
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;still waiting?&amp;#34;&lt;/span&gt;
condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# how are you?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;signal&lt;/code&gt; will only notify a single thread, whereas &lt;code&gt;broadcast&lt;/code&gt; will notify all threads. Let’s create two threads and try &lt;code&gt;signal&lt;/code&gt; with both:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;waiter&lt;/span&gt;(mutex, condition)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;hi!&amp;#34;&lt;/span&gt;
      condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;bye!&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; waiter(mutex, condition)
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; waiter(mutex, condition)
	
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;)
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;how are you?&amp;#34;&lt;/span&gt;
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;)
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;still waiting?&amp;#34;&lt;/span&gt;
	
condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# how are you?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We never see the second thread say “bye!”, because only a single &lt;code&gt;signal&lt;/code&gt; call has been made. A &lt;code&gt;signal&lt;/code&gt; call attempts to wake up a single thread. If you try to join the second thread, Ruby will detect a deadlock condition because the &lt;code&gt;wait&lt;/code&gt; will never finish:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# how are you?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:in `join&amp;#39;: No live threads left. Deadlock? (fatal)&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 2 threads, 2 sleeps current:0x0000000001df10f0 main thread:0x0000000001df10f0&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# * #&amp;lt;Thread:0x00007f9373bba9c8 sleep_forever&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#    rb_thread_t:0x0000000001df10f0 native:0x00007f938d474300 int:0   &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# * #&amp;lt;Thread:0x00007f9371c124b8 main.rb:112 sleep_forever&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#    rb_thread_t:0x00000000026ac400 native:0x00007f9371ace6c0 int:0&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#     depended by: tb_thread_id:0x0000000001df10f0&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;There are probably cases where &lt;code&gt;signal&lt;/code&gt; makes sense - only one thread can acquire the lock so it’s cheaper to wake up a single thread than to wake up every thread. But &lt;code&gt;broadcast&lt;/code&gt; covers more scenarios, and fixes our two thread example:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;broadcast
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
t2&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# hi!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# how are you?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# still waiting?&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# bye!&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;A &lt;code&gt;ConditionVariable&lt;/code&gt; can only &lt;code&gt;wait&lt;/code&gt; on a locked mutex:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: Attempt to lock a mutex which is unlocked&amp;#34;, error=#&amp;lt;ThreadError:...&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And only from the thread that owns the mutex:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
condition &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;
mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    condition&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(mutex)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
puts thread_status(t)
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;ThreadStatus status=&amp;#34;failed w/ error: Attempt to unlock a mutex which is locked by another thread/fiber&amp;#34;, error=#&amp;lt;ThreadError:...&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We can use &lt;code&gt;ConditionVariable#wait&lt;/code&gt; with a timeout in seconds, so we can also recreate the &lt;code&gt;Mutex#sleep&lt;/code&gt; &lt;code&gt;Scheduler&lt;/code&gt; code from &lt;code&gt;rack-timeout&lt;/code&gt; (in a more basic form):&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Scheduler&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Schedule&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Data&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;define(&lt;span style=&#34;color:#e6db74&#34;&gt;:block&lt;/span&gt;, &lt;span style=&#34;color:#e6db74&#34;&gt;:time&lt;/span&gt;)
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;
    @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
    @cond &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
    @schedules &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[]&lt;/span&gt;
    start
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;start&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
        @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
          schedule &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; @schedules&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;min_by { &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;s&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt; s&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;time }
          &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; schedule
            now &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now
            sleep_duration &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; schedule&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;time &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; now
	
            &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; sleep_duration &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
              @cond&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(@mutex, sleep_duration)
            &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
            &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;=&lt;/span&gt; schedule&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;time
              schedule&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;block&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call
              @schedules&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;delete(schedule)
            &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
          &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt;
            @cond&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(@mutex)
          &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;schedule&lt;/span&gt;(seconds, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;block)
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      target_time &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; seconds
      @schedules &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Schedule&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#e6db74&#34;&gt;block&lt;/span&gt;:, &lt;span style=&#34;color:#e6db74&#34;&gt;time&lt;/span&gt;: target_time)
      @cond&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;signal
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
s &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
puts &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now
s&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;schedule(&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;1! &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt; }
s&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;schedule(&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;2! &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt; }
s&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;schedule(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;) { puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;3! &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt; }
sleep
&lt;span style=&#34;color:#75715e&#34;&gt;# 2024-08-26 21:58:23 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 1! 2024-08-26 21:58:24 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 2! 2024-08-26 21:58:26 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# 3! 2024-08-26 21:58:28 +0000&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: part 2&lt;/a&gt;, we looked at an example of coordinating threads using a “CountdownLatch”.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;CountdownLatch&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(count)
    @count &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; count
    @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
    @cond &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ConditionVariable&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;wait&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      @cond&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait(@mutex) &lt;span style=&#34;color:#66d9ef&#34;&gt;while&lt;/span&gt; @count &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;count&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @count }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;count_down&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      @count &lt;span style=&#34;color:#f92672&#34;&gt;-=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; @count &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
        @cond&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;broadcast
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
      @count
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;📝 Quick reminder that &lt;code&gt;concurrent-ruby&lt;/code&gt; comes with a countdown latch so there’s no need to use this one ☝🏼. It’s just educational. It is very similar to the concurrent-ruby version though!&lt;/p&gt;
&lt;/blockquote&gt;
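&lt;p&gt;For reference, here’s a minimal sketch of the &lt;code&gt;concurrent-ruby&lt;/code&gt; equivalent, &lt;code&gt;Concurrent::CountDownLatch&lt;/code&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &amp;#34;concurrent&amp;#34;

latch = Concurrent::CountDownLatch.new(2)
2.times do
  Thread.new do
    # do some work, then count down
    latch.count_down
  end
end
latch.wait # blocks until the count reaches 0
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;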
&lt;p&gt;We’ve seen how &lt;code&gt;wait&lt;/code&gt; and &lt;code&gt;broadcast&lt;/code&gt; work. But why do we need that &lt;code&gt;while @count &amp;gt; 0&lt;/code&gt; check? Shouldn’t it &lt;em&gt;only&lt;/em&gt; get woken up when &lt;code&gt;@count == 0&lt;/code&gt; and &lt;code&gt;@cond.broadcast&lt;/code&gt; is called? Unfortunately, &lt;code&gt;Mutex#sleep&lt;/code&gt; and &lt;code&gt;ConditionVariable#wait&lt;/code&gt; can wake up randomly due to something called &lt;a href=&#34;https://en.wikipedia.org/wiki/Spurious_wakeup&#34;&gt;&lt;strong&gt;Spurious wakeups&lt;/strong&gt;&lt;/a&gt;&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Thread runtimes can decide, for internal reasons, to wake up a waiting thread at random, so you should be ready to handle it - in our case we keep checking the expected condition &lt;code&gt;@count &amp;gt; 0&lt;/code&gt; and keep &lt;code&gt;wait&lt;/code&gt;ing until it becomes false. That way, if we’re woken up spuriously, we immediately go back to &lt;code&gt;wait&lt;/code&gt;ing.&lt;/p&gt;
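&lt;p&gt;The general pattern is to re-check your predicate in a loop every time &lt;code&gt;wait&lt;/code&gt; returns - a minimal sketch, with &lt;code&gt;ready?&lt;/code&gt; standing in for whatever condition you’re waiting on:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;@mutex.synchronize do
  # a spurious wakeup just loops straight back into wait
  @cond.wait(@mutex) until ready?
  # ready? is guaranteed to be true here
end
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;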
&lt;h4 id=&#34;monitor&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/monitor&#34;&gt;&lt;code&gt;Monitor&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;A &lt;code&gt;Monitor&lt;/code&gt; is &lt;em&gt;essentially&lt;/em&gt; the same as a &lt;code&gt;Mutex&lt;/code&gt;, but it is also “re-entrant”. What does it mean to be re-entrant? Let’s go back to an example from &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: part 2&lt;/a&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;monitor&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Result&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;attr_accessor&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:value&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Fibonacci&lt;/span&gt;
  @fib_monitor &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Monitor&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; self
    &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;result&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt;(value)
      @fib_monitor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @result &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; value }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;result&lt;/span&gt;
      @fib_monitor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @result }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;calculate&lt;/span&gt;(n)
      @fib_monitor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
        self&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;result &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Result&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
        result&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fib(n)
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;fib&lt;/span&gt;(n)
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; n &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; n &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
	
      fib(n &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; fib(n &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;)
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In it, when we call &lt;code&gt;#calculate&lt;/code&gt;, internally it calls &lt;code&gt;#result&lt;/code&gt; and &lt;code&gt;#result=&lt;/code&gt;. &lt;code&gt;calculate&lt;/code&gt; first acquires the lock using &lt;code&gt;synchronize&lt;/code&gt;, then it calls &lt;code&gt;result=&lt;/code&gt; which &lt;em&gt;also&lt;/em&gt; tries to acquire the lock. It is &lt;em&gt;re-entering&lt;/em&gt; the same lock. Let’s change &lt;code&gt;@fib_monitor&lt;/code&gt; to a &lt;code&gt;Mutex&lt;/code&gt; and see what happens:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Fibonacci&lt;/span&gt;
  @fib_monitor &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;Fibonacci&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;calculate(&lt;span style=&#34;color:#ae81ff&#34;&gt;10&lt;/span&gt;)
&lt;span style=&#34;color:#75715e&#34;&gt;# `synchronize&amp;#39;: deadlock; recursive locking (ThreadError)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-mutex-re-entrant.drawio.png&#34; width=&#34;435&#34; height=&#34;151&#34; alt=&#34;&#34;&gt;
&lt;p&gt;We immediately see an error raised: “deadlock; recursive locking”. By changing to a &lt;code&gt;Monitor&lt;/code&gt;, everything works fine.&lt;/p&gt;
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-monitor.drawio.png&#34; width=&#34;435&#34; height=&#34;180&#34; alt=&#34;&#34;&gt;
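&lt;p&gt;Swapping &lt;code&gt;@fib_monitor&lt;/code&gt; back to a &lt;code&gt;Monitor&lt;/code&gt;, the same call succeeds:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;class Fibonacci
  @fib_monitor = Monitor.new
end

Fibonacci.calculate(10)
puts Fibonacci.result.value # =&amp;gt; 55
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;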
&lt;p&gt;The &lt;code&gt;redis-rb&lt;/code&gt; gem creates clients that are thread-safe. It uses a &lt;code&gt;Monitor&lt;/code&gt; to do that, likely because it allows &lt;code&gt;synchronize&lt;/code&gt;d methods to call other &lt;code&gt;synchronize&lt;/code&gt;d methods without any recursive deadlocking errors:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Redis&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(options &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; {})
    @monitor &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Monitor&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
    &lt;span style=&#34;color:#75715e&#34;&gt;# ...&lt;/span&gt;
    inherit_socket &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; @options&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;delete(&lt;span style=&#34;color:#e6db74&#34;&gt;:inherit_socket&lt;/span&gt;)
	
    @client &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; initialize_client(@options)
    @client&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;inherit_socket! &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; inherit_socket
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;synchronize&lt;/span&gt;
    @monitor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { &lt;span style=&#34;color:#66d9ef&#34;&gt;yield&lt;/span&gt;(@client) }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;send_command&lt;/span&gt;(command, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;block)
    @monitor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      @client&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call_v(command, &lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;block)
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;rescue&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;RedisClient&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Error&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; error
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Client&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;translate_error!(error)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#75715e&#34;&gt;# lib/redis/commands/transactions.rb&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;multi&lt;/span&gt;
    synchronize &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;client&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
      client&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;multi &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;raw_transaction&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;yield&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;MultiConnection&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(raw_transaction)
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;Monitor&lt;/code&gt; comes with some other conveniences around creating &lt;code&gt;ConditionVariable&lt;/code&gt;s related to it, which you can read more about in its documentation.&lt;/p&gt;
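&lt;p&gt;One of those is &lt;code&gt;Monitor#new_cond&lt;/code&gt;, which returns a condition variable bound to the monitor and adds predicate helpers like &lt;code&gt;wait_until&lt;/code&gt; and &lt;code&gt;wait_while&lt;/code&gt; that handle the spurious-wakeup loop for you. A small sketch:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;require &amp;#34;monitor&amp;#34;

monitor = Monitor.new
ready = monitor.new_cond # bound to this monitor&amp;#39;s lock
done = false

t = Thread.new do
  monitor.synchronize do
    ready.wait_until { done } # releases the lock while waiting, re-checks on wakeup
    puts &amp;#34;done!&amp;#34;
  end
end

monitor.synchronize do
  done = true
  ready.signal
end
t.join
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;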
&lt;h4 id=&#34;queue&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/queue&#34;&gt;&lt;code&gt;Queue&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;Queue&lt;/code&gt; is one of two thread-safe data structures that come out of the box with Ruby. It is a first-in, first-out (FIFO) queue which allows for safe communication between threads. It’s primarily used for implementing producer/consumer patterns between threads:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Queue&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
producer &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    queue &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; i
    i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
    sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;create_consumer&lt;/span&gt;(name, queue)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      item &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; queue&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pop
      puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;name&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt; got another item &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;item&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt; at &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
create_consumer(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Consumer 1&amp;#34;&lt;/span&gt;, queue)
create_consumer(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Consumer 2&amp;#34;&lt;/span&gt;, queue)
	
producer&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 1 got another item 0 at 2024-08-24 20:58:46 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 2 got another item 1 at 2024-08-24 20:58:47 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 1 got another item 2 at 2024-08-24 20:58:48 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 2 got another item 3 at 2024-08-24 20:58:49 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 1 got another item 4 at 2024-08-24 20:58:50 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 2 got another item 5 at 2024-08-24 20:58:51 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 1 got another item 6 at 2024-08-24 20:58:52 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 2 got another item 7 at 2024-08-24 20:58:53 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 1 got another item 8 at 2024-08-24 20:58:54 +0000&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumer 2 got another item 9 at 2024-08-24 20:58:55 +0000&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The &lt;code&gt;producer&lt;/code&gt; thread endlessly adds an item to the queue every second, and two consumer threads &lt;code&gt;pop&lt;/code&gt; items off the queue. If no item is available, &lt;code&gt;pop&lt;/code&gt; sleeps the thread until one becomes available.&lt;/p&gt;
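&lt;p&gt;Worth knowing for shutdown: &lt;code&gt;Queue#close&lt;/code&gt; lets consumers drain any remaining items and then receive &lt;code&gt;nil&lt;/code&gt; instead of blocking forever:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;queue = Queue.new
queue &amp;lt;&amp;lt; 1
queue.close
queue.pop # =&amp;gt; 1
queue.pop # =&amp;gt; nil - closed and empty, so it doesn&amp;#39;t block
queue &amp;lt;&amp;lt; 2 # raises ClosedQueueError
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;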
&lt;p&gt;If it seems like you could pretty easily implement this using a &lt;code&gt;Mutex&lt;/code&gt; and a &lt;code&gt;ConditionVariable&lt;/code&gt;, you’d be right! I’ll leave that for you to try as an exercise.&lt;/p&gt;
&lt;h4 id=&#34;sized-queue&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/sizedqueue&#34;&gt;&lt;code&gt;SizedQueue&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;SizedQueue&lt;/code&gt; is the other thread-safe data structure available in Ruby, and it’s another flavor of the base &lt;code&gt;Queue&lt;/code&gt; class. It allows you to create a fixed-size queue - when the queue is full, pushing a new item blocks until space is available.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;fixed_queue &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;SizedQueue&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;)
fixed_queue &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
fixed_queue &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;
fixed_queue &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;
fixed_queue &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# raises &amp;#34;No live threads left. Deadlock?&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This makes it a good tool for throttling your own code so it doesn’t overwhelm your application. The following code blocks whenever more than 5 items sit in the queue, throttling the producers so the consumers can keep up:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;throttle &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;SizedQueue&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;)
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Producers&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; i&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;even?
      throttle &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Producer 1: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;i&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pass
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  i &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    i &lt;span style=&#34;color:#f92672&#34;&gt;+=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; i&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;odd?
      throttle &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Producer 2: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;i&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pass
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Consumers&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Thread 1: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;throttle&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pop&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
    sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Thread 2: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;throttle&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pop&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
    sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Threads waiting: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;throttle&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;num_waiting&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
  sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 0&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 1: Producer 1: 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 2: Producer 2: 1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 1: Producer 1: 4&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 2: Producer 2: 3&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 1: Producer 1: 6&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 1: Producer 2: 5&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 1&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Threads waiting: 2&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 2: Producer 1: 8&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread 1: Producer 2: 7&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You’ll see the number of waiting threads move between 1 and 2, as producers get blocked while consumers slowly pull data off the &lt;code&gt;SizedQueue&lt;/code&gt;.&lt;/p&gt;
&lt;h4 id=&#34;thread-group&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/threadgroup&#34;&gt;&lt;code&gt;ThreadGroup&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;The Ruby docs describe &lt;code&gt;ThreadGroup&lt;/code&gt; as “a means of keeping track of a number of threads as a group.” A thread is automatically a part of the “default” group:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; {} }
t&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;group &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ThreadGroup&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Default&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# true&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And you can add threads to a new group:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;t &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#66d9ef&#34;&gt;loop&lt;/span&gt; {} }
t2 &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep }
group &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;ThreadGroup&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
group&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;add(t)
group&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;add(t2)
puts group&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;list
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:1 run&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#&amp;lt;Thread:0x0...main.rb:2 sleep&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I’m mentioning them here for completeness, but you almost &lt;em&gt;never&lt;/em&gt; see them in real use. See &lt;a href=&#34;https://rubyapi.org/3.3/o/threadgroup&#34;&gt;their documentation&lt;/a&gt; to learn more.&lt;/p&gt;
&lt;p&gt;——&lt;/p&gt;
&lt;p&gt;That’s about it! Out of the box, Ruby comes with a pretty small set of Thread primitives. Most code you might see in real use will use either these, or options from the &lt;code&gt;concurrent-ruby&lt;/code&gt; gem. We’ll dig more into concurrent Ruby in “Abstracted, concurrent Ruby” later on in the series.&lt;/p&gt;
&lt;h3 id=&#34;memory-visibility&#34;&gt;Memory visibility&lt;/h3&gt;
&lt;p&gt;The last thing we’ll discuss before finishing up is a concept called “memory visibility”.&lt;/p&gt;
&lt;p&gt;The simplest way to think of memory visibility is “what each thread can see at any given time”. When threads are running on multiple CPUs, a common optimization is to localize data to the fastest memory caches, located on the CPU itself. When that happens, two different threads can operate on a shared piece of data and have completely different views of it, because each has a localized, out-of-sync version.&lt;/p&gt;
&lt;p&gt;In addition, the CPU can actually reorder certain operations to optimize them.&lt;/p&gt;
&lt;p&gt;How can you solve these problems? When you need to make sure each thread sees a consistent, accurate version of a shared piece of data, you use something called a memory barrier. How can you use a memory barrier from Ruby? A mutex! As long as you wrap every access to a particular shared resource in the same mutex, you’re guaranteed to see a consistent view of it:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ruby&#34; data-lang=&#34;ruby&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;AtomicBoolean&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;initialize&lt;/span&gt;(default &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt;)
    @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; default
    @mutex &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Mutex&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;true?&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;false?&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;==&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;make_true&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;true&lt;/span&gt; } 
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;make_false&lt;/span&gt;
    @mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize { @value &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;false&lt;/span&gt; }
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;For data isolated to a single thread, or for immutable objects, memory visibility doesn’t really matter - it’s another reason sharing mutable objects between threads can lead to issues, and avoiding sharing is better than trying to safely coordinate it.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 In CRuby, memory visibility is &lt;em&gt;unlikely&lt;/em&gt; to ever be an issue for you. That’s because there is always a mutex involved when moving between threads: the GVL. We’ll talk more about the GVL in “Thread and its MaNy friends”. But to be on the cautious side, you’re better off consistently synchronizing reads and writes of any data shared between threads.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;———&lt;/p&gt;
&lt;p&gt;We’ve now dug into the majority of the Thread API. The main piece we’ve only touched on lightly is interrupting threads. It’s up next in “Interrupting Threads: Colorless, concurrent Ruby”. More soon! 👋🏼&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;There are a couple of other interfaces for creating a thread, but you basically never see them in use.&lt;/p&gt;
&lt;p&gt;a = 1&lt;/p&gt;
&lt;p&gt;b = 2&lt;/p&gt;
&lt;p&gt;Thread.new(a, b) { |a, b| puts a, b }&lt;/p&gt;
&lt;p&gt;👆 this one has been described in some places as making copies to keep things thread-safe - but that’s incorrect - it just passes the references along, so it doesn’t provide much value&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;You can handle signals in a couple ways that we’ll discuss later&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Spurious: not being what it purports to be; false or fake&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/group-74.jpg)



&gt; 👋🏼 This is part of a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt; - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - The Thread API
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my!](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “The Thread API: Concurrent, colorless Ruby”. I’ll update the links as each part is released, and include these links in each post.


- [The Thread API](#the-thread-api)
	- [Don’t let the cute mascot fool you](#the-cute-mascot)
	- [A thread api primer (with examples!)](#thread-api-primer)
		- [Waiting for a thread to complete](#waiting-to-complete)
			- [`join`](#thread-join)
			- [`value`](#thread-value)
			- [`join(timeout_in_seconds)`](#thread-join-timeout)
		- [Error reporting](#thread-error-reporting)
			- [`report_on_exception` and `abort_on_exception`](#thread-exceptions)
		- [Tracking threads](#thread-tracking)
			- [`Thread.list`](#thread-list)
			- [`name`](#thread-name)
			- [`status`, `alive?` and `stop?`](#thread-statuses)
		- [Thread scheduling](#thread-scheduling)
			- [`Thread.pass`, `wakeup`, `Thread.stop`, `run`, and `priority`](#thread-schedule-methods)
		- [Thread shutdown](#thread-shutdown)
			- [`raise` and `kill`](#thread-raise-kill)
			- [`Thread.handle_interrupt`](#thread-handle-interrupt)
			- [`Process._fork`](#process-fork)
	- [Coordinating threads](#thread-coordination)
		- [`Mutex`](#mutex)
		- [`ConditionVariable`](#condition-variable)
		- [`Monitor`](#monitor)
		- [`Queue`](#queue)
		- [`SizedQueue`](#sized-queue)
		- [`ThreadGroup`](#thread-group)
	- [Memory visibility](#memory-visibility)


&lt;h2 id=&#34;the-thread-api&#34;&gt;The Thread API 🧵&lt;/h2&gt;

We&#39;re going to break down threads into three parts:

- The Thread API - All the tools available to you in the Ruby runtime to manage threads, and how they work
- Interrupting Threads - How threads get stuck, and how to shut them down safely
- Thread and its MaNy friends - Thread architecture and the GVL

This post covers the Thread API. We&#39;ll go over every method available to you, why they matter, how to call them, and often how popular open source projects use them.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ab73de82d9.png&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;


&lt;h3 id=&#34;the-cute-mascot&#34;&gt;Don’t let the cute mascot fool you&lt;/h3&gt;


Before we start digging into threads, I’d like to make a small disclaimer: **writing safe, deterministic, bug-free threaded code is hard**.


I think understanding threads is valuable knowledge. After all, whether you explicitly use threads or not, [your Ruby programs are always multi-threaded](https://jpcamara.com/2024/06/04/your-ruby-programs.html). If you’re always in the context of a thread, it’s helpful to know how they work behind the scenes. Even if the most you ever do is set a thread count on a server, it still helps to know how they work - it better informs what changing those numbers can and can’t do for you.


```ruby
Thread.new do
  puts &#34;seems easy enough? 🤷🏻‍♂️&#34;
end
```
Behind the simplicity of the thread interface lives a lot of complexity. An OS thread gives you the literal ability to perform tasks in _parallel_. And once things can run in parallel, our sequential thinking starts to fail us. It’s difficult to correctly read through code step by step when, at any point, that code can be swapped out for a separate piece of code - no warning, no way to determine exactly where your program will switch to next.


It’s the same when multiple people work on tasks in parallel - you encounter communication breakdowns. There can be contention over a shared resource. You can undo someone else’s work, or leave them with inconsistent information so they finish their task - but finish it incorrectly.


![](https://cdn.uploads.micro.blog/98548/2024/90bc5383-008d-42e3-8532-29cf8c6ad832.jpeg)


There’s a cognitive bias known as the [Dunning Kruger Effect](https://en.m.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect). Essentially, someone with limited experience in something can overestimate their abilities, or underestimate the complexity of the task. `Thread.new {}` feels pretty simple - it’s easy to underestimate what goes into it. Diving into threads together helps us realize they wield a lot of power!


There’s a reason that most threaded code in gems like Rails, SolidQueue and GoodJob uses the `concurrent-ruby` gem. Abstractions are your friend. We’ll dig into abstractions later in the series - for now we’ll learn about threads directly. But don’t stop here! Learn the foundation, and when you need it yourself, learn the abstractions and tools that make it easier.


![](https://cdn.uploads.micro.blog/98548/2024/24c9850ebd.png)


&gt; **source**: Eli and JP Camara, [https://x.com/logicalcomic](https://x.com/logicalcomic)


This post will dig into all the options available in Ruby out of the box. You’ll learn each thread method and what they’re for, and we’ll discuss how to coordinate your threads once you create them. Let’s go!


&lt;h3 id=&#34;thread-api-primer&#34;&gt;A &lt;a href=&#34;https://rubyapi.org/3.3/o/thread&#34;&gt;thread&lt;/a&gt; api primer (with examples!)&lt;/h3&gt;


We&#39;ll start off with some details on how you interact with threads. Let&#39;s create a thread!


```ruby
Thread.new do
  sleep 10
  puts &#34;finished!&#34;
end
```
Run that code[^1] 👆🏼… hmmm, nothing happens. Nothing prints and the program exits silently. What happened?


Everything starts off running in the “main thread”, which is accessible by calling `Thread.main`. When a new thread is created it’s like a branch off of the main thread. Those branches exist independently and the main thread doesn’t wait for them to finish by default.
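

A quick sanity check of that relationship (a toy sketch):


```ruby
puts Thread.current == Thread.main #=&gt; true, we&#39;re on the main thread
Thread.new do
  puts Thread.current == Thread.main #=&gt; false, we branched off
end
sleep 1 # crude way to let the thread finish - we&#39;ll see better options next
```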


&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-without-join.drawio.png&#34; width=&#34;453&#34; height=&#34;409&#34; alt=&#34;&#34;&gt;


&lt;h3 id=&#34;waiting-to-complete&#34;&gt;Waiting for a thread to complete&lt;/h3&gt;


&lt;h4 id=&#34;thread-join&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-join&#34;&gt;&lt;code&gt;join&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


To wait on a specific thread, you `join` that thread and the current thread together. Here we have the main thread join with `t`:


```ruby
t = Thread.new do
  sleep 10
  puts &#34;finished!&#34;
end
# We&#39;re at the top level in `Thread.main`, which &#34;joins&#34; with `t` until it finishes
t.join
# finished!
```
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-with-join.drawio.png&#34; width=&#34;529&#34; height=&#34;448&#34; alt=&#34;&#34;&gt;


When the thread finishes, the thread object is returned from `join`.


———


&lt;h4 id=&#34;thread-value&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-value&#34;&gt;&lt;code&gt;value&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


You can also join on a thread and ask it for the last value returned using `value`:


```ruby
t = Thread.new do
  sleep 10
  File.read(a_file_path)
end
puts t.value # contents of `a_file_path`
```
The simplest way to run some work concurrently is to start up several threads and then iterate over each one calling `join` or `value`. We’ll make 4 HTTP requests here, and the overall time will be roughly as long as the longest single request:


```ruby
# frozen_string_literal: true
	
require &#34;net/http&#34;
require &#34;json&#34;
	
def generate_uuid_thread
  url = &#34;https://httpbin.org/uuid&#34;
  Thread.new do
    response = Net::HTTP.get(URI(url))
    JSON.parse(response)[&#34;uuid&#34;]
  end
end
	
uuids = 4.times.map do
  generate_uuid_thread
end.map(&amp;:value)
puts uuids
# 6b76167d-9bac-45dc-8d0b-5b7af865b843
# a025a125-51a5-4773-b78d-ab93fba02eb3
# 98befb95-adf7-4c89-9fd0-d10f6b2a3d7a
# 95f486fd-5bc6-46fe-b65a-c52a87140dfb
```
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-page-4-parallel-io.drawio.png&#34; width=&#34;600&#34; height=&#34;449&#34; alt=&#34;&#34;&gt;


Make sure you let each thread start first! If you join too early, you end up running them sequentially:


```ruby
uuids = []
uuids &lt;&lt; generate_uuid_thread.value
uuids &lt;&lt; generate_uuid_thread.value
uuids &lt;&lt; generate_uuid_thread.value
uuids &lt;&lt; generate_uuid_thread.value
puts uuids
# 6b76167d-9bac-45dc-8d0b-5b7af865b843
# a025a125-51a5-4773-b78d-ab93fba02eb3
# 98befb95-adf7-4c89-9fd0-d10f6b2a3d7a
# 95f486fd-5bc6-46fe-b65a-c52a87140dfb
```
The end result looks identical - the _output_ is the same - but the _execution_ is totally different. When we create all the threads up front, certain operations can run in parallel:


- Threads 1 through 4 get created and are “runnable”
- The main thread blocks on the first iteration of `map(&amp;:value)`
- As each thread blocks on the HTTP call, they run in parallel
- The loop takes as long as the longest thread, so if it takes us 50ms for 3 requests, and the 4th is 100ms, we spend around 100ms total


Calling `generate_uuid_thread.value` one at a time, we’re just running the code sequentially:


- Thread 1 gets created and run, returning its `value`
- Thread 2 gets created and run, returning its `value`
- Same for threads 3 and 4
- We’re running sequentially, so the threads provided no value. It would take roughly 250ms. If anything, the threads likely _added_ overhead.


&gt; ⚠️ Never _actually_ generate a uuid using a web service, that’s crazy 🤣


———


&lt;h4 id=&#34;thread-join-timeout&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-join&#34;&gt;&lt;code&gt;join(timeout_in_seconds)&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


You can also limit the amount of time you `join`. If you’re calling `join`, typically you just want to wait however long it takes. But using `join(seconds)` could be useful to periodically pop into some other work or alert that you’ve been running too long. `join(seconds)` returns `nil` while the thread is still running, and returns the thread once it finishes.


```ruby
t = Thread.new do
  sleep 10
end
t2 = Thread.new do
  sleep 20
end
	
while t2.alive?
  puts &#34;wait a bit more... #{t2.join(5)}&#34;
end
puts &#34;done!&#34;
# wait a bit more... 
# wait a bit more... 
# wait a bit more... 
# wait a bit more... #&lt;Thread:0x0... main.rb:116 dead&gt;
# done!
	
def fib(n)
  return n if n &lt;= 1
  fib(n - 1) + fib(n - 2)
end
	
t2 = Thread.new do
  fib(40)
end
while t2.alive?
  puts &#34;wait a bit more... #{t2.join(0.01)}&#34;
end
puts &#34;done! #{t2.value}&#34; # value is still returned even after a join
# wait a bit more... 
# wait a bit more... 
# wait a bit more... 
# wait a bit more... 
# wait a bit more... 
# done! 9227465
```
You could use it to intermittently check for a thread finishing without wasting too many CPU cycles:


```ruby
t = Thread.new do
  sleep 5
end
until t.join(1)
  puts &#34;still waiting...&#34;
end
puts &#34;done!&#34;
# still waiting...
# still waiting...
# still waiting...
# still waiting...
# done!
```
The [Puma web server](https://github.com/puma/puma) uses `join(seconds)` to manage [shutting down its thread pool](https://github.com/puma/puma/blob/master/lib/puma/thread_pool.rb). It iterates over each thread, adjusting the `join` timeout based on how much time has elapsed since starting the method:


```ruby
join = -&gt;(inner_timeout) do
  start = Process.clock_gettime(
    Process::CLOCK_MONOTONIC
  )
  threads.reject! do |t|
    elapsed = Process.clock_gettime(
      Process::CLOCK_MONOTONIC
    ) - start
    t.join inner_timeout - elapsed
  end
end
	
# Wait +timeout+ seconds for threads to finish.
join.call(timeout)
```
It uses `join(timeout)` to try and let each thread finish before forcing a shutdown with `raise` or `kill`. The `reject!` removes any threads that finish during that time - `nil` returned from `join` means the thread is still running, otherwise the thread object is returned (removing it from the array).


&gt; 📝 A “thread pool” is a reusable, typically fixed size set of threads. Rather than create brand new threads every time they are needed, a thread pool saves on thread creation by reusing threads to perform operations. They are generally used to save on thread creation cost and limit the number of threads running at a time. `concurrent-ruby` comes with several types of thread pools.
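

To make the idea concrete, here’s a toy sketch of a thread pool - an illustration with its own assumptions, not how `concurrent-ruby` or Puma implement theirs. Workers block on a shared `Queue` (covered later in this post) and run whatever blocks get posted:


```ruby
# A toy fixed-size thread pool - illustration only
class TinyPool
  def initialize(size)
    @work = Queue.new
    @threads = size.times.map do
      Thread.new do
        # each worker blocks on pop until a job (or the shutdown marker) arrives
        while (job = @work.pop) != :shutdown
          job.call
        end
      end
    end
  end

  def post(&amp;block)
    @work &lt;&lt; block
  end

  def shutdown
    @threads.size.times { @work &lt;&lt; :shutdown }
    @threads.each(&amp;:join)
  end
end

pool = TinyPool.new(2)
4.times { |i| pool.post { puts &#34;job #{i}&#34; } }
pool.shutdown
```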


&lt;h3 id=&#34;thread-error-reporting&#34;&gt;Error Reporting&lt;/h3&gt;


&lt;h4 id=&#34;thread-exceptions&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-report_on_exception&#34;&gt;&lt;code&gt;report_on_exception&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-abort_on_exception&#34;&gt;&lt;code&gt;abort_on_exception&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


What happens if something fails in your thread?


```ruby
t = Thread.new do
  raise &#34;oops!&#34;
end
sleep # does the program sleep forever, or raise an error?
```
Running that 👆🏼, it just runs forever. Even though our thread has failed, it never impacts the program. We will see an error printed in our console, however:


```
terminated with exception (report_on_exception is true):
main.rb:2:in `block in &lt;main&gt;`: oops! (RuntimeError)
```
You can control whether it fails silently, but the default is to report the error. Keep that default - silent errors are not your friend. But if you _really_ needed to, you can change it globally or with a per thread setting:


```ruby
# No threads log their errors ☠️ 
Thread.report_on_exception = false
	
# This individual thread still logs its errors
t.report_on_exception = true
```
What if you want the error raised in the current thread? Similar to `join`ing execution with a thread to wait for it to finish, you need to join with it to surface its error:


```ruby
t = Thread.new do
  raise &#34;oops!&#34;
end
t.join # or .value, raises &#34;oops!&#34;
```
You can also force a thread’s exception to propagate and blow up the program, even when running independently:


```ruby
t = Thread.new do
  sleep 5
  raise &#34;listen to me!&#34;
end
t.abort_on_exception = true
sleep 10 # t blows up the program!
```
This can be set globally as well:


```ruby
Thread.abort_on_exception = true
t = Thread.new do
  sleep 5
  raise &#34;listen to me!&#34;
end
sleep 10 # t blows up the program!
```
Rack timeout uses it in [its Scheduler thread](https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb), applying it using `Thread.current`:


```ruby
def run_loop!
  Thread.current.abort_on_exception = true
```
In general, you don’t see this much in a production setting. But it could be useful if you’re writing a one-off script and you want any thread that fails to kill the program.


&lt;h3 id=&#34;thread-tracking&#34;&gt;Tracking threads&lt;/h3&gt;

Can we find out what threads are running?

&lt;h4 id=&#34;thread-list&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-list&#34;&gt;&lt;code&gt;Thread.list&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;

`Thread.list` will give you every thread that hasn&#39;t already finished:

```ruby
Thread.new { sleep 0.1 }
Thread.new { sleep 0.1 }
Thread.new { sleep 0.1 }
Thread.new {}
puts Thread.list
	
#&lt;Thread:0x0... run&gt;
#&lt;Thread:0x0... main.rb:1 sleep&gt;
#&lt;Thread:0x0... main.rb:2 sleep&gt;
#&lt;Thread:0x0... main.rb:3 sleep&gt;
#&lt;Thread:0x0... main.rb:4 dead&gt;
```
&lt;h4 id=&#34;thread-name&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-name&#34;&gt;&lt;code&gt;name&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


Ok… but how can you differentiate them? There are a few ways you can achieve that:


1. File information
2. Setting a name
3. Inheritance


File information gets printed by default, and shows you the name of the file and what line the thread was started on:


```ruby
t = Thread.new { sleep 0.1 }
t2 = Thread.new { sleep 0.1 }
t3 = Thread.new { sleep 0.1 }
t4 = Thread.new { sleep 0.1 }
puts Thread.list
	
#&lt;Thread:0x0...main.rb:1 sleep&gt;
#&lt;Thread:0x0...main.rb:2 sleep&gt;
#&lt;Thread:0x0...main.rb:3 sleep&gt;
#&lt;Thread:0x0...main.rb:4 sleep&gt;
```
The file info is useful, but often your threads are all started from the same place, so it doesn’t tell you much:


```ruby
def start_thread
  Thread.new { sleep 0.1 }
end
t = start_thread
t2 = start_thread
t3 = start_thread
t4 = start_thread
puts Thread.list
	
#&lt;Thread:0x0...main.rb:2 sleep&gt;
#&lt;Thread:0x0...main.rb:2 sleep&gt;
#&lt;Thread:0x0...main.rb:2 sleep&gt;
#&lt;Thread:0x0...main.rb:2 sleep&gt;
```
In that case, you can set a `name` on each thread to differentiate them:


```ruby
Thread.main.name = &#34;main&#34;
t.name = &#34;first!&#34;
t2.name = &#34;second!&#34;
t3.name = &#34;third!&#34;
t4.name = &#34;fourth!&#34;
puts Thread.list
	
#&lt;Thread:0x0...@first!  main.rb:2 sleep&gt;
#&lt;Thread:0x0...@second! main.rb:2 sleep&gt;
#&lt;Thread:0x0...@third!  main.rb:2 sleep&gt;
#&lt;Thread:0x0...@fourth! main.rb:2 sleep&gt;
```
The [concurrent-ruby](https://github.com/ruby-concurrency/concurrent-ruby) gem uses this to help differentiate threads created in its [ThreadPoolExecutor](https://github.com/ruby-concurrency/concurrent-ruby/blob/master/lib/concurrent-ruby/concurrent/executor/ruby_thread_pool_executor.rb):


```ruby
# concurrent/executor/ruby_thread_pool_executor.rb
@thread.name = [pool.name, &#39;worker&#39;, id].compact.join(&#39;-&#39;)
#...
	
require &#34;concurrent&#34;
	
pool = Concurrent::ThreadPoolExecutor.new(
  name: &#34;🤖&#34;
)
pool.post { sleep 0.1 }
pool.post { sleep 0.1 }
pool.post { sleep 0.1 }
pool.post { sleep 0.1 }
puts Thread.list
#&lt;Thread:0x0...@🤖-worker-1...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&gt;
#&lt;Thread:0x0...@🤖-worker-2...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&gt;
#&lt;Thread:0x0...@🤖-worker-3...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&gt;
#&lt;Thread:0x0...@🤖-worker-4...concurrent/executor/ruby_thread_pool_executor.rb:339 sleep_forever&gt;
```
The [Honeybadger gem](https://github.com/honeybadger-io/honeybadger-ruby) runs a [background thread](https://github.com/honeybadger-io/honeybadger-ruby/blob/master/lib/honeybadger/worker.rb) when sending errors to their error reporting service. To differentiate their thread, they use inheritance. The information about each thread includes the class, which makes it easy to identify:


```ruby
module Honeybadger
  class Worker
    class Thread &lt; ::Thread; end
  end
end
	
Honeybadger::Worker::Thread.new { sleep 0.1 }
puts Thread.list
#&lt;Thread:0x0... run&gt;
#&lt;Honeybadger::Worker::Thread:0x0... main.rb:119 sleep&gt;
```
———


&lt;h4 id=&#34;thread-statuses&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-status&#34;&gt;&lt;code&gt;status&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-alive-3F&#34;&gt;&lt;code&gt;alive?&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-stop-3F&#34;&gt;&lt;code&gt;stop?&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


Can you keep track of the status of a running thread? Yep! Threads operate in one of 5 states - they’re not the _most_ intuitive, but they’re what you’ve got:


- `“run”` - the thread is running
- `“sleep”` - the thread is “sleeping”: it’s blocked on some operation, it put itself to sleep, or it was put to sleep by the thread scheduler
- `“aborting”` - the thread has failed but hasn’t finished running yet 
- `nil` - an error was raised and the thread is dead
- `false` - the thread finished normally


Let’s demonstrate some statuses:


```ruby
a = Thread.new { raise(&#34;bye bye&#34;) }
b = Thread.new { Thread.stop }
c = Thread.new {}
d = Thread.new { 
  IO.select(nil, nil, nil, 3)
}
d.join(1) # wait on d for 1 second
puts a.status.class          #=&gt; NilClass
puts b.status                #=&gt; &#34;sleep&#34;
puts c.status                #=&gt; false
puts d.status                #=&gt; &#34;sleep&#34;
puts Thread.current.status   #=&gt; &#34;run&#34;
```
You’ll notice a few things about the above:


1. I didn’t show “aborting”. That’s because I’m not sure regular code could ever see that status. Catching a failed thread that hasn’t finished yet seems pretty hard to do. Internally the CRuby thread needs to be in a `to_kill` state. I would love it if someone knows a way to demonstrate it, though! It would perhaps require the assistance of a core CRuby wizard 🧙‍♀️.
2. “sleep” is not specific to the `sleep` method. In thread `d` we are making an `IO.select` call that takes three seconds. So the thread blocks while waiting, hence it is “sleep”ing.
3. Since you can’t run Ruby code in multiple threads in parallel, you _pretty_ much would only ever see “run” on the current, active thread.


It would be nice if the information was a bit more readable. We can put together a little helper to make the output clearer. We’ll also add in one more internal status not directly exposed by `status`:


```ruby
ThreadStatus = Data.define(
  :status, :error
)
	
def thread_status(thread)
  error = nil
  status = case thread.status
    when NilClass
      error = begin
        thread.join
      rescue =&gt; e
        e
      end
      &#34;failed w/ error: #{error}&#34;
    when FalseClass then &#34;finished&#34;
    when &#34;run&#34; then &#34;running&#34;
    when &#34;sleep&#34;
      parse_thread_sleep_status(thread)
  else thread.status
  end
  ThreadStatus.new(status:, error:)
end
	
def parse_thread_sleep_status(thread)
  status = thread.to_s
  status[status.index(&#34;sleep&#34;)..-2].sub(
    &#34;sleep&#34;, &#34;sleeping&#34;
  )
end
	
# our previous thread code... then...
puts thread_status(a)
puts thread_status(b)
puts thread_status(c)
puts thread_status(d)
puts thread_status(Thread.current)
	
#&lt;ThreadStatus status=&#34;failed w/ error: bye bye&#34;, error=#&lt;RuntimeError: bye bye&gt;
#&lt;ThreadStatus status=&#34;sleeping_forever&#34;, error=nil&gt;
#&lt;ThreadStatus status=&#34;finished&#34;, error=nil&gt;
#&lt;ThreadStatus status=&#34;sleeping&#34;, error=nil&gt;
#&lt;ThreadStatus status=&#34;running&#34;, error=nil&gt;
```
Using our new helper, we get a bit more readability and depth into our thread statuses. 


- Using `join` we are able to return more information about the failed thread - “failed” instead of nil, and the actual error it failed with
- We return “finished” instead of false for a successful finish
- `sleep_forever` lets us differentiate actively blocked threads (like one doing `IO.select`) from a thread that is actually stopped and won’t run again without intervention. We’ll talk more about `Thread.stop` in the next section


In addition to `status`, we can also use `alive?` and `stop?` to check on a thread’s status:


```ruby
t = Thread.new {}
t2 = Thread.new { loop {} }
t3 = Thread.new { Thread.stop }
t.name = &#34;quick&#34;
t2.name = &#34;slow&#34;
t3.name = &#34;on ice&#34;
[t, t2, t3].each do |thread|
  # make sure it gets a chance to run
  thread.join(0.01)
  puts &#34;#{thread.name}: &#34; \
    &#34;alive? #{thread.alive?}, &#34; \
    &#34;stopped? #{thread.stop?}&#34;
end
# quick:  alive? false, stopped? true
# slow:   alive? true,  stopped? false
# on ice: alive? true,  stopped? true
```
&lt;h3 id=&#34;thread-scheduling&#34;&gt;Thread Scheduling&lt;/h3&gt;

There are several methods for either taking direct action, or suggesting action to the thread scheduler. Before using them, keep in mind that &lt;strong&gt;you’re probably not smarter than the Ruby thread scheduler&lt;/strong&gt;. It tries to do what makes the most sense for the runtime, and it’s been tuned extensively. But these tools exist, and they get used, so let’s discuss them a bit.

&lt;h4 id=&#34;thread-schedule-methods&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-pass&#34;&gt;&lt;code&gt;Thread.pass&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-wakeup&#34;&gt;&lt;code&gt;wakeup&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-stop&#34;&gt;&lt;code&gt;Thread.stop&lt;/code&gt;&lt;/a&gt;, &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-run&#34;&gt;&lt;code&gt;run&lt;/code&gt;&lt;/a&gt;, and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-priority&#34;&gt;&lt;code&gt;priority&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;

`pass` and `wakeup` are kind of like nudges to the runtime. They request a particular action, but the scheduler does not have to honor them. `Thread.pass` tells the thread scheduler it can “pass” control to another thread:


```ruby
t = Thread.new do
  Thread.pass
  puts &#34;hi!&#34;
end
t2 = Thread.new do
  puts &#34;bye!&#34;
end
t.join
t2.join
# Most of the time you will see:
# bye!
# hi!
# but it&#39;s not guaranteed
```
`wakeup` marks a thread as eligible for scheduling. It’s up to the thread scheduler whether that happens:


```ruby
t = Thread.new do
  Thread.stop
  puts &#34;hi!&#34;
end
t.join(1)
t.wakeup
t.join
# hi!
```
`Thread.stop` and `run` are more direct commands to the thread scheduler:


```ruby
t = Thread.new do
  now = Time.now
  sleep 10
  puts &#34;Seconds slept: #{Time.now - now}&#34;
end
t.join(1)
t.run
t.join
# Seconds slept: 1.000076481
```
Only one second has passed, but `run` caused the `sleep` to finish early.


```ruby
t = Thread.new do
  Thread.stop
  puts &#34;done!&#34;
end
t.join(0.1)
puts thread_status(t)
puts &#34;alive? #{t.alive?}&#34;
puts &#34;stopped? #{t.stop?}&#34;
t.run
t.join
#&lt;ThreadStatus status=&#34;sleeping_forever&#34;&gt;
# alive? true
# stopped? true
# done!
```
`priority` gives a hint to the scheduler of which thread should be given more runtime. The thread docs have a good example of this:


```ruby
count1 = count2 = 0
a = Thread.new do
      loop { count1 += 1 }
    end
a.priority = -1
	
b = Thread.new do
      loop { count2 += 1 }
    end
b.priority = -2
sleep 1
puts count1 #=&gt; 21472634
puts count2 #=&gt; 14256235
```
The threads run forever, but thread `a` gets higher priority so it adds to the counter more often.


For an open source example of `run`, you can check the [`rack-timeout` scheduler](https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb):


```ruby
def schedule(event)
  @mx_events.synchronize { 
    @events &lt;&lt; event 
  }
  runner.run  # wakes up the runner thread so it can recalculate sleep length taking this new event into consideration
  return event
end
```
`Thread.pass` is a recommendation from Mike Perham for when your [jobs are hogging CPU](https://github.com/sidekiq/sidekiq/discussions/5039):


```ruby
class ExpensiveJob
  include Sidekiq::Job
	
  def perform
    loop do
      # expensive stuff
      Thread.pass if occasional_condition
    end
  end
end
```
&lt;h3 id=&#34;thread-shutdown&#34;&gt;Thread Shutdown&lt;/h3&gt;


&lt;h4 id=&#34;thread-raise-kill&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-raise&#34;&gt;&lt;code&gt;raise&lt;/code&gt;&lt;/a&gt; and &lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-i-kill&#34;&gt;&lt;code&gt;kill&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ab73de82d9.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;


&gt; You want to kill… me? 🥺


⚠️ **TL;DR** You shouldn’t use these methods unless you _really_ know what you’re doing. Instead, [interrupt your thread safely](#interrupt-safely). Incidentally, you should also [avoid the timeout module](#dont-use-timeout). We’ll dig deep into `raise` and `kill` in the next post on “Interrupting Threads”.


&lt;strong id=&#34;interrupt-safely&#34;&gt;Interrupt your thread safely&lt;/strong&gt;


Instead of killing your thread, set it up to be interruptible. Most mature, threaded frameworks operate this way.


```ruby
still_kickin = Concurrent::AtomicBoolean.new(true)
Thread.new do
  while still_kickin.true?
    # more work!
  end
end
	
still_kickin.make_false
```
&lt;strong id=&#34;dont-use-timeout&#34;&gt;Don&#39;t use &lt;code&gt;timeout&lt;/code&gt;&lt;/strong&gt;


If you see this in code, be concerned:


```ruby
require &#34;timeout&#34;
	
Timeout.timeout(1) do
  # 😱 
end
```
For some reason, the [`timeout`](https://github.com/ruby/timeout) gem itself doesn’t warn about any issues. But [Mike Perham summarizes it best](https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/):


![](https://cdn.uploads.micro.blog/98548/2024/727697a3-b461-4705-a1f1-8d0ed257857e.jpeg)
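

The core problem: `Timeout.timeout` uses another thread to `raise` into yours at an arbitrary point - including in the middle of an `ensure` - so cleanup can be abandoned halfway through. A runnable sketch of that failure mode, using `sleep` as a stand-in for real work:


```ruby
require &#34;timeout&#34;

begin
  Timeout.timeout(1) do
    begin
      sleep 0.9 # stand-in for slow work that *just* fits the deadline
    ensure
      puts &#34;cleaning up...&#34;
      sleep 0.5 # stand-in for multi-step cleanup - the timeout fires mid-ensure
      puts &#34;cleanup finished&#34; # never reached
    end
  end
rescue Timeout::Error
  puts &#34;timed out mid-cleanup 😱&#34;
end
```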


There’s nothing that exactly matches what timeout offers: a blanket way of timing out _any_ operation after the specified time limit. But instead of using the `timeout` gem, there is a repository called [The Ultimate Guide to Ruby Timeouts](https://github.com/ankane/the-ultimate-guide-to-ruby-timeouts). It shows you how to set timeouts safely for basically _every_ blocking operation you could care about timing out. For instance, how to properly handle timeouts using the [`redis`](https://github.com/redis/redis-rb) gem:


```ruby
Redis.new(
  connect_timeout: 1, 
  timeout: 1,
  #...
)
```
The one piece mentioned in that repository that you should leave alone is `Net::HTTP`’s `open_timeout`. Behind the scenes it uses the `timeout` module 🙅‍♂️. Leave the 60 second default - it should almost never impact you, and you’re probably worse off lowering it.
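

For reference, here’s where those knobs live on `Net::HTTP` - a sketch; `read_timeout` and `write_timeout` are enforced at the socket level rather than through the `timeout` module:


```ruby
require &#34;net/http&#34;

http = Net::HTTP.new(&#34;example.com&#34;, 443)
http.use_ssl = true
http.open_timeout = 60 # the default - leave it be
http.read_timeout = 5  # per-read socket wait, no timeout module involved
http.write_timeout = 5 # Ruby 2.6+
http.start { |conn| puts conn.get(&#34;/&#34;).code }
```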


———


&lt;h4 id=&#34;thread-handle-interrupt&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread#method-c-handle_interrupt&#34;&gt;&lt;code&gt;Thread.handle_interrupt&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


A thread can be externally “interrupted” by a few things:


1. `Thread#kill`
2. `Thread#raise`
3. Your program being exited
4. A signal, like Ctrl+C


`handle_interrupt` gives you the ability to control how your program reacts to 1-3[^2].


Because it primarily matters in the context of `raise` and `kill`, we’ll discuss it in the next post on “Interrupting Threads”.
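

As a small taste of what’s coming, here’s a sketch of deferring an interrupt until a critical section completes:


```ruby
th = Thread.new do
  Thread.handle_interrupt(RuntimeError =&gt; :never) do
    sleep 2 # a Thread#raise during this block is deferred...
    puts &#34;critical section finished cleanly&#34;
  end
  # ...and delivered here, once the block exits
end
th.report_on_exception = false # keep the demo output quiet
sleep 0.5 # give th time to enter the block (good enough for a demo)
th.raise(&#34;interrupt!&#34;)
th.join rescue puts &#34;thread died, but only after its critical section&#34;
# critical section finished cleanly
# thread died, but only after its critical section
```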


———


&lt;h4 id=&#34;process-fork&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/process#method-c-_fork&#34;&gt;&lt;code&gt;Process._fork&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


`Process._fork` isn’t a thread API, but it’s good to be aware of it for your threaded code.


What happens to a thread when a process forks?


&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-fork.drawio.png&#34; width=&#34;519&#34; height=&#34;220&#34; alt=&#34;&#34;&gt;


```ruby
t = Thread.new { sleep }
fork do
  puts &#34;inside fork: #{thread_status(t)}&#34;
end
puts &#34;outside fork: #{thread_status(t)}&#34;
# outside fork: #&lt;ThreadStatus status=&#34;running&#34;&gt;,  pid: 362
# inside fork:  #&lt;ThreadStatus status=&#34;finished&#34;&gt;, pid: 367
```
You can’t bring your threads with you when you fork. But you can recreate them, using `_fork`!


```ruby
module OnFork
  def _fork
    pid = super
    if pid == 0
      # your code to restart threads
    end
    pid
  end
end
	
Process.singleton_class.prepend(OnFork)
```
It’s a bit of a strange setup. You’re hooking into the inheritance chain for `Process._fork`, so you need to call `super` directly. No one calls `_fork` themselves - `super` ultimately returns the result of `fork` itself. If the result is `0`, we’re in the forked process, which means we can perform any kind of post-fork action. In the case of managing threads, that would involve recreating them.


The `connection_pool` gem uses this to run an `after_fork` method, which closes out connections.


```ruby
module ForkTracker
  def _fork
    pid = super
    if pid == 0
      ConnectionPool.after_fork
    end
    pid
  end
end
	
Process.singleton_class.prepend(
  ForkTracker
)
```
The [`redis-client` gem](https://github.com/redis-rb/redis-client/blob/master/lib/redis_client.rb) uses `_fork` to track the `pid` in `PIDCache`, so it can determine whether it needs to close the inherited socket (threads are not inherited when forking, but file descriptors are).


```ruby
def ensure_connected(retryable: true)
  close if !config.inherit_socket &amp;&amp; @pid != PIDCache.pid
```
&lt;h3 id=&#34;thread-coordination&#34;&gt;Coordinating Threads&lt;/h3&gt;


Now that we know the different methods of interacting with a thread directly, how can we coordinate threads together safely?


&gt; 📝 If you can avoid it, don’t coordinate at all! Immutable structures, or isolated work are your friends.


&lt;h4 id=&#34;mutex&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/mutex&#34;&gt;&lt;code&gt;Mutex&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


`Mutex` is the core thread coordination primitive in Ruby. It stands for **mut**ual **ex**clusion, and it allows you to control single thread access to a particular resource. A thread or fiber acquires a lock on the mutex, and it is the only thing that can unlock that mutex.


&gt; 📝 If you know database locks, a `Mutex` basically operates like an exclusive lock.


&gt; 📝 You’re better off not sharing objects, it keeps things simpler. I’ll reference my own note from “Your Ruby programs are always multi-threaded: Part 2”:
&gt; 
&gt; My personal metric is that the right amount of mutexes in my code is zero. If I am using a mutex, I think hard to figure out a way to avoid it because it means I’m opening up myself and future devs to a lot of cognitive overhead: you need to think critically anytime you make a change relating to mutex code.
&gt; 
&gt; If you’re a library or framework author they may be unavoidable at some point to do interesting or useful things. In my own application code, I can pretty much _always_ avoid them.
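

To see the basic mechanics before we build something with one, here’s the classic shared-counter sketch. In CRuby a bare `+=` is hard to break in practice because of the GVL - more on that in a note below - but on truly parallel runtimes this lock is load-bearing:


```ruby
counter = 0
mutex = Mutex.new

threads = 10.times.map do
  Thread.new do
    1_000.times do
      # the read-modify-write is only safe because we hold the lock
      mutex.synchronize { counter += 1 }
    end
  end
end
threads.each(&amp;:join)

puts counter # always 10_000
```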


Earlier we used a class from `concurrent-ruby` called `AtomicBoolean` to implement an interruptible thread. What if we wanted to implement it ourselves?


```ruby
class AtomicBoolean
  def initialize(default = false)
    @value = default
    @mutex = Mutex.new
  end
	
  def true?
    @mutex.synchronize { @value == true }
  end
	
  def false?
    @mutex.synchronize { @value == false }
  end
	
  def make_true
    @mutex.synchronize { @value = true } 
  end
	
  def make_false
    @mutex.synchronize { @value = false }
  end
end
```
To make sure our data stays consistent, we `synchronize` every access. That way we know we can’t corrupt anything and we have a consistent view of `@value`:


```ruby
still_kickin = AtomicBoolean.new(true)
Thread.new do
  while still_kickin.true?
    # more work!
  end
end
	
still_kickin.make_false
```
It doesn’t make sure we do things in any expected _order_, but it’s a foolproof way of having proper access and visibility.


&gt; 📝 Truthfully, CRuby doesn’t need this kind of corruption guarantee. But true parallel Ruby runtimes like JRuby and TruffleRuby do.


`synchronize` is probably the only method you’ll find yourself using on a `Mutex`. Look in different projects and that’s 95% of all `Mutex` usage. But there are more methods available that you may see on occasion:


- `lock`
- `unlock`
- `try_lock`
- `owned?`
- `locked?`
- `sleep`


`lock` and `unlock` can be used to recreate what `synchronize` does:


```ruby
def synchronize(mutex)
  mutex.lock
  yield
ensure
  mutex.unlock
end
	
synchronize(mutex) do
  # locked work
end
```
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-mutex.drawio.png&#34; width=&#34;435&#34; height=&#34;105&#34; alt=&#34;&#34;&gt;


`try_lock` lets you attempt a lock without blocking. When you call `lock` or `synchronize`, your code will block until you are able to acquire a lock: 


```ruby
mutex = Mutex.new
t = Thread.new do
  mutex.lock
  loop {} # runs forever, never releasing
ensure
  mutex.unlock
end
t2 = Thread.new do
  mutex.synchronize do
    # do some work
  end
end
t.join(1)
t2.join # t never releases the lock, so t2 blocks forever
```
But `try_lock` just returns a boolean telling you whether the lock was acquired:


```ruby
mutex = Mutex.new
t = Thread.new { mutex.lock; loop {} }
t2 = Thread.new do
  if mutex.try_lock
    # do some work
  else
    raise &#34;Couldn&#39;t acquire the lock!&#34;
  end
ensure
  mutex.unlock if mutex.owned?
end
t.join(1)
t2.join # tries to acquire the lock, and raises an error because it can&#39;t
```
Notice the `owned?` call in the `ensure`? We don’t know if the lock was successfully acquired, so we only call `unlock` if the lock is `owned?`. Being `owned?` means the current thread successfully acquired the lock, and is the current “owner”. If you called `unlock` from a thread that isn’t the owner, an error would be raised.
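

You can see that error with a tiny sketch:


```ruby
mutex = Mutex.new
mutex.lock # the main thread owns the lock
t = Thread.new { mutex.unlock } # this thread doesn&#39;t own it
t.join
# raises ThreadError: Attempt to unlock a mutex which is locked by another thread/fiber
```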


`locked?` allows you to check if the lock is owned by _some_ thread. Seems susceptible to some race conditions, but you could use it to determine if you need to perform an action. The [`aws-sdk-core`](https://github.com/aws/aws-sdk-ruby/blob/8a53163418da7273eb990740b78174c2480b5eef/gems/aws-sdk-core/lib/aws-sdk-core/refreshing_credentials.rb#L73) uses it to determine whether to create a thread for an “async refresh”. If the thread has already started and is refreshing, `locked?` will be `true` and no thread will be created:


```ruby
unless @mutex.locked?
  Thread.new do
    @mutex.synchronize do
      # refresh async
    end
  end
end
```
Last we have `sleep(timeout = nil)`, which releases the lock and sleeps for `timeout` seconds - or forever, if given `nil` - re-acquiring the lock when it wakes. This is exactly what the [`rack-timeout` gem uses](https://github.com/zombocom/rack-timeout/blob/main/lib/rack/timeout/support/scheduler.rb#L90) internally in its `Scheduler` class to schedule request timeouts:


```ruby
def initialize
  @mx_events = Mutex.new
  # ...
end
	
def run_loop!
  loop do # begin event reader loop
    @mx_events.synchronize {
      @events.reject!(&amp;:cancelled?)
      sleep_for = @events.map(&amp;:monotime).min
      @mx_events.sleep sleep_for
```
It acquires a lock using `synchronize` to safely operate on the `@events` array. It then finds the event with the shortest wait time, and `sleep`s the mutex for that period. That way other events can be added to the `@events` array using the appropriate lock, even while waiting. This supports the scheduler interface:


```ruby
require &#34;rack-timeout&#34;
	
Scheduler = Rack::Timeout::Scheduler
Scheduler.run_in(5) { puts &#34;I did a thing last!&#34; }
Scheduler.run_in(3) { puts &#34;whoop whoop, i&#39;m second&#34; }
Scheduler.run_in(1) { puts &#34;yowza, i&#39;m first&#34; }
# yowza, i&#39;m first
# whoop whoop, i&#39;m second
# I did a thing last!
```
When you call `run_in`, a new event is appended to the `@events` array. `run` is called on the thread with the `sleep`ing mutex, which causes `@mx_events.sleep` to wake up and `run_loop!` to iterate again, firing any due events and then waiting on the shortest event duration again using `@mx_events.sleep`.


```ruby
def schedule(event)
  @mx_events.synchronize { @events &lt;&lt; event }
  runner.run  # wakes up the runner thread so it can recalculate sleep length taking this new event into consideration
  return event
end
```
Similar to the principle of a database lock, the shorter you can keep the mutex lock the better for performance.
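

In practice that means doing the slow work outside the critical section and only holding the lock for the shared write. A sketch, reusing the httpbin endpoint from earlier as stand-in slow work:


```ruby
require &#34;net/http&#34;

cache = {}
mutex = Mutex.new
# stand-in for some expensive call
fetch = -&gt;(key) { Net::HTTP.get(URI(&#34;https://httpbin.org/uuid&#34;)) }

# slower: the lock is held for the entire expensive call
mutex.synchronize { cache[:a] = fetch.call(:a) }

# better: only the shared write is guarded
result = fetch.call(:b)
mutex.synchronize { cache[:b] = result }
```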


&lt;h4 id=&#34;condition-variable&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/conditionvariable&#34;&gt;&lt;code&gt;ConditionVariable&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


Similar to `Mutex#sleep`, a `ConditionVariable`’s purpose is to let you release a lock and sleep - you do that using the `wait` method. The difference is that it provides a direct communication mechanism for waking up: `signal` and `broadcast`. Let’s look at a small example - in it, the `wait` won’t re-acquire the lock until we call `signal`:


```ruby
mutex = Mutex.new
condition = ConditionVariable.new
t = Thread.new do
  mutex.synchronize do
    puts &#34;hi!&#34;
    condition.wait(mutex)
    puts &#34;bye!&#34;
  end
end
t.join(5)
puts &#34;how are you?&#34;
t.join(1)
puts &#34;still waiting?&#34;
condition.signal
t.join
# hi!
# how are you?
# still waiting?
# bye!
```
`signal` will only notify a single thread, whereas `broadcast` will notify all threads. Let’s create two threads and try `signal` with them:

```ruby
def waiter(mutex, condition)
  Thread.new do
    mutex.synchronize do
      puts &#34;hi!&#34;
      condition.wait(mutex)
      puts &#34;bye!&#34;
    end
  end
end
	
mutex = Mutex.new
condition = ConditionVariable.new
t = waiter(mutex, condition)
t2 = waiter(mutex, condition)
	
t.join(5)
puts &#34;how are you?&#34;
t2.join(1)
puts &#34;still waiting?&#34;
	
condition.signal
sleep 1
# hi!
# hi!
# how are you?
# still waiting?
# bye!
```
We never see the second thread say “bye!”, because only a single `signal` call has been made. A `signal` call attempts to wake up a single thread. If you try to join the second thread, Ruby will detect a deadlock condition because the `wait` will never finish:

```ruby
condition.signal
t.join
t2.join
# hi!
# hi!
# how are you?
# still waiting?
# bye!
# main.rb:in `join&#39;: No live threads left. Deadlock? (fatal)
# 2 threads, 2 sleeps current:0x0000000001df10f0 main thread:0x0000000001df10f0
# * #&lt;Thread:0x00007f9373bba9c8 sleep_forever&gt;
#    rb_thread_t:0x0000000001df10f0 native:0x00007f938d474300 int:0   
# * #&lt;Thread:0x00007f9371c124b8 main.rb:112 sleep_forever&gt;
#    rb_thread_t:0x00000000026ac400 native:0x00007f9371ace6c0 int:0
#     depended by: tb_thread_id:0x0000000001df10f0
```
There are probably cases where `signal` makes sense - only one thread can acquire the lock at a time, so it’s cheaper to wake up a single thread than to wake up every thread. But `broadcast` covers more scenarios, and fixes our two-thread example:

```ruby
condition.broadcast
t.join
t2.join
# hi!
# hi!
# how are you?
# still waiting?
# bye!
# bye!
```
A `ConditionVariable` can only `wait` on a locked mutex:

```ruby
mutex = Mutex.new
condition = ConditionVariable.new
t = Thread.new do
  condition.wait(mutex)
end
sleep 1
puts thread_status(t)
#&lt;ThreadStatus status=&#34;failed w/ error: Attempt to lock a mutex which is unlocked&#34;, error=#&lt;ThreadError:...&gt;
```
And only for the thread that owns the mutex:

```ruby
mutex = Mutex.new
condition = ConditionVariable.new
t = nil
mutex.synchronize do
  t = Thread.new do
    condition.wait(mutex)
  end
  sleep 1
end
puts thread_status(t)
#&lt;ThreadStatus status=&#34;failed w/ error: Attempt to unlock a mutex which is locked by another thread/fiber&#34;, error=#&lt;ThreadError:...&gt;
```
We can use `ConditionVariable#wait` with a timeout in seconds, so we can also recreate the `Mutex#sleep` `Scheduler` code from `rack-timeout` (in a more basic form):


```ruby
class Scheduler
  Schedule = Data.define(:block, :time)
	
  def initialize
    @mutex = Mutex.new
    @cond = ConditionVariable.new
    @schedules = []
    start
  end
	
  def start
    Thread.new do
      loop do
        @mutex.synchronize do
          schedule = @schedules.min_by { |s| s.time }
          if schedule
            now = Time.now
            sleep_duration = schedule.time - now
	
            if sleep_duration &gt; 0
              @cond.wait(@mutex, sleep_duration)
            end
	
            if Time.now &gt;= schedule.time
              schedule.block.call
              @schedules.delete(schedule)
            end
          else
            @cond.wait(@mutex)
          end
        end
      end
    end
  end
	
  def schedule(seconds, &amp;block)
    @mutex.synchronize do
      target_time = Time.now + seconds
      @schedules &lt;&lt; Schedule.new(block:, time: target_time)
      @cond.signal
    end
  end
end
	
s = Scheduler.new
puts Time.now
s.schedule(1) { puts &#34;1! #{Time.now}&#34; }
s.schedule(3) { puts &#34;2! #{Time.now}&#34; }
s.schedule(5) { puts &#34;3! #{Time.now}&#34; }
sleep
# 2024-08-26 21:58:23 +0000
# 1! 2024-08-26 21:58:24 +0000
# 2! 2024-08-26 21:58:26 +0000
# 3! 2024-08-26 21:58:28 +0000
```
In [Your Ruby programs are always multi-threaded: part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html), we looked at an example of coordinating threads using a “CountdownLatch”.


```ruby
class CountdownLatch
  def initialize(count)
    @count = count
    @mutex = Mutex.new
    @cond = ConditionVariable.new
  end
	
  def wait
    @mutex.synchronize do
      @cond.wait(@mutex) while @count &gt; 0
    end
  end
	
  def count
    @mutex.synchronize { @count }
  end
	
  def count_down
    @mutex.synchronize do
      @count -= 1
      if @count == 0
        @cond.broadcast
      end
      @count
    end
  end
end
```
&gt; 📝 Quick reminder that `concurrent-ruby` comes with a countdown latch so there’s no need to use this one ☝🏼. It’s just educational. It is very similar to the concurrent-ruby version though!
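
A quick usage sketch of the latch above - `wait` blocks the caller until both workers have counted down:

```ruby
latch = CountdownLatch.new(2)

2.times do |i|
  Thread.new do
    sleep i + 1 # simulate some work
    puts &#34;worker #{i} done&#34;
    latch.count_down
  end
end

latch.wait # blocks until @count reaches 0 and broadcast fires
puts &#34;all done&#34;
# worker 0 done
# worker 1 done
# all done
```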


We’ve seen how `wait` and `broadcast` work. But why do we need that `while @count &gt; 0` check? Shouldn’t it _only_ get woken up when `@count == 0` and `@cond.broadcast` is called? Unfortunately, `Mutex#sleep` and `ConditionVariable#wait` can wake up randomly due to something called [**Spurious wakeups**](https://en.wikipedia.org/wiki/Spurious_wakeup)[^3]. 


Thread runtimes can decide, for internal reasons, to wake up a waiting thread at random, so you should be ready to handle it - in our case we keep `wait`ing as long as the expected condition `@count &gt; 0` is still true. That way, if we’re woken up spuriously, we immediately `wait` on the condition again.
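
The general idiom, then, is to always wrap `wait` in a loop that re-checks its predicate. A minimal sketch:

```ruby
mutex = Mutex.new
cond = ConditionVariable.new
ready = false

waiter = Thread.new do
  mutex.synchronize do
    # loop, don&#39;t branch: a bare `cond.wait(mutex)` could return
    # early on a spurious wakeup
    cond.wait(mutex) until ready
    puts &#34;ready!&#34;
  end
end

mutex.synchronize do
  ready = true
  cond.broadcast
end
waiter.join
# ready!
```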


&lt;h4 id=&#34;monitor&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/monitor&#34;&gt;&lt;code&gt;Monitor&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


A `Monitor` is _essentially_ the same as a `Mutex`, but it is also “re-entrant”. What does it mean to be re-entrant? Let’s go back to an example from [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html):


```ruby
require &#34;monitor&#34;
	
class Result
  attr_accessor :value
end
	
class Fibonacci
  @fib_monitor = Monitor.new
	
  class &lt;&lt; self
    def result=(value)
      @fib_monitor.synchronize { @result = value }
    end
	
    def result
      @fib_monitor.synchronize { @result }
    end
	
    def calculate(n)
      @fib_monitor.synchronize do
        self.result = Result.new
        result.value = fib(n)
      end
    end
	
    def fib(n)
      return n if n &lt;= 1
	
      fib(n - 1) + fib(n - 2)
    end
  end
end
```
In it, when we call `#calculate`, internally it calls `#result` and `#result=`. `calculate` first acquires the lock using `synchronize`, then it calls `result=` which _also_ tries to acquire the lock. It is _re-entering_ the same lock. Let’s change `@fib_monitor` to a `Mutex` and see what happens:


```ruby
class Fibonacci
  @fib_monitor = Mutex.new
end
	
Fibonacci.calculate(10)
# `synchronize&#39;: deadlock; recursive locking (ThreadError)
```
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-mutex-re-entrant.drawio.png&#34; width=&#34;435&#34; height=&#34;151&#34; alt=&#34;&#34;&gt;


We immediately see an error raised: “deadlock; recursive locking”. By changing to a `Monitor`, everything works fine.


&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-thread-api-2-monitor.drawio.png&#34; width=&#34;435&#34; height=&#34;180&#34; alt=&#34;&#34;&gt;


The `redis-rb` gem creates clients that are thread-safe. It uses a `Monitor` to do that, likely because it allows `synchronize`d methods to call other `synchronize`d methods without any recursive deadlocking errors:


```ruby
class Redis
  def initialize(options = {})
    @monitor = Monitor.new
    # ...
    inherit_socket = @options.delete(:inherit_socket)
	
    @client = initialize_client(@options)
    @client.inherit_socket! if inherit_socket
  end
	
  def synchronize
    @monitor.synchronize { yield(@client) }
  end
	
  def send_command(command, &amp;block)
    @monitor.synchronize do
      @client.call_v(command, &amp;block)
    end
  rescue ::RedisClient::Error =&gt; error
    Client.translate_error!(error)
  end
	
  # lib/redis/commands/transactions.rb
  def multi
    synchronize do |client|
      client.multi do |raw_transaction|
        yield MultiConnection.new(raw_transaction)
      end
    end
  end
```
`Monitor` also comes with some conveniences for creating `ConditionVariable`s tied to the monitor itself, which you can read more about in [its documentation](https://rubyapi.org/3.3/o/monitor).
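
For example, `Monitor#new_cond` returns a condition variable bound to the monitor, with extras like `wait_while`/`wait_until` that handle the re-check loop for you. A small sketch:

```ruby
require &#34;monitor&#34;

monitor = Monitor.new
ready = monitor.new_cond # a condition variable bound to this monitor
buffer = []

consumer = Thread.new do
  monitor.synchronize do
    ready.wait_while { buffer.empty? } # releases the monitor while waiting
    puts buffer.shift
  end
end

monitor.synchronize do
  buffer &lt;&lt; &#34;hello&#34;
  ready.signal
end
consumer.join
# hello
```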


&lt;h4 id=&#34;queue&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/queue&#34;&gt;&lt;code&gt;Queue&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


`Queue` is one of two thread-safe data structures that come out of the box with Ruby. It is a first-in, first-out (FIFO) queue which allows for safe communication between threads. It’s primarily used for implementing producer/consumer patterns between threads:


```ruby
queue = Queue.new
producer = Thread.new do
  i = 0
  loop do
    queue &lt;&lt; i
    i += 1
    sleep 1
  end
end
	
def create_consumer(name, queue)
  Thread.new do
    loop do
      item = queue.pop
      puts &#34;#{name} got another item #{item} at #{Time.now}&#34;
    end
  end
end
	
create_consumer(&#34;Consumer 1&#34;, queue)
create_consumer(&#34;Consumer 2&#34;, queue)
	
producer.join
# Consumer 1 got another item 0 at 2024-08-24 20:58:46 +0000
# Consumer 2 got another item 1 at 2024-08-24 20:58:47 +0000
# Consumer 1 got another item 2 at 2024-08-24 20:58:48 +0000
# Consumer 2 got another item 3 at 2024-08-24 20:58:49 +0000
# Consumer 1 got another item 4 at 2024-08-24 20:58:50 +0000
# Consumer 2 got another item 5 at 2024-08-24 20:58:51 +0000
# Consumer 1 got another item 6 at 2024-08-24 20:58:52 +0000
# Consumer 2 got another item 7 at 2024-08-24 20:58:53 +0000
# Consumer 1 got another item 8 at 2024-08-24 20:58:54 +0000
# Consumer 2 got another item 9 at 2024-08-24 20:58:55 +0000
```
The `producer` thread endlessly adds an item to the queue every second, and two consumer threads `pop` items off the queue. If no item is available, `pop` puts the thread to sleep until one becomes available.


If it seems like you could pretty easily implement this using a `Mutex` and a `ConditionVariable`, you’d be right! Try it yourself, or peek at the sketch below.
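
Here’s one possible shape - a minimal sketch, nothing like the real implementation:

```ruby
class MiniQueue
  def initialize
    @items = []
    @mutex = Mutex.new
    @cond = ConditionVariable.new
  end

  def push(item)
    @mutex.synchronize do
      @items &lt;&lt; item
      @cond.signal # wake one waiting consumer, if any
    end
  end
  alias_method :&lt;&lt;, :push

  def pop
    @mutex.synchronize do
      # re-check on every wakeup, guarding against spurious wakeups
      @cond.wait(@mutex) while @items.empty?
      @items.shift
    end
  end
end
```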


&lt;h4 id=&#34;sized-queue&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread/sizedqueue&#34;&gt;&lt;code&gt;SizedQueue&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


`SizedQueue` is the other thread-safe data structure available in Ruby, and it’s just another flavor of the base `Queue` class. It allows you to create a fixed-size queue - when the queue is full, adding new items blocks until space is available.


```ruby
fixed_queue = SizedQueue.new(3)
fixed_queue &lt;&lt; 1
fixed_queue &lt;&lt; 2
fixed_queue &lt;&lt; 3
fixed_queue &lt;&lt; 4 # raises &#34;No live threads left. Deadlock?&#34;
```
This is a good use-case for throttling your own code to not overwhelm your application. The following code will block if more than 5 items exist in the queue, throttling the producers so consumers can keep up with it:


```ruby
throttle = SizedQueue.new(5)
	
# Producers
Thread.new do
  i = 0
  loop do
    i += 1
    if i.even?
      throttle &lt;&lt; &#34;Producer 1: #{i}&#34;
      Thread.pass
    end
  end
end
Thread.new do
  i = 0
  loop do
    i += 1
    if i.odd?
      throttle &lt;&lt; &#34;Producer 2: #{i}&#34;
      Thread.pass
    end
  end
end
	
# Consumers
Thread.new do
  loop do
    puts &#34;Thread 1: #{throttle.pop}&#34;
    sleep 1
  end
end
Thread.new do
  loop do
    puts &#34;Thread 2: #{throttle.pop}&#34;
    sleep 2
  end
end
	
loop do
  puts &#34;Threads waiting: #{throttle.num_waiting}&#34;
  sleep 0.5
end
	
# Threads waiting: 0
# Thread 1: Producer 1: 2
# Thread 2: Producer 2: 1
# Threads waiting: 2
# Thread 1: Producer 1: 4
# Threads waiting: 2
# Threads waiting: 1
# Thread 2: Producer 2: 3
# Thread 1: Producer 1: 6
# Threads waiting: 1
# Threads waiting: 2
# Thread 1: Producer 2: 5
# Threads waiting: 1
# Threads waiting: 2
# Thread 2: Producer 1: 8
# Thread 1: Producer 2: 7
```
You’ll see the number of waiting threads bounce between 1 and 2, as producers get blocked while consumers slowly consume data from the `SizedQueue`.


&lt;h4 id=&#34;thread-group&#34;&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/threadgroup&#34;&gt;&lt;code&gt;ThreadGroup&lt;/code&gt;&lt;/a&gt;&lt;/h4&gt;


The Ruby docs describe `ThreadGroup` as “a means of keeping track of a number of threads as a group.” A thread is automatically a part of the “default” group:


```ruby
t = Thread.new { loop {} }
puts t.group == ThreadGroup::Default # true
```
And you can add threads to a new group:


```ruby
t = Thread.new { loop {} }
t2 = Thread.new { sleep }
group = ThreadGroup.new
group.add(t)
group.add(t2)
puts group.list
#&lt;Thread:0x0...main.rb:1 run&gt;
#&lt;Thread:0x0...main.rb:2 sleep&gt;
```
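One niche capability worth knowing about is `enclose`, which locks the group’s membership so threads can no longer be explicitly added or removed:

```ruby
group = ThreadGroup.new
group.add(Thread.new { sleep })
group.enclose

group.add(Thread.new { sleep })
# raises ThreadError: can&#39;t move to the enclosed thread group
```
New threads started by a thread already in the group still join it, though - `enclose` only blocks explicit moves.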
I’m mentioning them here for completeness, but you almost _never_ see them in real use. See [their documentation](https://rubyapi.org/3.3/o/threadgroup) to learn more.


——


That’s about it! Out of the box, Ruby comes with a pretty small set of Thread primitives. Most code you might see in real use will use either these, or options from the `concurrent-ruby` gem. We’ll dig more into concurrent Ruby in “Abstracted, concurrent Ruby” later on in the series.


&lt;h3 id=&#34;memory-visibility&#34;&gt;Memory visibility&lt;/h3&gt;


The last thing we’ll discuss before finishing up is a concept called “memory visibility”.


The simplest way to think of memory visibility is “what each thread can see at any given time”. When threads are running on multiple CPUs, a common optimization is to localize data in the fast memory caches located on each CPU core. When that happens, two different threads can operate on a shared piece of data and have completely different views of it, because they each hold localized, out-of-sync versions.


In addition, the CPU can actually reorder certain operations to optimize them.


How can you solve these problems? When you need to make sure each thread sees a consistent, accurate version of a shared piece of data, you use something called a memory barrier. How can you use a memory barrier from Ruby? A mutex! As long as you wrap every access to a particular shared resource in the same mutex, you’re guaranteed to see a consistent view of it:


```ruby
class AtomicBoolean
  def initialize(default = false)
    @value = default
    @mutex = Mutex.new
  end
	
  def true?
    @mutex.synchronize { @value == true }
  end
	
  def false?
    @mutex.synchronize { @value == false }
  end
	
  def make_true
    @mutex.synchronize { @value = true } 
  end
	
  def make_false
    @mutex.synchronize { @value = false }
  end
end
```
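Usage is what you’d expect - every read and write funnels through the same mutex, so each thread sees the current value:

```ruby
flag = AtomicBoolean.new

Thread.new { flag.make_true }.join
puts flag.true? # true
```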
For data isolated to a single thread, or for immutable objects, memory visibility doesn’t really matter - which is another reason attempting to share mutable objects between threads can lead to issues, and avoiding sharing is better than trying to coordinate it safely.


&gt; 📝 In CRuby, memory visibility is _unlikely_ to ever be an issue for you. That’s because there is always a mutex involved when moving between threads: the GVL. We’ll talk more about the GVL in “Thread and its MaNy friends”. But to be on the cautious side, you’re better off consistently synchronizing reads and writes of any data shared between threads.


———


We’ve now dug into the majority of the Thread API. The main piece we’ve only touched on lightly is interrupting threads. It’s up next in “Interrupting Threads: Colorless, concurrent Ruby”. More soon! 👋🏼


[^1]:	There are a couple of other interfaces for creating a thread, but you basically never see them in use.


    a = 1


    b = 2


    Thread.new(a, b) { |a, b| puts a, b }


    👆 this one has been described in some places as making copies to keep things thread-safe - but that’s incorrect - it just passes the references along, so it doesn’t provide much value


[^2]:	You can handle signals in a couple ways that we’ll discuss later


[^3]:	Spurious: not being what it purports to be; false or fake
</source:markdown>
    </item>
    
    <item>
      <title>Ruby methods are colorless</title>
      <link>https://jpcamara.com/2024/07/16/ruby-methods-are-colorless.html</link>
      <pubDate>Tue, 16 Jul 2024 18:25:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/07/16/ruby-methods-are-colorless.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/aa82bee7-c210-489e-9b5c-b1fa798f2d8d.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is part of series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into 12 main parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Ruby methods are colorless&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “Ruby methods are colorless”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#some-context&#34;&gt;First, some context&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#function-colors&#34;&gt;Function colors&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#paved-with-callbacks&#34;&gt;The road to hell is paved with callbacks&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#invasive-async-await&#34;&gt;Invasive async/await&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#what-makes-ruby-colorless&#34;&gt;What makes Ruby colorless?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#nested-concurrency&#34;&gt;A nested concurrency model 🪆&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#threads-and-fibers&#34;&gt;Colorless threads 🧵 and Fibers 🌾 &lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#to-be-concurrent&#34;&gt;What does it mean to be concurrent?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#colorless-calls&#34;&gt;Colorless calls&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#async-aside&#34;&gt;A quick aside on Async&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#digging-deep&#34;&gt;Digging deep&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#ruby-history-corner&#34;&gt;PS: A historical sidenote - Ruby had its own callback phase&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 &lt;strong&gt;A quick note on Ruby runtimes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;I’m approaching this series from the perspective of CRuby. This is the original and most popular/common Ruby runtime. I’m also focusing primarily on Ruby 3.2/3.3+. This matters because it informs how things like Threads and Fibers behave.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id=&#34;some-context&#34;&gt;First, some context&lt;/h2&gt;
&lt;p&gt;Last year I &lt;a href=&#34;https://twitter.com/ThePrimeagen/status/1689289653680033792&#34;&gt;read a tweet&lt;/a&gt; from &lt;a href=&#34;https://twitter.com/ThePrimeagen&#34;&gt;@ThePrimeagen&lt;/a&gt; about his experience with &lt;a href=&#34;https://go.dev/&#34;&gt;Go&lt;/a&gt;&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;. In it he mentioned some strengths and weaknesses (cheekily), and then he mentioned a phrase I’d never heard before: “colorless functions”.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/03867f06ca.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;real talk, colorless functions [are] amazing and I think that most people don’t realize how great it really is&lt;/p&gt;
&lt;p&gt;&amp;ndash; @ThePrimeagen&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&#34;function-colors&#34;&gt;Function colors&lt;/h3&gt;
&lt;p&gt;Curious to understand what a “colorless function” is, a quick search returned this article as the first result: &lt;a href=&#34;https://journal.stuffwithstuff.com/2015/02/01/what-color-is-your-function/&#34;&gt;What color is your function?&lt;/a&gt;&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;In it, &lt;a href=&#34;https://twitter.com/munificentbob&#34;&gt;Bob Nystrom&lt;/a&gt; goes through a fictional language (spoiler: it’s Javascript/node.js) that breaks functions down into one of two “colors”:&lt;/p&gt;
&lt;blockquote&gt;
&lt;ol&gt;
&lt;li&gt;Every function has a color&lt;/li&gt;
&lt;li&gt;The way you call a function depends on its color&lt;/li&gt;
&lt;li&gt;You can only call a red function from within a red function&lt;/li&gt;
&lt;li&gt;Red functions are more painful to call&lt;/li&gt;
&lt;li&gt;Some core library functions are red&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;It’s an allegory&amp;hellip; red functions are asynchronous ones&lt;/p&gt;
&lt;p&gt;&amp;ndash; Nystrom&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Red and blue functions equate to asynchronous and synchronous functions. And having to indicate the color of your function infects the rest of your code and the API decisions you make.&lt;/p&gt;
&lt;h3 id=&#34;paved-with-callbacks&#34;&gt;The road to hell is paved with callbacks&lt;/h3&gt;
&lt;p&gt;Nystrom’s article was written in 2015 while Javascript/node was still in its &lt;a href=&#34;http://callbackhell.com/&#34;&gt;callback hell&lt;/a&gt;&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt; phase. Any operation that could block, like reading a file, making an HTTP call, or querying a database had to be done using a callback. Callbacks proliferated throughout your code - so asynchronous, red functions were plentiful and painful to interact with.&lt;/p&gt;
&lt;p&gt;In JavaScript, everything happens in a single-ish threaded context called the “event loop”. You can’t block the loop - so you would set a callback to know when a blocking call completed:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-js&#34; data-lang=&#34;js&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;readFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;path&lt;/span&gt;, (&lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) =&amp;gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) { &lt;span style=&#34;color:#66d9ef&#34;&gt;throw&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;&amp;#39;&lt;/span&gt;; }
  &lt;span style=&#34;color:#a6e22e&#34;&gt;saveFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt;, (&lt;span style=&#34;color:#a6e22e&#34;&gt;file&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) =&amp;gt; {
    &lt;span style=&#34;color:#75715e&#34;&gt;// and on and on...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;  });
});
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;After callbacks, he alludes to what would be the subsequent &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Using_promises&#34;&gt;Promises&lt;/a&gt; phase of Javascript/node.js with his section “&lt;em&gt;I promise the future is better&lt;/em&gt;”. An ergonomic improvement to callbacks but still cumbersome, and does not solve most red/blue pain points:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-js&#34; data-lang=&#34;js&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;readFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;path&lt;/span&gt;)
  .&lt;span style=&#34;color:#a6e22e&#34;&gt;then&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt;) =&amp;gt; {
    &lt;span style=&#34;color:#a6e22e&#34;&gt;saveFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt;)
      .&lt;span style=&#34;color:#a6e22e&#34;&gt;then&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;file&lt;/span&gt;) =&amp;gt; {})
      .&lt;span style=&#34;color:#66d9ef&#34;&gt;catch&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) =&amp;gt; {})
  })
  .&lt;span style=&#34;color:#66d9ef&#34;&gt;catch&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) =&amp;gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;error&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;));
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Finally he describes &lt;a href=&#34;https://javascript.info/async-await&#34;&gt;the &lt;code&gt;async/await&lt;/code&gt; phase&lt;/a&gt; in “&lt;em&gt;I’m awaiting a solution&lt;/em&gt;”, which leads us to modern Javascript:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-js&#34; data-lang=&#34;js&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;try&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;readFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;path&lt;/span&gt;);
} &lt;span style=&#34;color:#66d9ef&#34;&gt;catch&lt;/span&gt; (&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;error&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;);
}
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;try&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;file&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;saveFile&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;content&lt;/span&gt;);
} &lt;span style=&#34;color:#66d9ef&#34;&gt;catch&lt;/span&gt; (&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;error&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;invasive-async-await&#34;&gt;Invasive async/await&lt;/h3&gt;
&lt;p&gt;I’ll edit “&lt;em&gt;I’m awaiting a solution&lt;/em&gt;” for modern JS, where he notes:&lt;/p&gt;
&lt;blockquote&gt;
&lt;ol&gt;
&lt;li&gt;Synchronous functions return values, async ones return &lt;code&gt;Promise&amp;lt;T&amp;gt;&lt;/code&gt; &lt;em&gt;(Promise containing type T)&lt;/em&gt; wrappers around the value.&lt;/li&gt;
&lt;li&gt;Sync functions are just called, async ones need an &lt;code&gt;await&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;If you call an async function you’ve got this wrapper object when you actually want the &lt;code&gt;T&lt;/code&gt;. You can’t unwrap it unless you make &lt;em&gt;your&lt;/em&gt; function async and await it.”&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;ndash; Nystrom&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;#3 is the particularly infectious bit. Async code bubbles all the way to the top. If you want to use &lt;code&gt;await&lt;/code&gt;, then you have to mark your function as &lt;code&gt;async&lt;/code&gt;. Then if someone else calling your function wants to use &lt;code&gt;await&lt;/code&gt;, they &lt;em&gt;also&lt;/em&gt; have to mark themselves as &lt;code&gt;async&lt;/code&gt;, on and on until the root of the call chain. If at any point you don’t then you have to use the &lt;code&gt;async&lt;/code&gt; result (in JavaScript’s case a &lt;code&gt;Promise&amp;lt;T&amp;gt;&lt;/code&gt;).&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-js&#34; data-lang=&#34;js&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;readFile&lt;/span&gt;()&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; Promise&lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;string&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;read&lt;/span&gt;();
}
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;iCallReadFile&lt;/span&gt;()&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; Promise&lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;string&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;readFile&lt;/span&gt;();
}
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;iCallICallReadFile&lt;/span&gt;()&lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; Promise&lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;string&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;iCallReadFile&lt;/span&gt;();
}
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;iGiveUp&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;callback&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;iCallICallReadFile&lt;/span&gt;()
    .&lt;span style=&#34;color:#a6e22e&#34;&gt;then&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;callback&lt;/span&gt;)
    .&lt;span style=&#34;color:#66d9ef&#34;&gt;catch&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;err&lt;/span&gt;) =&amp;gt; {})
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Once your language is broken down into async and synchronous functions, you can’t escape it.&lt;/p&gt;
&lt;p&gt;Even more onerous, if it isn’t built into your language core like JavaScript/node.js, adding it later means modifying your entire runtime, libraries and codebases to understand it. This was (still can be?) a painful transition for languages like Python and Rust.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rust&#34; data-lang=&#34;rust&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// et tu, Rust?
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;fn&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;main&lt;/span&gt;() {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; content &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; read_file().&lt;span style=&#34;color:#66d9ef&#34;&gt;await&lt;/span&gt;;
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h2 id=&#34;what-makes-ruby-colorless&#34;&gt;What makes Ruby colorless?&lt;/h2&gt;
&lt;p&gt;So what does a “blue” (synchronous) method look like in Ruby?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;file &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;File&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read(path)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;And what does that same “red” (asynchronous) call look like?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;file &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;File&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;read(path)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Wait. Is that right? Surely there must be more nuance?&lt;/p&gt;
&lt;p&gt;HTTP?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;response &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Net&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;HTTP&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;get(
  &lt;span style=&#34;color:#66d9ef&#34;&gt;URI&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://example.com&amp;#34;&lt;/span&gt;)
)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Shell calls!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;output &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`cat file.txt`&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Process spawning??&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;pid &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fork &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;# good stuff&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Status&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait pid
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Mutex locks?!?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;mutex&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;synchronize {}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;DNS resolution!??!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;ip &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Resolv&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;getaddress &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;ruby-lang.org&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Sleep!?!!!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;😮‍💨😮‍💨😮‍💨&lt;/p&gt;
&lt;p&gt;Well that’s nice! There is no difference between asynchronous and synchronous methods, so no color. We get asynchronous behavior for free. Blog post over!?&lt;/p&gt;
&lt;p&gt;Well&amp;hellip; not &lt;em&gt;exactly&lt;/em&gt;. We still don’t know &lt;em&gt;how&lt;/em&gt; we get that behavior.&lt;/p&gt;
&lt;p&gt;In the section “&lt;em&gt;What language isn’t colored?&lt;/em&gt;” Nystrom specifically mentions Ruby being one of a few colorless languages&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&amp;hellip;languages that don’t have this problem: [Java,] Go, Lua, and Ruby.&lt;/p&gt;
&lt;p&gt;Any guess what they have in common?&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Threads&lt;/em&gt;. Or, more precisely: multiple independent callstacks that can be switched between. It isn’t strictly necessary for them to be operating system threads. Goroutines in Go, coroutines in Lua, and fibers in Ruby are perfectly adequate.&lt;/p&gt;
&lt;p&gt;&amp;ndash; Nystrom&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So Ruby is colorless, and to be colorless you need independent call stacks that you can switch between. And apparently Threads and Fibers can give you that.&lt;/p&gt;
&lt;p&gt;What does that mean and what does that enable? And since Nystrom’s article was written in 2015 - is there anything new to Ruby since then?&lt;/p&gt;
&lt;h3 id=&#34;nested-concurrency&#34;&gt;A nested concurrency model 🪆&lt;/h3&gt;
&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;, we briefly discussed that Ruby has a layered concurrency model and that the best way to describe it is as a nesting doll - as you pull away the outer layers, you find new layers of concurrency inside.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/a44a601478.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;Here’s my key for mapping icons to concurrency&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;💎 represents a Ruby process, which is essentially another instance of Ruby itself&lt;/li&gt;
&lt;li&gt;🦖 represents a Ruby Ractor. Ractor stands for Ruby Actor, as in the Actor model. But I always think of the word “Raptor” instead, so I represent it with a Dinosaur 🤷🏻‍♂️&lt;/li&gt;
&lt;li&gt;🧵 represents a Ruby Thread. It’s a thread icon, for obvious reasons&lt;/li&gt;
&lt;li&gt;🌾 represents a Ruby Fiber. I think of wheat when I think of Fibers, hence 🌾&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;p&gt;As a refresher, in your own Ruby code, you can easily inspect each layer directly:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Process &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;pid&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;  Ractor &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Ractor&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;    Thread &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;      Fiber &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Process 123&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#   Ractor #&amp;lt;Ractor:#1 running&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#     Thread #&amp;lt;Thread:0x0... run&amp;gt;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;#       Fiber #&amp;lt;Fiber:0x0... (resumed)&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Your code is always operating within a specific instance of each layer. Mostly you can access those instances with a call to &lt;code&gt;current&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;Each layer has unique characteristics when it comes to running concurrent code. Some can operate in parallel, executing independent Ruby code at the exact same time. Others run your Ruby code concurrently, moving back and forth between them but never running simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/1f6316498b.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;source&lt;/em&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The outermost layer is a &lt;strong&gt;Process 💎&lt;/strong&gt;. Every program on every common operating system (Linux, Windows, macOS) runs inside a process - it’s the top-level unit of execution in the OS. You can have as many processes as your OS will allow - but the number of CPU cores determine how many processes can run in parallel. Multiple Ruby processes can run in parallel but they live in isolated memory spaces - they have no simple way of communicating with each other.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;pid &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fork &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Status&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait pid
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Internally, a Ruby process is made up of &lt;strong&gt;Ractors 🦖&lt;/strong&gt;. Ractors don’t have a direct parallel in the OS - they’re a unique Ruby structure which wrap native OS threads&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt;. Multiple Ractors can run Ruby code in parallel, and they have strict sharing rules to keep them memory safe - so you can only communicate between them by passing messages. Those messages can copy data, or they can move objects completely between Ractors and change ownership. Ractors &lt;em&gt;sound&lt;/em&gt; pretty nice, but are still experimental. Regardless, every Ruby process starts out with a single Ractor that your Ruby code runs inside of.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;ractor &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Ractor&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
 &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
ractor&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;take
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Inside of Ractors, you have &lt;strong&gt;Threads 🧵&lt;/strong&gt;. Threads are created 1:1 with threads on your operating system&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;. These threads share the same memory space with each other, and every thread created within a process shares the same memory space as the process, and spends its lifetime there. Because threads share the same memory space they have to be carefully coordinated to safely manage state. Ruby threads cannot run CPU-bound Ruby code in parallel, but they can parallelize for blocking operations. Threads run &lt;em&gt;preemptively&lt;/em&gt;, which means they can swap out of your code at any time, and move to other Ruby code. Every Ruby Ractor starts out with a single thread that your Ruby code runs inside of.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;thread &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
thread&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Inside of threads, you have &lt;strong&gt;Fibers 🌾&lt;/strong&gt;. Fibers are another structure with no direct OS equivalent (though they have parallels in other languages). Fibers are a tool intended for lightweight, cooperative concurrency. Like Ruby threads, Fibers share a memory space, but their coordination is more deterministic. Also like Ruby threads, Fibers cannot run CPU-bound Ruby code in parallel, but using a FiberScheduler can parallelize for blocking operations. Every Ruby thread starts out with a single fiber that your Ruby code runs inside of.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;fiber &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
fiber&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;resume &lt;span style=&#34;color:#75715e&#34;&gt;# transfer / Fiber.schedule&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;When a Ruby program starts, all four layers start with it. The &lt;strong&gt;Process 💎&lt;/strong&gt; starts, it creates a &lt;strong&gt;Ractor 🦖&lt;/strong&gt;, which creates a &lt;strong&gt;Thread 🧵&lt;/strong&gt;, which creates a &lt;strong&gt;Fiber 🌾&lt;/strong&gt;. Ultimately, your Ruby code is in the context of all 4, but most specifically the fiber. Each layer is always active in a Ruby program whether you use them directly or not&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;. We can utilize each in unique ways as well.&lt;/p&gt;
&lt;p&gt;That’s a lot! In subsequent posts you’ll learn why each layer exists and the values each provides. Every layer will get some discussion - but as it relates to asynchronous, colorless programming, threads and fibers are the most relevant.&lt;/p&gt;
&lt;h3 id=&#34;threads-and-fibers&#34;&gt;Colorless threads 🧵 and fibers 🌾&lt;/h3&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/group-83-1.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;source&lt;/em&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/thread&#34;&gt;Threads&lt;/a&gt; 🧵 are one of the oldest and most common units of concurrency in Ruby. They’re the OG for splitting up concurrent chunks of work. They’ve been in Ruby since Ruby 1.1&lt;sup id=&#34;fnref:8&#34;&gt;&lt;a href=&#34;#fn:8&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;8&lt;/a&gt;&lt;/sup&gt; and have enabled colorless calls for many years.&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://rubyapi.org/3.3/o/fiber&#34;&gt;Fibers&lt;/a&gt; 🌾 aren’t too young themselves, but are about 10 years younger than their older sibling Threads. They were released in Ruby 1.9, but remained a bit of an esoteric option for a long time. They offered a form of concurrency, but for most use-cases it didn’t really help anyone - unlike Threads, they didn’t enable colorless programming at first.&lt;/p&gt;
&lt;p&gt;But ever since Ruby 3, Fibers have been given superpowers in the form of the FiberScheduler. By using a FiberScheduler, Fibers can now provide seamless, colorless programming.&lt;/p&gt;
&lt;p&gt;So both threads and fibers offer colorless, concurrent programming. We’ve got a definition for colorless, but what about concurrency?&lt;/p&gt;
&lt;h3 id=&#34;to-be-concurrent&#34;&gt;What does it mean to be concurrent?&lt;/h3&gt;
&lt;p&gt;The most popular source for a definition of concurrency is from the creator of Go, Rob Pike, in his talk &lt;a href=&#34;https://youtu.be/oV9rvDllKEg?si=L3aitDpilMRk5Bfn&#34;&gt;concurrency is not parallelism&lt;/a&gt;. Another member of the Go team summarized it:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;When people hear the word concurrency they often think of parallelism, a related but quite distinct concept. In programming, &lt;strong&gt;concurrency is the composition of independently executing processes&lt;/strong&gt;, while parallelism is the simultaneous execution of (possibly related) computations. &lt;strong&gt;Concurrency is about dealing with lots of things at once. Parallelism is about doing lots of things at once.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: &lt;a href=&#34;https://go.dev/blog/waza-talk&#34;&gt;https://go.dev/blog/waza-talk&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;They describe it as a way of composing tasks. It allows you to spread out and coordinate work. How that work is ultimately executed is not relevant to whether the code is concurrent or not.&lt;/p&gt;
&lt;p&gt;Look at the activity monitor for your OS right now. If you were to count the number of processes and threads, it would vastly exceed the number of available cores. And yet everything still chugs along smoothly. That’s concurrency in action.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/activity-monitor-final.png&#34; width=&#34;60%&#34; height=&#34;60%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;There are over 200 threads associated with just these 5 apps. I don’t have 200 CPU cores, so only a handful of them could ever be active in parallel. But all of them &lt;em&gt;could&lt;/em&gt; operate &lt;em&gt;concurrently&lt;/em&gt;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Having said all that - we do &lt;em&gt;want&lt;/em&gt; our work to happen as parallel as possible in most cases. We want to maximize our resources and parallelize as much as we can.&lt;/p&gt;
&lt;p&gt;In Rob Pike’s concurrency talk, he used the Go mascot, the gopher, to demonstrate how his task (incinerating obsolete language manuals 😅) was composed, and how it could be run in parallel, or with one gopher at a time, but it was still concurrent:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/draggedimage.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: &lt;a href=&#34;https://go.dev/talks/2012/waza.slide&#34;&gt;https://go.dev/talks/2012/waza.slide&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Go’s gophers are kind of fun. Let’s use some concurrency mascots for Ruby to show our own example.&lt;/p&gt;
&lt;h3 id=&#34;process&#34;&gt;Process!&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/process.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;ractor&#34;&gt;Ractor!&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ractor.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;thread&#34;&gt;Thread!&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/thread.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;fiber&#34;&gt;Fiber!&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/fiber.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;cpu&#34;&gt;CPU!&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/cpu.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;source&lt;/em&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;As a baseline, Processes and Ractors can run in parallel for as many cores as are available:&lt;/p&gt;
&lt;p&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-2.png&#34; width=&#34;488&#34; height=&#34;283&#34; alt=&#34;&#34;&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio.png&#34; width=&#34;488&#34; height=&#34;273&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;But Threads and Fibers, while concurrent, effectively only parallelize against one core at a time&lt;sup id=&#34;fnref:9&#34;&gt;&lt;a href=&#34;#fn:9&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;9&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-1.png&#34; width=&#34;488&#34; height=&#34;365&#34; alt=&#34;&#34;&gt;
&lt;p&gt;What happens when we start scaling Processes and Ractors past the number of available cores?&lt;/p&gt;
&lt;p&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-3.png&#34; width=&#34;600&#34; height=&#34;252&#34; alt=&#34;&#34;&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-4.png&#34; width=&#34;600&#34; height=&#34;245&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Once we exceed the ability to parallelize, we are concurrent in the same way as Threads and Fibers! This model allows our units of concurrency to adapt to the environment - when cores are available they can run in parallel, and when cores are not available they can swap between each other, transparently to the program itself. Every program eventually becomes concurrent as it scales, at least for CPU-bound processing.&lt;/p&gt;
&lt;p&gt;But this leaves Threads and Fibers looking pretty limited. They allow you to break up your work into independent, interleaving tasks, operating concurrently on Ruby code. But what good is that since they &lt;em&gt;never&lt;/em&gt; operate in parallel&amp;hellip; or do they?&lt;/p&gt;
&lt;h3 id=&#34;colorless-calls&#34;&gt;Colorless calls&lt;/h3&gt;
&lt;p&gt;How does this all relate to colorless methods?&lt;/p&gt;
&lt;p&gt;Let’s see with an example: we’re traversing a few different websites we care about, and we want to retrieve them as efficiently as possible.&lt;/p&gt;
&lt;p&gt;First we try it with threads. For demonstration purposes, we use the &lt;code&gt;httpbin.org/delay&lt;/code&gt; endpoint to simulate delays in responses.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;net/http&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;log_then_get&lt;/span&gt;(url)
  puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Requesting &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;url&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;...&amp;#34;&lt;/span&gt;
  get(url)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get&lt;/span&gt;(uri)
  response &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Net&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;HTTP&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;get(uri)
  puts caller(&lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;join(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;&lt;span style=&#34;color:#ae81ff&#34;&gt;\n&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;)
  response
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get_http_thread&lt;/span&gt;(url)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    log_then_get(&lt;span style=&#34;color:#66d9ef&#34;&gt;URI&lt;/span&gt;(url))
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get_http_via_threads&lt;/span&gt;
  threads &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[]&lt;/span&gt;
  threads &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_thread(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=1&amp;#34;&lt;/span&gt;
  )
  threads &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_thread(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=2&amp;#34;&lt;/span&gt;
  )
  threads &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_thread(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=3&amp;#34;&lt;/span&gt;
  )
  threads &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_thread(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=4&amp;#34;&lt;/span&gt;
  )
  threads&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:value&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
now &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now
get_http_via_threads
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Thread runtime: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;This code is doing a few things:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;It’s split into multiple methods to create a callstack (effectively, a backtrace). This callstack allows us to demonstrate that we maintain the position and state of each method call, even when we switch between threads.&lt;/li&gt;
&lt;li&gt;It creates four threads and appends them to an array. After those threads are initialized they go into a ready state, available to be run by the thread scheduler.&lt;/li&gt;
&lt;li&gt;It calls &lt;code&gt;value&lt;/code&gt; on each thread. This blocks the program until each thread finishes and returns the last value returned from the thread. Once we block the main thread with &lt;code&gt;value&lt;/code&gt;, the other threads have an immediate opportunity to start running. In this case we block until each thread finishes making an HTTP call (see the short sketch after this list).&lt;/li&gt;
&lt;/ol&gt;
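&lt;p&gt;If the semantics of &lt;code&gt;value&lt;/code&gt; are unfamiliar, here’s a minimal, standalone sketch (with &lt;code&gt;sleep&lt;/code&gt; standing in for a blocking call) of how it joins a thread and returns the block’s final expression:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;t = Thread.new do
  sleep 1        # stands in for any blocking call
  &amp;#34;last value&amp;#34;   # the final expression becomes the thread value
end
puts t.value     # blocks for ~1 second, then prints &amp;#34;last value&amp;#34;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Running the threaded example above produces the following output:&lt;/p&gt;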
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# &amp;gt; bundle exec ruby main.rb&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=1...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=2...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=3...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=4...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:18:in `block in get_http_thread&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:18:in `block in get_http_thread&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:18:in `block in get_http_thread&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:18:in `block in get_http_thread&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread runtime: 3.340238554&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Second we try it with fibers:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;net/http&amp;#34;&lt;/span&gt;
require &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;async&amp;#34;&lt;/span&gt;
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Same #log_then_get&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Same #get&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get_http_fiber&lt;/span&gt;(url, responses)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;schedule &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    responses &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; log_then_get(&lt;span style=&#34;color:#66d9ef&#34;&gt;URI&lt;/span&gt;(url))
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get_http_via_fibers&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;set_scheduler(&lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new)
  responses &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[]&lt;/span&gt;
  responses &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_fiber(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=1&amp;#34;&lt;/span&gt;, responses
  )
  responses &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_fiber(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=2&amp;#34;&lt;/span&gt;, responses
  )
  responses &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_fiber(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=3&amp;#34;&lt;/span&gt;, responses
  )
  responses &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_fiber(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;https://httpbin.org/delay/3?ex=4&amp;#34;&lt;/span&gt;, responses
  )
  responses
&lt;span style=&#34;color:#66d9ef&#34;&gt;ensure&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;set_scheduler(&lt;span style=&#34;color:#66d9ef&#34;&gt;nil&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
	
now &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now
get_http_via_fibers
puts &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;Fiber runtime: &lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;#{&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Time&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;now &lt;span style=&#34;color:#f92672&#34;&gt;-&lt;/span&gt; now&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Same as before, this code is doing a few things:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;We are split into multiple methods to create a callstack. Same as with threads, this callstack allows us to demonstrate that we maintain the position and state of each method call, even when we switch between fibers.&lt;/li&gt;
&lt;li&gt;It schedules four Fibers. We append to an array inside the &lt;code&gt;Fiber.schedule&lt;/code&gt; to get the result. Unlike threads, a scheduled fiber will start running its block immediately.&lt;/li&gt;
&lt;li&gt;The fibers will coordinate between themselves and the current fiber. When we unset the FiberScheduler using &lt;code&gt;set_scheduler(nil)&lt;/code&gt; in our &lt;code&gt;ensure&lt;/code&gt;, the &lt;code&gt;Async::Scheduler&lt;/code&gt; makes sure all fibers have finished running before returning (see the short sketch after this list).&lt;/li&gt;
&lt;/ol&gt;
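&lt;p&gt;As a minimal sketch of those scheduler semantics (with &lt;code&gt;sleep&lt;/code&gt; standing in for the HTTP calls): two scheduled fibers park on their blocking calls and overlap, so the total runtime is roughly the longest sleep, not the sum:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;require &amp;#34;async&amp;#34;
	
Fiber.set_scheduler(Async::Scheduler.new)
start = Time.now
Fiber.schedule { sleep 2 } # starts running immediately...
Fiber.schedule { sleep 2 } # ...and so does this one
Fiber.set_scheduler(nil)   # waits for both fibers to finish
puts Time.now - start      # ~2 seconds, not ~4
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;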
&lt;p&gt;Running the fiber example above results in the following, similar backtrace:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# &amp;gt; bundle exec ruby main.rb&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# &lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=1...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=2...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=3...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Requesting https://httpbin.org/delay/3?ex=4...&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:24:in `block in get_http_fiber&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:160:in `block in run&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:24:in `block in get_http_fiber&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:160:in `block in run&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:24:in `block in get_http_fiber&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:160:in `block in run&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:12:in `get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:7:in `log_then_get&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# main.rb:24:in `block in get_http_fiber&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:160:in `block in run&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&amp;#39;&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Fiber runtime: 3.291355669&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;If we visualize what’s happening, we see how things are getting coordinated. It’s nearly identical between the two:&lt;/p&gt;
&lt;p&gt;Threads:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Fibers:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-1.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;👆🏼This diagram does look pretty sequential - that’s odd huh?&lt;/p&gt;
&lt;p&gt;And here is what happens once we are waiting for a response:&lt;/p&gt;
&lt;p&gt;Threads:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-3.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Fibers:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-2.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;👆🏼Oh, that seems better!&lt;/p&gt;
&lt;p&gt;Running these HTTP methods is the same whether I run them with no threads/fibers, 4 threads/fibers, or 400 threads/fibers. The code never changes - the execution context is all we need to start operating asynchronously. &lt;em&gt;Something&lt;/em&gt; is coordinating for us behind-the-scenes, and we’re able to handle our HTTP calls in parallel 🙌🏼.&lt;/p&gt;
&lt;p&gt;This lines up with our timing. In each example we ran 4 HTTP calls, all guaranteed to block for around 3 seconds. Despite that, each example finished in roughly 3 seconds, rather than the 12 seconds it would take if they had run sequentially.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# Fiber runtime: 3.291355669&lt;/span&gt;
&lt;span style=&#34;color:#75715e&#34;&gt;# Thread runtime: 3.340238554&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;To paraphrase Bob Nystrom’s article one last time, we can apply his original points about Go almost perfectly to Ruby threads/fibers:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;As soon as you do any IO operation, it just parks that thread/fiber and resumes any other one that isn’t blocked on IO.&lt;/p&gt;
&lt;p&gt;If you look at the IO operations in the standard library, they seem synchronous. In other words, they just do work and then return a result when they are done. But it&amp;rsquo;s not that they&amp;rsquo;re synchronous in the sense that it would mean in JavaScript. Other Ruby code can run while one of these operations is pending. It&amp;rsquo;s that Ruby has eliminated the distinction between synchronous and asynchronous code.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;As we saw in the initial examples of colorless method calls - it’s not purely IO that parks and resumes. It’s the majority of blocking operations. Here’s another example:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;# Threads&lt;/span&gt;
threads &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt; },
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { &lt;span style=&#34;color:#e6db74&#34;&gt;`ruby -v`&lt;/span&gt; },
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Thread&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new { 
    noop &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fork {}
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait noop
  }
&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
	
results &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; threads&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:value&lt;/span&gt;)
	
&lt;span style=&#34;color:#75715e&#34;&gt;# Fibers&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;set_scheduler(&lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Scheduler&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new)
fibers &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; { sleep &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt; },
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; { &lt;span style=&#34;color:#e6db74&#34;&gt;`ruby -v`&lt;/span&gt; },
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; { 
    noop &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fork {}
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Process&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;wait noop
  }
&lt;span style=&#34;color:#f92672&#34;&gt;]&lt;/span&gt;
	
results &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; fibers&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;map(&lt;span style=&#34;color:#f92672&#34;&gt;&amp;amp;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:wait&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;We &lt;a href=&#34;https://docs.ruby-lang.org/en/3.2/Kernel.html#method-i-fork&#34;&gt;&lt;code&gt;fork&lt;/code&gt;&lt;/a&gt; a Ruby process, run a command line script to get the current Ruby version using &lt;code&gt;ruby -v&lt;/code&gt;, and &lt;a href=&#34;https://docs.ruby-lang.org/en/3.2/Kernel.html#method-i-sleep&#34;&gt;&lt;code&gt;sleep&lt;/code&gt;&lt;/a&gt; - all in parallel.&lt;/p&gt;
&lt;h3 id=&#34;async-aside&#34;&gt;A quick aside on Async&lt;/h3&gt;
&lt;p&gt;Manually setting and unsetting the FiberScheduler just demonstrates the interface more clearly. Normally you’d use the block form:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  fibers &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&amp;lt;&lt;/span&gt; get_http_fiber &lt;span style=&#34;color:#75715e&#34;&gt;#...&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You’d also use the &lt;code&gt;Async&lt;/code&gt; helpers instead of &lt;code&gt;Fiber.schedule&lt;/code&gt;, which offer a more robust API:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;get_http_fiber&lt;/span&gt;(url)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
    log_then_get(&lt;span style=&#34;color:#66d9ef&#34;&gt;URI&lt;/span&gt;(url), &lt;span style=&#34;color:#66d9ef&#34;&gt;Fiber&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;current)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;digging-deep&#34;&gt;Digging deep&lt;/h3&gt;
&lt;p&gt;My toy examples aside, this is one of the things that makes threaded/fibered web servers like &lt;a href=&#34;https://github.com/puma/puma&#34;&gt;Puma&lt;/a&gt;/&lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;Falcon&lt;/a&gt; and threaded job servers like &lt;a href=&#34;https://github.com/sidekiq/sidekiq&#34;&gt;Sidekiq&lt;/a&gt;/&lt;a href=&#34;https://github.com/bensheldon/good_job&#34;&gt;GoodJob&lt;/a&gt; so powerful - parallelizing blocking operations. Every time one of your job or web server threads/fibers reads a file, queries a database, makes an HTTP call, or waits for a shared resource, Ruby parks the thread/fiber and another thread/fiber is able to take over. Depending on the type of workload you can have dozens/hundreds/thousands of threads/fibers running concurrently, and blocking in parallel. We’ll push some limits on that later in the series to see where that might break down, and how we can keep scaling past those limits!&lt;/p&gt;
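&lt;p&gt;As a tiny sketch of that scale (again with &lt;code&gt;sleep&lt;/code&gt; standing in for any blocking call), a hundred threads all blocking at once still finish in about a second:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;start = Time.now
threads = 100.times.map do
  Thread.new { sleep 1 } # 100 blocking operations, blocking in parallel
end
threads.each(&amp;amp;:join)
puts Time.now - start # ~1 second, not ~100
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;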
&lt;p&gt;Now that we understand what colorless Ruby programming means and how it works - we’re left with some questions.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Why would we need or want both Threads and Fibers?&lt;/li&gt;
&lt;li&gt;Why do some languages choose to introduce color to their languages? Are there upsides to color?&lt;/li&gt;
&lt;li&gt;What is the deal with those “VM locks”?&lt;/li&gt;
&lt;li&gt;What does the “OS Kernel” have to do with parallel IO?&lt;/li&gt;
&lt;li&gt;What’s a Reactor?&lt;/li&gt;
&lt;li&gt;Is there a best concurrency option?&lt;/li&gt;
&lt;li&gt;Are there abstractions you should be using?&lt;/li&gt;
&lt;li&gt;What’s the point of concurrency without parallelism?&lt;/li&gt;
&lt;li&gt;We also keep talking about parallelizing blocking operations - why don’t threads and fibers already allow you to parallelize everything?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;To answer these questions (and more), we’re going to dig pretty far into the Ruby runtime to better understand Ruby threads, fibers and other forms of concurrency in Ruby.&lt;/p&gt;
&lt;p&gt;Let’s start with the OG, Threads, in “The Thread API: Concurrent, colorless Ruby”. More soon! 👋🏼&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/thread.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;
&lt;h3 id=&#34;ruby-history-corner&#34;&gt;PS: A historical sidenote - Ruby had its own callback phase&lt;/h3&gt;
&lt;p&gt;It was 2009. The Black Eyed Peas were on top of the charts with “Boom Boom Pow”. Microsoft had just launched Windows 7. Avatar was taking off at the box office. And of course most important of all, node.js was released (😉). During a brief ensuing insanity, everyone thought it would devour the world of web development.&lt;/p&gt;
&lt;p&gt;Like any new option, it ultimately took its place &lt;em&gt;alongside&lt;/em&gt; other tools, but it had a big influence on the popularity of the &lt;a href=&#34;https://en.wikipedia.org/wiki/Reactor_pattern&#34;&gt;reactor pattern&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The reactor pattern is an event loop that registers events with the operating system and efficiently multiplexes them. It was a way of solving the &lt;a href=&#34;https://en.m.wikipedia.org/wiki/C10k_problem&#34;&gt;C10K problem&lt;/a&gt; - a solution for serving 10 thousand+ clients from a single server.&lt;/p&gt;
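&lt;p&gt;To make the pattern concrete, here’s a toy sketch (my own illustration, not EventMachine’s API): a single loop multiplexes many client sockets with &lt;code&gt;IO.select&lt;/code&gt;, running a callback whenever a registered IO becomes readable:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;require &amp;#34;socket&amp;#34;
	
# Toy reactor: one loop serves many clients, no threads
callbacks = {}
server = TCPServer.new(9090)
callbacks[server] = lambda do
  client = server.accept
  callbacks[client] = lambda do
    line = client.gets
    client.write(line.upcase) if line
    callbacks.delete(client)
    client.close
  end
end
	
loop do
  readable, = IO.select(callbacks.keys)
  readable.each { |io| callbacks[io].call }
end
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;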
&lt;p&gt;To get a great insight into the Ruby concurrency landscape at that time, and why the node.js model seemed so appealing, see this post from Yehuda Katz in 2010 - &lt;a href=&#34;https://yehudakatz.com/2010/08/14/threads-in-ruby-enough-already/&#34;&gt;https://yehudakatz.com/2010/08/14/threads-in-ruby-enough-already/&lt;/a&gt;. In summary:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Early Rails was not thread-safe so you could only scale with Processes&lt;/li&gt;
&lt;li&gt;Once Rails was “thread-safe”, it started with a giant mutex around the entire framework, which meant only one thread could do anything at a time, including blocking operations&lt;/li&gt;
&lt;li&gt;Even threaded servers at the time would often wrap things poorly and too broadly in mutexes&lt;/li&gt;
&lt;li&gt;The most popular database for Ruby/Rails at the time, MySQL, had a Ruby driver which did not yield to other threads during blocking operations&lt;/li&gt;
&lt;li&gt;Ruby 1.9 was the first Ruby to map to real operating system threads, and loads of people still worked in Ruby 1.8&lt;/li&gt;
&lt;li&gt;He doesn’t mention it specifically, but the gem landscape was also pretty scary in terms of thread safety because many people did not understand or worry about it prior to that point. Things are definitely better now, though I’d recommend reviewing my &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby Programs are always multi-threaded&lt;/a&gt; posts, and specifically &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#tips-for-gems&#34;&gt;tips for auditing gems&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It was a concurrency mess.&lt;/p&gt;
&lt;p&gt;No wonder node.js seemed like a magical concurrency silver bullet. A single process handling hundreds to thousands of users vs dozens of heavyweight processes handling a paltry comparative amount.&lt;/p&gt;
&lt;p&gt;In the Ruby world the reactor pattern was served by a tool called EventMachine. We’ll talk in more depth about EventMachine in the “Concurrent, Colorless Ruby” fiber post later, but it was solely callback-based.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;EventMachine&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;run &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
  redis &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;EM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;Hiredis&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;connect
  http &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;EM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;HttpRequest&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;http://google.com/&amp;#34;&lt;/span&gt;
  )&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;get &lt;span style=&#34;color:#e6db74&#34;&gt;query&lt;/span&gt;: { &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;keyname&amp;#34;&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;value&amp;#34;&lt;/span&gt; }
	
  http&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;errback { &lt;span style=&#34;color:#66d9ef&#34;&gt;EventMachine&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;stop }
  http&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;callback {
    parse(http&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;response)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;link&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
      req &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;EM&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;HttpRequest&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(link)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;get
      req&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;callback {
        body &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; req&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;response
        redis&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;set(link, body)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;callback {
          &lt;span style=&#34;color:#75715e&#34;&gt;# and on and on, until we finally call EventMachine.stop&lt;/span&gt;
        }
      }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  }
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You can see how messy coding like this becomes, just like the original callback model we presented in JavaScript.&lt;/p&gt;
&lt;p&gt;EventMachine was a powerful scaling option, and people still maintain EventMachine-based code today, but it was at odds with the normal flow of Ruby code. Ruby could have moved to red and blue style calls like some other languages to clean that up, but instead superseded those Reactor efforts with the FiberScheduler.&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The programming language, not the game 🙂&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Turns out prime did a reaction video on it as well &lt;a href=&#34;https://youtu.be/MoKe4zvtNzA&#34;&gt;https://youtu.be/MoKe4zvtNzA&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;In case this seems like a confusing and bizarre title, it’s a play on “the road to hell is paved with good intention” &lt;a href=&#34;https://en.wikipedia.org/wiki/The_road_to_hell_is_paved_with_good_intentions&#34;&gt;https://en.wikipedia.org/wiki/The_road_to_hell_is_paved_with_good_intentions&lt;/a&gt; 🌈&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I’m sure this is not exhaustive, the list is just illustrative and represents some popular languages. You don’t need to defend your language of choice 😉&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Ractor stands for “Ruby Actor”, so it’s a Ruby form of the &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Actor_model&#34;&gt;actor&lt;/a&gt; model. It works similarly to &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Using_web_workers&#34;&gt;JavaScript WebWorkers&lt;/a&gt; in that they are isolated and require strict rules for sharing to enforce thread safety&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;For now, as we’ll discuss later&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;As with most things, there’s even more nuance to the layering and we’ll touch on it a bit later&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:8&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;But have evolved significantly since then. This also required community support in addition to language support&amp;#160;&lt;a href=&#34;#fnref:8&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:9&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It’s more nuanced than this, and this is more for illustrative purposes. Even though they don’t run Ruby code in parallel, threads and fibers may still be tied to separate CPUs.&amp;#160;&lt;a href=&#34;#fnref:9&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/aa82bee7-c210-489e-9b5c-b1fa798f2d8d.jpeg)

&gt; 👋🏼 This is part of a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt;   - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - Ruby methods are colorless
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “Ruby methods are colorless”. I’ll update the links as each part is released, and include these links in each post.

- [First, some context](#some-context)
	- [Function colors](#function-colors)
	- [The road to hell is paved with callbacks](#paved-with-callbacks)
	- [Invasive async/await](#invasive-async-await)
- [What makes Ruby colorless?](#what-makes-ruby-colorless)
	- [A nested concurrency model 🪆](#nested-concurrency)
	- [Colorless threads 🧵 and Fibers 🌾 ](#threads-and-fibers)
	- [What does it mean to be concurrent?](#to-be-concurrent)
	- [Colorless calls](#colorless-calls)
	- [A quick aside on Async](#async-aside)
	- [Digging deep](#digging-deep)
	- [PS: A historical sidenote - Ruby had its own callback phase](#ruby-history-corner)

&gt; 📝 **A quick note on Ruby runtimes**
&gt; 
&gt; I’m approaching this series from the perspective of CRuby. This is the original and most popular/common Ruby runtime. I’m also focusing primarily on Ruby 3.2/3.3+. This matters because it informs how things like Threads and Fibers behave.

&lt;h2 id=&#34;some-context&#34;&gt;First, some context&lt;/h2&gt;

Last year I [read a tweet](https://twitter.com/ThePrimeagen/status/1689289653680033792) from [@ThePrimagen](https://twitter.com/ThePrimeagen) about his experience with [Go](https://go.dev/)[^1]. In it he (cheekily) mentioned some strengths and weaknesses, and then used a phrase I’d never heard before: “colorless functions”.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/03867f06ca.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;

&gt; real talk, colorless functions [are] amazing and I think that most people don’t realize how great it really is
&gt; 
&gt; -- @ThePrimeagen

&lt;h3 id=&#34;function-colors&#34;&gt;Function colors&lt;/h3&gt;

Curious to understand what a “colorless function” is, I did a quick search, and the first result was this article: [What color is your function?](https://journal.stuffwithstuff.com/2015/02/01/what-color-is-your-function/)[^2]

In it, [Bob Nystrom](https://twitter.com/munificentbob) goes through a fictional language (spoiler: it’s Javascript/node.js) that breaks functions down into one of two “colors”:

&gt; 1. Every function has a color
&gt; 2. The way you call a function depends on its color
&gt; 3. You can only call a red function from within a red function
&gt; 4. Red functions are more painful to call
&gt; 5. Some core library functions are red
&gt; 
&gt; It’s an allegory... red functions are asynchronous ones
&gt; 
&gt; -- Nystrom

Red and blue functions equate to asynchronous and synchronous functions. And having to indicate the color of your function infects the rest of your code and the API decisions you make.

&lt;h3 id=&#34;paved-with-callbacks&#34;&gt;The road to hell is paved with callbacks&lt;/h3&gt;

Nystrom’s article was written in 2015 while Javascript/node was still in its [callback hell](http://callbackhell.com/)[^3] phase. Any operation that could block, like reading a file, making an HTTP call, or querying a database, had to be done using a callback. Callbacks proliferated throughout your code - so asynchronous, red functions were plentiful and painful to interact with.

In JavaScript, everything happens in a single-ish threaded context called the “event loop”. You can’t block the loop - so you would set a callback to know when a blocking call completed:

```js
readFile(path, (content, err) =&gt; {
  if (err) { throw err; }
  saveFile(content, (file, err) =&gt; {
    // and on and on...
  });
});
```
After callbacks, he alludes to what would be the subsequent [Promises](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Using_promises) phase of Javascript/node.js with his section “_I promise the future is better_”. Promises were an ergonomic improvement over callbacks, but they were still cumbersome, and they don’t solve most red/blue pain points:

```js
readFile(path)
  .then((content) =&gt; {
    saveFile(content)
      .then((file) =&gt; {})
      .catch((err) =&gt; {})
  })
  .catch((err) =&gt; console.error(err));
```
Finally he describes [the `async/await` phase](https://javascript.info/async-await) in “_I’m awaiting a solution_”, which leads us to modern Javascript:

```js
let content;
try {
  content = await readFile(path);
} catch (err) {
  console.error(err);
}
	
try {
  const file = await saveFile(content);
} catch (err) {
  console.error(err);
}
```
&lt;h3 id=&#34;invasive-async-await&#34;&gt;Invasive async/await&lt;/h3&gt;

I’ll edit “_I’m awaiting a solution_” for modern JS, where he notes:

&gt; 1. Synchronous functions return values, async ones return `Promise&lt;T&gt;` _(Promise containing type T)_ wrappers around the value.
&gt; 2. Sync functions are just called, async ones need an `await`.
&gt; 3. If you call an async function you’ve got this wrapper object when you actually want the `T`. You can’t unwrap it unless you make _your_ function async and await it.”
&gt; 
&gt; -- Nystrom

\#3 is the particularly infectious bit. Async code bubbles all the way to the top. If you want to use `await`, then you have to mark your function as `async`. Then if someone else calling your function wants to use `await`, they _also_ have to mark themselves as `async`, on and on until the root of the call chain. If at any point you don’t, then you have to use the `async` result (in JavaScript’s case a `Promise&lt;T&gt;`).

```js
async function readFile(): Promise&lt;string&gt; {
  return await read();
}
	
async function iCallReadFile(): Promise&lt;string&gt; {
  return await readFile();
}
	
async function iCallICallReadFile(): Promise&lt;string&gt; {
  return await iCallReadFile();
}
	
function iGiveUp(callback) {
  iCallICallReadFile()
    .then(callback)
    .catch((err) =&gt; {})
}
```
Once your language is broken down into async and synchronous functions, you can’t escape it. 

Even more onerous, if it isn’t built into your language core like JavaScript/node.js, adding it later means modifying your entire runtime, libraries and codebases to understand it. This was (still can be?) a painful transition for languages like Python and Rust.

```rust
// et tu, Rust?
async fn main() {
  let content = read_file().await;
}
```
&lt;h2 id=&#34;what-makes-ruby-colorless&#34;&gt;What makes Ruby colorless?&lt;/h2&gt;

So what does a “blue” (synchronous) method look like in Ruby?

```rb
file = File.read(path)
```
And what does that same “red” (asynchronous) call look like?

```rb
file = File.read(path)
```
Wait. Is that right? Surely there must be more nuance?

HTTP?

```rb
response = Net::HTTP.get(
  URI(&#34;https://example.com&#34;)
)
```
Shell calls!

```rb
output = `cat file.txt`
```
Process spawning??

```rb
pid = fork do
  # good stuff
end
Process::Status.wait pid
```
Mutex locks?!?

```rb
mutex.synchronize {}
```
DNS resolution!??!

```rb
ip = Resolv.getaddress &#34;ruby-lang.org&#34;
```
Sleep!?!!!

```rb
sleep 5
```
😮‍💨😮‍💨😮‍💨

Well that’s nice! There is no difference between asynchronous and synchronous methods, so no color. We get asynchronous behavior for free. Blog post over!?

Well... not _exactly_. We still don’t know _how_ we get that behavior.

In the section “_What language isn’t colored?_” Nystrom specifically mentions Ruby being one of a few colorless languages[^4]:

&gt; ...languages that don’t have this problem: [Java,] Go, Lua, and Ruby.
&gt; 
&gt; Any guess what they have in common?
&gt; 
&gt; _Threads_. Or, more precisely: multiple independent callstacks that can be switched between. It isn’t strictly necessary for them to be operating system threads. Goroutines in Go, coroutines in Lua, and fibers in Ruby are perfectly adequate.
&gt; 
&gt; -- Nystrom

So Ruby is colorless, and to be colorless you need independent call stacks that you can switch between. And apparently Threads and Fibers can give you that.

What does that mean and what does that enable? And since Nystrom’s article was written in 2015 - is there anything new to Ruby since then? 

&lt;h3 id=&#34;nested-concurrency&#34;&gt;A nested concurrency model 🪆&lt;/h3&gt;

In [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html), we briefly discussed that Ruby has a layered concurrency model and that the best way to describe it is as a nesting doll - as you pull away the outer layers, you find new layers of concurrency inside. 

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/a44a601478.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;
 
&gt; Here’s my key for mapping icons to concurrency
&gt; 
&gt; - 💎 represents a Ruby process, which is essentially another instance of Ruby itself
&gt; - 🦖 represents a Ruby Ractor. Ractor stands for Ruby Actor, as in the Actor model. But I always think of the word “Raptor” instead, so I represent it with a Dinosaur 🤷🏻‍♂️ 
&gt; - 🧵 represents a Ruby Thread. It’s a thread icon, for obvious reasons 
&gt; - 🌾 represents a Ruby Fiber. I think of wheat when I think of Fibers, hence 🌾 

As a refresher, in your own Ruby code, you can easily inspect each layer directly:

```rb
puts &#34;Process #{Process.pid}&#34;
puts &#34;  Ractor #{Ractor.current}&#34;
puts &#34;    Thread #{Thread.current}&#34;
puts &#34;      Fiber #{Fiber.current}&#34;
	
# Process 123
#   Ractor #&lt;Ractor:#1 running&gt;
#     Thread #&lt;Thread:0x0... run&gt;
#       Fiber #&lt;Fiber:0x0... (resumed)&gt;
```
Your code is always operating within a specific instance of each layer. Mostly you can access those instances with a call to `current`.

Each layer has unique characteristics when it comes to running concurrent code. Some can operate in parallel, executing independent Ruby code at the exact same time. Others run your Ruby code concurrently, moving back and forth between them but never running simultaneously.

![](https://cdn.uploads.micro.blog/98548/2024/1f6316498b.jpeg)

&gt; *source*: jpcamara.com

The outermost layer is a **Process 💎**. Every program on every common operating system (Linux, Windows, macOS) runs inside a process - it’s the top-level unit of execution in the OS. You can have as many processes as your OS will allow - but the number of CPU cores determines how many processes can run in parallel. Multiple Ruby processes can run in parallel but they live in isolated memory spaces - they have no simple way of communicating with each other.

```rb
pid = fork do
  #...
end
Process::Status.wait pid
```
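Since that isolation can be surprising, here’s a tiny sketch (mine, not from the `fork` docs) showing that a child process only ever sees a copy of the parent’s memory:

```rb
counter = 0
pid = fork do
  counter += 1 # increments the child&#39;s copy only
end
Process::Status.wait pid
puts counter # =&gt; 0; the child&#39;s change never reaches the parent
```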
Internally, a Ruby process is made up of **Ractors 🦖**. Ractors don’t have a direct parallel in the OS - they’re a unique Ruby structure which wraps native OS threads[^5]. Multiple Ractors can run Ruby code in parallel, and they have strict sharing rules to keep them memory safe - so you can only communicate between them by passing messages. Those messages can copy data, or they can move objects completely between Ractors and change ownership. Ractors _sound_ pretty nice, but are still experimental. Regardless, every Ruby process starts out with a single Ractor that your Ruby code runs inside of.

```rb
ractor = Ractor.new do
 #...
end
ractor.take
```
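And here’s a minimal sketch of that message passing (again mine, using the Ruby 3.2/3.3 Ractor API):

```rb
r = Ractor.new do
  msg = Ractor.receive # blocks until a message arrives
  msg.upcase           # the last expression is what `take` returns
end
r.send(&#34;hello&#34;) # copies the string into the Ractor
puts r.take     # =&gt; &#34;HELLO&#34;
```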
Inside of Ractors, you have **Threads 🧵**. Threads are created 1:1 with threads on your operating system[^6]. These threads share the same memory space with each other and with the process that created them, and they spend their lifetimes there. Because threads share the same memory space they have to be carefully coordinated to safely manage state. Ruby threads cannot run CPU-bound Ruby code in parallel, but they can parallelize for blocking operations. Threads run _preemptively_, which means they can swap out of your code at any time, and move to other Ruby code. Every Ruby Ractor starts out with a single thread that your Ruby code runs inside of.

```rb
thread = Thread.new do
  #...
end
thread.join
```
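To illustrate that coordination (a sketch, not code from the posts above): several threads mutate one shared counter, with a Mutex keeping the result deterministic:

```rb
counter = 0
lock = Mutex.new
threads = 10.times.map do
  Thread.new do
    1_000.times { lock.synchronize { counter += 1 } }
  end
end
threads.each(&amp;:join)
puts counter # =&gt; 10000 every time, because the mutex guards the shared state
```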
Inside of threads, you have **Fibers 🌾**. Fibers are another structure with no direct OS equivalent (though they have parallels in other languages). Fibers are a tool intended for lightweight, cooperative concurrency. Like Ruby threads, Fibers share a memory space, but their coordination is more deterministic. Also like Ruby threads, Fibers cannot run CPU-bound Ruby code in parallel, but with a FiberScheduler they can parallelize for blocking operations. Every Ruby thread starts out with a single fiber that your Ruby code runs inside of.

```rb
fiber = Fiber.new do
  #...
end
fiber.resume # transfer / Fiber.schedule
```
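And a minimal sketch of that deterministic, cooperative hand-off, using the plain (non-scheduler) Fiber API:

```rb
ping = Fiber.new do
  3.times do |i|
    puts &#34;ping #{i}&#34;
    Fiber.yield # hand control back to the caller - nothing preempts us
  end
end
3.times { ping.resume } # each resume runs the fiber to its next yield
```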
When a Ruby program starts, all four layers start with it. The **Process 💎** starts, it creates a **Ractor 🦖**, which creates a **Thread 🧵**, which creates a **Fiber 🌾**. Ultimately, your Ruby code is in the context of all 4, but most specifically the fiber. Each layer is always active in a Ruby program whether you use them directly or not[^7]. We can utilize each in unique ways as well.

That’s a lot! In subsequent posts you’ll learn why each layer exists and the value each provides. Every layer will get some discussion - but as it relates to asynchronous, colorless programming, threads and fibers are the most relevant. 

&lt;h3 id=&#34;threads-and-fibers&#34;&gt;Colorless threads 🧵 and fibers 🌾&lt;/h3&gt;

![](https://cdn.uploads.micro.blog/98548/2024/group-83-1.jpg)

&gt; *source*: jpcamara.com

[Threads](https://rubyapi.org/3.3/o/thread) 🧵 are one of the oldest and most common units of concurrency in Ruby. They’re the OG for splitting up concurrent chunks of work. They’ve been in Ruby since Ruby 1.1[^8] and have enabled colorless calls for many years.

[Fibers](https://rubyapi.org/3.3/o/fiber) 🌾 aren’t too young themselves, but are about 10 years younger than their older sibling Threads. They were released in Ruby 1.9, but remained a bit of an esoteric option for a long time. They offered a form of concurrency, but for most use-cases they didn’t really help - unlike Threads, they didn’t enable colorless programming at first.

But ever since Ruby 3, Fibers have been given superpowers in the form of the FiberScheduler. By using a FiberScheduler, Fibers can now provide seamless, colorless programming.

So both threads and fibers offer colorless, concurrent programming. We’ve got a definition for colorless, but what about concurrency?

&lt;h3 id=&#34;to-be-concurrent&#34;&gt;What does it mean to be concurrent?&lt;/h3&gt;

The most popular source for a definition of concurrency is from the creator of Go, Rob Pike, in his talk [concurrency is not parallelism](https://youtu.be/oV9rvDllKEg?si=L3aitDpilMRk5Bfn). Another member of the Go team summarized it: 

&gt; When people hear the word concurrency they often think of parallelism, a related but quite distinct concept. In programming, **concurrency is the composition of independently executing processes**, while parallelism is the simultaneous execution of (possibly related) computations. **Concurrency is about dealing with lots of things at once. Parallelism is about doing lots of things at once.**
&gt; 
&gt; **source**: [https://go.dev/blog/waza-talk](https://go.dev/blog/waza-talk)

They describe it as a way of composing tasks. It allows you to spread out and coordinate work. How that work is ultimately executed is not relevant to whether the code is concurrent or not.

Look at the activity monitor for your OS right now. If you were to count the number of processes and threads, it would vastly exceed the number of available cores. And yet everything still chugs along smoothly. That’s concurrency in action.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/activity-monitor-final.png&#34; width=&#34;60%&#34; height=&#34;60%&#34; alt=&#34;&#34;&gt;

&gt; There are over 200 threads associated with just these 5 apps. I don’t have 200 CPU cores, so only a handful of them could ever be active in parallel. But all of them _could_ operate _concurrently_.

Having said all that - in most cases we do _want_ our work to happen in parallel as much as possible. We want to maximize our resources and parallelize as much as we can.

In Rob Pike’s concurrency talk, he used the Go mascot, the gopher, to demonstrate how his task (incinerating obsolete language manuals 😅) was composed, and how it could be run in parallel, or with one gopher at a time, but it was still concurrent:

![](https://cdn.uploads.micro.blog/98548/2024/draggedimage.png)

&gt; **source**: [https://go.dev/talks/2012/waza.slide](https://go.dev/talks/2012/waza.slide)

Go’s gophers are kind of fun. Let’s use some concurrency mascots for Ruby to show our own example.

### Process!
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/process.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;


### Ractor!
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ractor.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;

### Thread!
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/thread.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;

### Fiber!
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/fiber.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;

### CPU!
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/cpu.png&#34; width=&#34;35%&#34; height=&#34;35%&#34; alt=&#34;&#34;&gt;

&gt; *source*: jpcamara.com

As a baseline, Processes and Ractors can run in parallel for as many cores as are available:

&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-2.png&#34; width=&#34;488&#34; height=&#34;283&#34; alt=&#34;&#34;&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio.png&#34; width=&#34;488&#34; height=&#34;273&#34; alt=&#34;&#34;&gt;

But Threads and Fibers, while concurrent, effectively only parallelize against one core at a time[^9]:

&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-1.png&#34; width=&#34;488&#34; height=&#34;365&#34; alt=&#34;&#34;&gt;

What happens when we start scaling Processes and Ractors past the number of available cores?

&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-3.png&#34; width=&#34;600&#34; height=&#34;252&#34; alt=&#34;&#34;&gt;&lt;img class=&#34;invert&#34; src=&#34;https://cdn.uploads.micro.blog/98548/2024/colorless-mascots.drawio-4.png&#34; width=&#34;600&#34; height=&#34;245&#34; alt=&#34;&#34;&gt;

Once we exceed the ability to parallelize, we are concurrent in the same way as Threads and Fibers! This model allows our units of concurrency to adapt to the environment - when cores are available they can run in parallel, and when cores are not available they can swap between each other, transparently to the program itself. Every program eventually becomes concurrent as it scales, at least for CPU-bound processing.

But this leaves Threads and Fibers looking pretty limited. They allow you to break up your work into independent, interleaving tasks, operating concurrently on Ruby code. But what good is that since they _never_ operate in parallel... or do they?

&lt;h3 id=&#34;colorless-calls&#34;&gt;Colorless calls&lt;/h3&gt;

How does this all relate to colorless methods? 

Let’s see with an example. We’re retrieving a few different websites we care about, and we want to fetch them as efficiently as possible.

First we try it with threads. For demonstration purposes, we use the `httpbin.org/delay` endpoint to simulate delays in responses.

```rb
require &#34;net/http&#34;
	
def log_then_get(url)
  puts &#34;Requesting #{url}...&#34;
  get(url)
end
	
def get(uri)
  response = Net::HTTP.get(uri)
  puts caller(0).join(&#34;\n&#34;)
  response
end
	
def get_http_thread(url)
  Thread.new do
    log_then_get(URI(url))
  end
end
	
def get_http_via_threads
  threads = []
  threads &lt;&lt; get_http_thread(
    &#34;https://httpbin.org/delay/3?ex=1&#34;
  )
  threads &lt;&lt; get_http_thread(
    &#34;https://httpbin.org/delay/3?ex=2&#34;
  )
  threads &lt;&lt; get_http_thread(
    &#34;https://httpbin.org/delay/3?ex=3&#34;
  )
  threads &lt;&lt; get_http_thread(
    &#34;https://httpbin.org/delay/3?ex=4&#34;
  )
  threads.map(&amp;:value)
end
	
now = Time.now
get_http_via_threads
puts &#34;Thread runtime: #{Time.now - now}&#34;
```
This code is doing a few things:

1. It’s split into multiple methods to create a callstack (effectively, a backtrace). This callstack allows us to demonstrate that we maintain the position and state of each method call, even when we switch between threads.
2. It creates four threads and appends them to an array. After those threads are initialized they go into a ready state, available to be run by the thread scheduler.
3. It calls `value` on each thread. This blocks the program until each thread finishes, and returns the last value evaluated in the thread. Once we block the main thread with `value`, the other threads have an immediate opportunity to start running. In this case we block until each thread finishes making an HTTP call.

```rb
# &gt; bundle exec ruby main.rb
# 
# Requesting https://httpbin.org/delay/3?ex=1...
# Requesting https://httpbin.org/delay/3?ex=2...
# Requesting https://httpbin.org/delay/3?ex=3...
# Requesting https://httpbin.org/delay/3?ex=4...
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:18:in `block in get_http_thread&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:18:in `block in get_http_thread&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:18:in `block in get_http_thread&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:18:in `block in get_http_thread&#39;
# Thread runtime: 3.340238554
```
Next we try it with fibers:

```rb
require &#34;net/http&#34;
require &#34;async&#34;
	
# Same #log_then_get
# Same #get
	
def get_http_fiber(url, responses)
  Fiber.schedule do
    responses &lt;&lt; log_then_get(URI(url))
  end
end
	
def get_http_via_fibers
  Fiber.set_scheduler(Async::Scheduler.new)
  responses = []
  get_http_fiber(
    &#34;https://httpbin.org/delay/3?ex=1&#34;, responses
  )
  get_http_fiber(
    &#34;https://httpbin.org/delay/3?ex=2&#34;, responses
  )
  get_http_fiber(
    &#34;https://httpbin.org/delay/3?ex=3&#34;, responses
  )
  get_http_fiber(
    &#34;https://httpbin.org/delay/3?ex=4&#34;, responses
  )
  responses
ensure
  Fiber.set_scheduler(nil)
end
	
now = Time.now
get_http_via_fibers
puts &#34;Fiber runtime: #{Time.now - now}&#34;
```
Same as before, this code is doing a few things:

1. It’s split into multiple methods to create a callstack. Same as with threads, this callstack allows us to demonstrate that we maintain the position and state of each method call, even when we switch between fibers.
2. It schedules four Fibers. We append to an array inside the `Fiber.schedule` to get the result. Unlike threads, a scheduled fiber will start running its block immediately.
3. The fibers coordinate among themselves and with the current fiber. When we unset the FiberScheduler using `set_scheduler(nil)` in our `ensure`, the `Async::Scheduler` makes sure all fibers have finished running before returning.

This results in the following, similar backtrace:

```rb
# &gt; bundle exec ruby main.rb
# 
# Requesting https://httpbin.org/delay/3?ex=1...
# Requesting https://httpbin.org/delay/3?ex=2...
# Requesting https://httpbin.org/delay/3?ex=3...
# Requesting https://httpbin.org/delay/3?ex=4...
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:24:in `block in get_http_fiber&#39;
# .../async-2.6.3/lib/async/task.rb:160:in `block in run&#39;
# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:24:in `block in get_http_fiber&#39;
# .../async-2.6.3/lib/async/task.rb:160:in `block in run&#39;
# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:24:in `block in get_http_fiber&#39;
# .../async-2.6.3/lib/async/task.rb:160:in `block in run&#39;
# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&#39;
# main.rb:12:in `get&#39;
# main.rb:7:in `log_then_get&#39;
# main.rb:24:in `block in get_http_fiber&#39;
# .../async-2.6.3/lib/async/task.rb:160:in `block in run&#39;
# .../async-2.6.3/lib/async/task.rb:330:in `block in schedule&#39;
# Fiber runtime: 3.291355669
```
If we visualize what’s happening, we see how things are getting coordinated. It’s nearly identical between the two:

Threads:

![](https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio.png)

Fibers:

![](https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-1.png)

👆🏼This diagram does look pretty sequential - that’s odd huh?

And here is what happens once we are waiting for a response:

Threads:

![](https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-3.png)

Fibers:

![](https://cdn.uploads.micro.blog/98548/2024/colorless-threadsfibers-diagram.drawio-2.png)

👆🏼Oh, that seems better!

Running these HTTP methods looks the same whether I run them with no threads/fibers, 4 threads/fibers, or 400 threads/fibers. The code never changes - the execution context is all we need to start operating asynchronously. _Something_ is coordinating for us behind-the-scenes, and we’re able to handle our HTTP calls in parallel 🙌🏼.

This lines up with our timing. In each example we ran 4 HTTP calls, all guaranteed to block for around 3 seconds. Despite that, each example finished in roughly 3 seconds, rather than the 12 seconds it would take if they had run sequentially.

```rb
# Fiber runtime: 3.291355669
# Thread runtime: 3.340238554
```
To paraphrase Bob Nystrom’s article one last time, we can apply his original points about Go almost perfectly to Ruby threads/fibers:

&gt; As soon as you do any IO operation, it just parks that thread/fiber and resumes any other one that isn’t blocked on IO. 
&gt; 
&gt; If you look at the IO operations in the standard library, they seem synchronous. In other words, they just do work and then return a result when they are done. But it&#39;s not that they&#39;re synchronous in the sense that it would mean in JavaScript. Other Ruby code can run while one of these operations is pending. It&#39;s that Ruby has eliminated the distinction between synchronous and asynchronous code.

As we saw in the initial examples of colorless method calls - it’s not purely IO that parks and resumes. It’s the majority of blocking operations. Here’s another example:

```rb
# Threads
threads = [
  Thread.new { sleep 5 },
  Thread.new { `ruby -v` },
  Thread.new { 
    noop = fork {}
    Process.wait noop
  }
]
	
results = threads.map(&amp;:value)
	
# Fibers
Fiber.set_scheduler(Async::Scheduler.new)
fibers = [
  Async { sleep 5 },
  Async { `ruby -v` },
  Async { 
    noop = fork {}
    Process.wait noop
  }
]
	
results = fibers.map(&amp;:wait)
```
We [`fork`](https://docs.ruby-lang.org/en/3.2/Kernel.html#method-i-fork) a ruby process, shell out for the current ruby version using `ruby -v`, and [`sleep`](https://docs.ruby-lang.org/en/3.2/Kernel.html#method-i-sleep) - all in parallel.
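
If we wrap the thread version in a timer, the overlap shows up directly - a minimal sketch reusing the same operations:

```rb
now = Time.now
threads = [
  Thread.new { sleep 5 },
  Thread.new { `ruby -v` },
  Thread.new { Process.wait(fork {}) }
]
threads.each(&amp;:join)
# ~5 seconds - dominated by the sleep, not the sum of all three
puts &#34;Runtime: #{Time.now - now}&#34;
```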

&lt;h3 id=&#34;async-aside&#34;&gt;A quick aside on Async&lt;/h3&gt;

Manually setting and then unsetting the FiberScheduler was just to demonstrate the interface more clearly. Normally you’d use the block form:

```rb
Async do
  fibers &lt;&lt; get_http_fiber #...
end
```
You’d also use the `Async` helpers instead of `Fiber.schedule`, which have a more robust API:

```rb
def get_http_fiber(url)
  Async do
    log_then_get(URI(url))
  end
end
```
&lt;h3 id=&#34;digging-deep&#34;&gt;Digging deep&lt;/h3&gt;

My toy examples aside, this is one of the things that makes threaded/fibered web servers like [Puma](https://github.com/puma/puma)/[Falcon](https://github.com/socketry/falcon) and threaded job servers like [Sidekiq](https://github.com/sidekiq/sidekiq)/[GoodJob](https://github.com/bensheldon/good_job) so powerful - parallelizing blocking operations. Every time one of your job or web server threads/fibers reads a file, queries a database, makes an HTTP call, or waits for a shared resource, Ruby parks the thread/fiber and another thread/fiber is able to take over. Depending on the type of workload you can have dozens/hundreds/thousands of threads/fibers running concurrently, and blocking in parallel. We’ll push some limits on that later in the series to see where that might break down, and how we can keep scaling past those limits!
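
As a toy version of what those servers get for free: 100 “requests”, each blocked on IO for a second, finish in roughly one second total. Here `sleep` is just a stand-in for a database query or HTTP call:

```rb
now = Time.now
threads = 100.times.map do
  Thread.new { sleep 1 } # stand-in for a DB query or HTTP call
end
threads.each(&amp;:join)
puts &#34;100 blocking operations in #{Time.now - now}s&#34;
```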

Now that we understand what colorless Ruby programming means and how it works - we’re left with some questions.

- Why would we need or want both Threads and Fibers?
- Why do some languages choose to introduce color to their languages? Are there upsides to color?
- What is the deal with those “VM locks”?
- What does the “OS Kernel” have to do with parallel IO?
- What’s a Reactor?
- Is there a best concurrency option?
- Are there abstractions you should be using?
- What’s the point of concurrency without parallelism?
- We also keep talking about parallelizing blocking operations - why don’t threads and fibers already allow you to parallelize everything?

To answer these questions (and more), we’re going to dig pretty far into the Ruby runtime to better understand threads, fibers and other forms of concurrency in Ruby.

Let’s start with the OG, Threads, in “The Thread API: Concurrent, colorless Ruby”. More soon! 👋🏼 

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/thread.png&#34; width=&#34;25%&#34; height=&#34;25%&#34; alt=&#34;&#34;&gt;

&lt;h3 id=&#34;ruby-history-corner&#34;&gt;PS: A historical sidenote - Ruby had its own callback phase&lt;/h3&gt;

It was 2009. The Black Eyed Peas were on top of the charts with “Boom Boom Pow”. Microsoft had just launched Windows 7. Avatar was taking off at the box office. And of course, most important of all, node.js was released (😉). During the brief ensuing insanity, everyone thought it would devour the world of web development.

Like any new option, it ultimately took its place _alongside_ other tools, but it had a big influence on the popularity of the [reactor pattern](https://en.wikipedia.org/wiki/Reactor_pattern).

The reactor pattern is an event loop that registers events with the operating system and efficiently multiplexes them. It was a way of solving the [C10K problem](https://en.m.wikipedia.org/wiki/C10k_problem) - serving 10 thousand+ clients from a single server.
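
To make the pattern concrete, here’s a bare-bones sketch of the idea - not EventMachine’s API, just the underlying mechanism of one loop using `IO.select` to multiplex many connections (a toy echo server):

```rb
require &#34;socket&#34;
	
server = TCPServer.new(9000)
clients = []
	
loop do
  # Ask the OS which IOs are ready, instead of one thread per client
  readable, = IO.select([server] + clients)
  readable.each do |io|
    if io == server
      clients &lt;&lt; server.accept
    else
      data = io.read_nonblock(1024, exception: false)
      if data == :wait_readable
        next
      elsif data.nil? # client disconnected
        io.close
        clients.delete(io)
      else
        io.write(data) # echo it back
      end
    end
  end
end
```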

For great insight into the Ruby concurrency landscape at that time, and why the node.js model seemed so appealing, see this 2010 post from Yehuda Katz - [https://yehudakatz.com/2010/08/14/threads-in-ruby-enough-already/](https://yehudakatz.com/2010/08/14/threads-in-ruby-enough-already/). In summary:

- Early Rails was not thread-safe, so you could only scale with Processes
- Once Rails was “thread-safe”, it started with a giant mutex around the entire framework, which meant only one thread could do anything at a time, including blocking operations
- Even threaded servers at the time would often wrap things poorly and too broadly in mutexes
- The most popular database for Ruby/Rails at the time, MySQL, had a Ruby driver which did not release the GVL on blocking operations, stalling every other thread
- Ruby 1.9 was the first Ruby to map its threads to real operating system threads, and loads of people still worked in Ruby 1.8
- He doesn’t mention it specifically, but the gem landscape was also pretty scary in terms of thread safety because many people did not understand or worry about it prior to that point. Things are definitely better now, though I’d recommend reviewing my [Your Ruby Programs are always multi-threaded](https://jpcamara.com/2024/06/04/your-ruby-programs.html) posts, and specifically [tips for auditing gems](https://jpcamara.com/2024/06/23/your-ruby-programs.html#tips-for-gems)

It was a concurrency mess.

No wonder node.js seemed like a magical concurrency silver bullet. A single process handling hundreds to thousands of users vs dozens of heavyweight processes handling a comparatively paltry amount.

In the Ruby world the reactor pattern was served by a tool called EventMachine. We’ll go deeper on EventMachine in the “Concurrent, Colorless Ruby” fiber post later, but it was solely callback-based.

```rb
EventMachine.run do
  redis = EM::Hiredis.connect
  http = EM::HttpRequest.new(
    &#34;http://google.com/&#34;
  ).get query: { &#34;keyname&#34; =&gt; &#34;value&#34; }
	
  http.errback { EventMachine.stop }
  http.callback {
    parse(http.response).each do |link|
      req = EM::HttpRequest.new(link).get
      req.callback {
        body = req.response
        redis.set(link, body).callback {
          # and on and on, until eventually
          # calling EventMachine.stop
        }
      }
    end
  }
end
```
You can see how messy coding like this becomes, just like the original callback model we presented in JavaScript.

EventMachine was a powerful scaling option, and people still maintain EventMachine-based code today, but it was at odds with the normal flow of Ruby code. Ruby could have moved to red and blue style calls like some other languages to clean that up, but instead superseded those Reactor efforts with the FiberScheduler.
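
For contrast, here’s roughly the same flow sketched against the FiberScheduler via Async - `parse` and `store` are hypothetical helpers standing in for the EventMachine versions above. It reads top to bottom, no callbacks required:

```rb
require &#34;async&#34;
require &#34;net/http&#34;
	
Async do
  response = Net::HTTP.get(URI(&#34;http://google.com/?keyname=value&#34;))
  parse(response).each do |link|
    Async do
      body = Net::HTTP.get(URI(link))
      store(link, body) # e.g. a redis set
    end
  end
end
```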

[^1]:	The programming language, not the game 🙂

[^2]:	Turns out prime did a reaction video on it as well https://youtu.be/MoKe4zvtNzA

[^3]:	In case this seems like a confusing and bizarre title, it’s a play on “the road to hell is paved with good intentions” [https://en.wikipedia.org/wiki/The\_road\_to\_hell\_is\_paved\_with\_good\_intentions](https://en.wikipedia.org/wiki/The_road_to_hell_is_paved_with_good_intentions) 🌈 

[^4]:	I’m sure this is not exhaustive, the list is just illustrative and represents some popular languages. You don’t need to defend your language of choice 😉

[^5]:	They stand for Ruby Actors so it is a Ruby form of the [actor](https://en.m.wikipedia.org/wiki/Actor_model) model. It works similarly to [JavaScript WebWorkers](https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Using_web_workers) in that they are isolated and require strict rules for sharing to enforce thread safety

[^6]:	For now, as we’ll discuss later

[^7]:	As with most things, there’s even more nuance to the layering and we’ll touch on it a bit later

[^8]:	But have evolved significantly since then. This also required community support in addition to language support

[^9]:	It’s more nuanced than this, and this is more for illustrative purposes. Even though they don’t run Ruby code in parallel, threads and fibers may still be tied to separate CPUs.
</source:markdown>
    </item>
    
    <item>
      <title>Consistent, request-local state</title>
      <link>https://jpcamara.com/2024/07/01/consistent-requestlocal-state.html</link>
      <pubDate>Mon, 01 Jul 2024 20:11:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/07/01/consistent-requestlocal-state.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/image-6-30-24-6-25pm.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into 12 main parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;Consistent, request-local state&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “Consistent, request-local state”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Rather than a full entry in the series, this is a brief-ish&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt; concurrency mini - a follow up to &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id=&#34;recap&#34;&gt;Recap&lt;/h2&gt;
&lt;p&gt;In &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#sharing-state-with-fibers&#34;&gt;Sharing state with fibers&lt;/a&gt;, we walked through a scenario where you had fiber-local state using &lt;code&gt;Thread.current[]&lt;/code&gt;, and it was not available inside nested Fibers.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# This is #&amp;lt;User id=123&amp;gt;
Thread.current[:app_user]
Async do # equivalent to Fiber.new
  # This is nil
  Thread.current[:app_user]
  params[:files].each do |file|
    Async { # Fiber.new
      # Still nil
      Thread.current[:app_user]
      # Upload to S3 fails...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We had &lt;a href=&#34;https://softwareengineering.stackexchange.com/questions/420986/what-is-the-meaning-of-fan-out&#34;&gt;fanned out&lt;/a&gt; user uploads to take advantage of parallelized IO. The problem was we tried to use fiber-local state and that gets reset in new Fibers.&lt;/p&gt;
&lt;p&gt;To solve it, we tried true thread-local state with &lt;code&gt;thread_variable_get&lt;/code&gt; and &lt;code&gt;thread_variable_set&lt;/code&gt;.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# This is #&amp;lt;User id=123&amp;gt;
Thread.current.thread_variable_get(:app_user)
Async do
  # Still #&amp;lt;User id=123&amp;gt;
  Thread.current.thread_variable_get(:app_user)
  params[:files].each do |file|
    Async {
      # Still #&amp;lt;User id=123&amp;gt;!
      Thread.current.thread_variable_get(:app_user)
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;On a regular threaded server, that’s ok. It’s exactly how &lt;code&gt;CurrentAttributes&lt;/code&gt; would function by default:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext &amp;lt; ActiveSupport::CurrentAttributes
  attribute :user, :locale

  def user=(user)
    super
    self.locale = user.locale
  end
end

# This is #&amp;lt;User id=123&amp;gt;
AppContext.user
Async do
  # Still #&amp;lt;User id=123&amp;gt;
  AppContext.user
  params[:files].each do |file|
    Async {
      # Still #&amp;lt;User id=123&amp;gt;!
      AppContext.user
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;That’s because &lt;code&gt;CurrentAttributes&lt;/code&gt; works off of an &lt;code&gt;isolation_level&lt;/code&gt;. It internally uses &lt;code&gt;ActiveSupport::IsolatedExecutionContext&lt;/code&gt;, which is an abstraction on top of Thread/Fiber-local state. It has two possible &lt;code&gt;isolation_level&lt;/code&gt;s:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;:thread&lt;/code&gt; - This is the default, and works exactly like our code using &lt;code&gt;thread_variable_get&lt;/code&gt;&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;:fiber&lt;/code&gt; - This is the other available option, and works exactly like our code using &lt;code&gt;Thread.current[]&lt;/code&gt;&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;ActiveSupport&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;IsolatedExecutionContext&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;isolation_level &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:thread&lt;/span&gt; &lt;span style=&#34;color:#75715e&#34;&gt;# or :fiber&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The reason this abstraction matters is because of the introduction of Fiber-based servers.&lt;/p&gt;
&lt;h2 id=&#34;careful-with-the-falcons&#34;&gt;Careful with the Falcons&lt;/h2&gt;
&lt;p&gt;If you look into the &lt;code&gt;async&lt;/code&gt; ecosystem, pretty quickly you’ll find a full-featured web server called &lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;&lt;code&gt;falcon&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://raw.githubusercontent.com/socketry/falcon/main/assets/logo.webp&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Falcon is a multi-process, multi-fiber rack-compatible HTTP server built on top of async, async-container and async-http. Each request is executed within a lightweight fiber and can block on up-stream requests without stalling the entire server process. Falcon supports HTTP/1 and HTTP/2 natively.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;code&gt;falcon&lt;/code&gt; is Fiber-based, meaning each request is run in a new Fiber, rather than a new Thread. Effectively, as it accepts each socket connection it hands it off to a new Fiber:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Async do # Fiber.new
  socket.accept do
    # Instead of Thread.new or a Thread pool
    Async {} # Fiber.new
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;By default, &lt;code&gt;falcon&lt;/code&gt; will set the &lt;code&gt;IsolatedExecutionContext.isolation_level&lt;/code&gt; for you to use Fibers:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ActiveSupport::IsolatedExecutionContext.isolation_level = :fiber
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Which means it localizes its state at the Fiber level. We’ve got our original problem again:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# This is #&amp;lt;User id=123&amp;gt;
AppContext.user
Async do
  # This is nil
  AppContext.user
  params[:files].each do |file|
    Async {
      # Still nil
      AppContext.user
      # Upload to S3 fails...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;What would happen if you set the &lt;code&gt;isolation_level&lt;/code&gt; back to &lt;code&gt;:thread&lt;/code&gt;?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;ActiveSupport&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;::&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;IsolatedExecutionContext&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;isolation_level &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;:thread&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-rb&#34; data-lang=&#34;rb&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;class&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;ImagesController&lt;/span&gt;
  before_action &lt;span style=&#34;color:#e6db74&#34;&gt;:set_user&lt;/span&gt;
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;def&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;create&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# May be a user from a different fiber!&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;# They all live in the same Thread, so they share&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;#   thread-local state&lt;/span&gt;
      &lt;span style=&#34;color:#66d9ef&#34;&gt;AppContext&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;app_user
      params&lt;span style=&#34;color:#f92672&#34;&gt;[&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:files&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;].&lt;/span&gt;each &lt;span style=&#34;color:#66d9ef&#34;&gt;do&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;file&lt;span style=&#34;color:#f92672&#34;&gt;|&lt;/span&gt;
        &lt;span style=&#34;color:#66d9ef&#34;&gt;Async&lt;/span&gt; { &lt;span style=&#34;color:#66d9ef&#34;&gt;S3Upload&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;new(file)&lt;span style=&#34;color:#f92672&#34;&gt;.&lt;/span&gt;call }
      &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
  &lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;end&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-consistent-state.drawio.png&#34; width=&#34;539&#34; height=&#34;299&#34; alt=&#34;&#34;&gt;
&lt;p&gt;Bad things. We start running into issues with shared global state again, because all of our fibers share the same Thread! We &lt;em&gt;need&lt;/em&gt; to be Fiber-local when running on a Fiber-based server.&lt;/p&gt;
&lt;p&gt;If Thread-locals are not safe on a Fiber-based server, how can we safely share this state?&lt;/p&gt;
&lt;h2 id=&#34;resetting-context&#34;&gt;Resetting context&lt;/h2&gt;
&lt;p&gt;One less than ideal option is to just reset the state in each nested Fiber:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# This is #&amp;lt;User id=123&amp;gt;
user = AppContext.user
Async do
  AppContext.user = user
  # This is #&amp;lt;User id=123&amp;gt;
  AppContext.user
  params[:files].each do |file|
    Async {
      AppContext.user = user
      # This is #&amp;lt;User id=123&amp;gt;
      AppContext.user
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You don’t &lt;em&gt;have&lt;/em&gt; to set it on every level - just the level that actually needs it. But you probably want to set it on every level to not surprise yourself later.&lt;/p&gt;
&lt;p&gt;This &lt;em&gt;works&lt;/em&gt;, but it&amp;rsquo;s manual and clunky.&lt;/p&gt;
&lt;h2 id=&#34;fiber-storage&#34;&gt;Fiber storage&lt;/h2&gt;
&lt;p&gt;There’s a new option available as of Ruby 3.2.&lt;/p&gt;
&lt;p&gt;It’s a new kind of “local”, called &lt;a href=&#34;https://docs.ruby-lang.org/en/master/Fiber.html#method-c-5B-5D&#34;&gt;Fiber storage&lt;/a&gt;&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;, and it was designed to help with precisely this type of issue:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Fiber[:app_user] = user

# This is #&amp;lt;User id=123&amp;gt;
Fiber[:app_user]
Async do
  # This is #&amp;lt;User id=123&amp;gt;
  Fiber[:app_user]
  params[:files].each do |file|
    Async {
      # This is #&amp;lt;User id=123&amp;gt;
      Fiber[:app_user]
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Fiber storage is a mechanism for inheriting state from a Fiber to any Ractors/Threads/Fibers started from that scope.&lt;/p&gt;
&lt;p&gt;The best term for this type of storage is “request-local”. The definition of “request” I’m using here is very loose - it just means the context of some particular slice of execution. That might mean a web page request, a background job run, a cron job, etc.&lt;/p&gt;
&lt;p&gt;So we &lt;em&gt;could&lt;/em&gt; create a new &lt;code&gt;AppContext&lt;/code&gt; using this approach:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  class &amp;lt;&amp;lt; self
    def user
      Fiber[:app_user]
    end

    def user=(user)
      Fiber[:app_user] = user
      self.locale = user.locale
    end

    def locale
      Fiber[:app_locale]
    end

    def locale=(locale)
      Fiber[:app_locale] = locale
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;With this new &lt;code&gt;AppContext&lt;/code&gt;, things are working again!&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Async do
  # This is #&amp;lt;User id=123&amp;gt;
  AppContext.user
  params[:files].each do |file|
    Async {
      # This is #&amp;lt;User id=123&amp;gt;
      AppContext.user
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Even more, this works for Threads too:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Thread.new do
  # This is #&amp;lt;User id=123&amp;gt;
  AppContext.user
  params[:files].each do |file|
    Thread.new {
      # This is #&amp;lt;User id=123&amp;gt;
      AppContext.user
      # Upload to S3...
    }
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;And&lt;/em&gt; Ractors:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Ractor.new do
  # This is #&amp;lt;User id=123&amp;gt;
  AppContext.user
  Thread.new do
    # This is #&amp;lt;User id=123&amp;gt;
    AppContext.user
  end.join
end
&lt;/code&gt;&lt;/pre&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-inherited-fiber-storage.drawio.png&#34; width=&#34;599&#34; height=&#34;359&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 If it works for Ractors, Threads and Fibers, why call it “Fiber storage”?&lt;/p&gt;
&lt;p&gt;Fibers are the innermost level of concurrency in Ruby (Process -&amp;gt; Ractor -&amp;gt; Thread -&amp;gt; Fiber). Since all code operates within a Fiber scope, it makes sense to have it be the storage layer for local state.&lt;/p&gt;
&lt;p&gt;Every time you create a new Ractor/Thread, a new Fiber is created within it. Which means the thing actually inheriting the Fiber storage is the Fiber that exists within that new context, not the Ractor/Thread itself.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;An ideal future state of libraries is that everything handling “request” type state would move to using Fiber storage (CurrentAttributes, RequestStore, and friends).&lt;/p&gt;
&lt;h2 id=&#34;caveats&#34;&gt;Caveats&lt;/h2&gt;
&lt;p&gt;Like with any new approach in an existing, robust ecosystem, there are some caveats to consider.&lt;/p&gt;
&lt;h3 id=&#34;it-requires-framework-buy-in&#34;&gt;It requires framework buy-in&lt;/h3&gt;
&lt;p&gt;Every request needs to be wrapped in a new Fiber with fresh storage, so it doesn’t accidentally inherit cross-request storage:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# In a library like Puma/Sidekiq/Pitchfork
def new_request
  Fiber.new(storage: nil) do
    # fresh storage for this request
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Falcon already handles this for you. But no other framework does (yet). If you aren’t using Falcon, this may not be a viable option for you yet.&lt;/p&gt;
&lt;h3 id=&#34;each-layer-inherits-a-copy&#34;&gt;Each layer inherits a copy&lt;/h3&gt;
&lt;p&gt;Each layer copies the keys and values from the current Fiber storage scope into a new hash. You can’t override entries from the parent scope, so overrides only impact lower levels.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Fiber[:value] = 1
Fiber.new do
  Fiber[:value] = 2
end
Fiber[:value] # =&amp;gt; 1
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is intentional, but just something to keep in mind.&lt;/p&gt;
&lt;h3 id=&#34;some-things-arent-meant-to-be-shared&#34;&gt;Some things aren’t meant to be shared&lt;/h3&gt;
&lt;p&gt;As you fan out there are things you likely want to share, like the current user. But there are things you don’t want to, or can’t - like database connections or file handles.&lt;/p&gt;
&lt;p&gt;That means existing solutions that try to share concepts (like &lt;code&gt;CurrentAttributes&lt;/code&gt; and &lt;code&gt;IsolatedExecutionContext&lt;/code&gt;) will need to split their approach. For a concept like &lt;code&gt;CurrentAttributes&lt;/code&gt; you want inherited Fiber storage. For a database connection, you want things fiber-local so each Fiber has its own connection.&lt;/p&gt;
&lt;h2 id=&#34;takeaways&#34;&gt;Takeaways&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Use an abstraction, like &lt;code&gt;CurrentAttributes&lt;/code&gt;. Over time, these abstractions should catch up with available options like Fiber storage and you’ll reap the benefits&lt;/li&gt;
&lt;li&gt;If you’re using Falcon, Fiber storage is a great option for sharing state across child Fibers/Threads/Ractors&lt;/li&gt;
&lt;li&gt;If you’re not using Falcon, there’s still some work for the community to catch up with this new option, so use it carefully or avoid it for now&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;onward&#34;&gt;Onward!&lt;/h2&gt;
&lt;p&gt;Ruby 3+ has introduced interesting new options for concurrency like the FiberScheduler/Fiber storage and thanks to many community efforts, things are starting to catch up. But there’s still work to be done and new abstractions needed to handle things seamlessly.&lt;/p&gt;
&lt;p&gt;Thanks to Samuel Williams&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt; (&lt;a href=&#34;https://x.com/ioquatix&#34;&gt;@ioquatix&lt;/a&gt;, author of Async, Falcon, and the FiberScheduler) for suggesting the clarification and reviewing this post!&lt;/p&gt;
&lt;p&gt;Ok, &lt;em&gt;now&lt;/em&gt; we’ll be heading over to “Ruby methods are colorless”. More soon 👋🏼.&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Some readers may be thinking “oh thank god, finally a brief one” 😆&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It works the same but it’s not actually using that internally. It monkey-patches Thread and adds a new &lt;code&gt;attr_accessor&lt;/code&gt;. If someone cares, I could talk about it more!&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Also not actually using that internally, same as with &lt;code&gt;:thread&lt;/code&gt;. It monkey-patches Fiber and adds a new &lt;code&gt;attr_accessor&lt;/code&gt;. If someone cares, I could talk about it more as well!&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;There’s a bit more context in the original proposal &lt;a href=&#34;https://bugs.ruby-lang.org/issues/19078&#34;&gt;https://bugs.ruby-lang.org/issues/19078&lt;/a&gt; and in the docs for creating new Fibers &lt;a href=&#34;https://docs.ruby-lang.org/en/master/Fiber.html#method-c-new&#34;&gt;https://docs.ruby-lang.org/en/master/Fiber.html#method-c-new&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Aka Mr Async, aka The Fiber Fella&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/image-6-30-24-6-25pm.png)

&gt; 👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into 12 main parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt;   - Consistent, request-local state
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “Consistent, request-local state”. I’ll update the links as each part is released, and include these links in each post.

Rather than a full entry in the series, this is a brief-ish[^1] concurrency mini - a follow up to [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html).

## Recap
In [Sharing state with fibers](https://jpcamara.com/2024/06/23/your-ruby-programs.html#sharing-state-with-fibers), we walked through a scenario where you had fiber-local state using `Thread.current[]`, and it was not available inside nested Fibers.

	# This is #&lt;User id=123&gt;
	Thread.current[:app_user]
	Async do # equivalent to Fiber.new
	  # This is nil
	  Thread.current[:app_user]
	  params[:files].each do |file|
	    Async { # Fiber.new
	      # Still nil
	      Thread.current[:app_user]
	      # Upload to S3 fails...
	    }
	  end
	end

We had [fanned out](https://softwareengineering.stackexchange.com/questions/420986/what-is-the-meaning-of-fan-out) user uploads to take advantage of parallelized IO. The problem was we tried to use fiber-local state and that gets reset in new Fibers.

To solve it, we tried true thread-local state with `thread_variable_get` and `thread_variable_set`. 

	# This is #&lt;User id=123&gt;
	Thread.current.thread_variable_get(:app_user)
	Async do
	  # Still #&lt;User id=123&gt;
	  Thread.current.thread_variable_get(:app_user)
	  params[:files].each do |file|
	    Async {
	      # Still #&lt;User id=123&gt;!
	      Thread.current.thread_variable_get(:app_user)
	      # Upload to S3...
	    }
	  end
	end

On a regular threaded server, that’s ok. It’s exactly how `CurrentAttributes` would function by default:

	class AppContext &lt; ActiveSupport::CurrentAttributes
	  attribute :user, :locale
	
	  def user=(user)
	    super
	    self.locale = user.locale
	  end
	end

	# This is #&lt;User id=123&gt;
	AppContext.user
	Async do
	  # Still #&lt;User id=123&gt;
	  AppContext.user
	  params[:files].each do |file|
	    Async {
	      # Still #&lt;User id=123&gt;!
	      AppContext.user
	      # Upload to S3...
	    }
	  end
	end

That’s because `CurrentAttributes` works off of an `isolation_level`. It internally uses `ActiveSupport::IsolatedExecutionContext`, which is an abstraction on top of Thread/Fiber-local state. It has two possible `isolation_level`s:

- `:thread` - This is the default, and works exactly like our code using `thread_variable_get`[^2]
- `:fiber` - This is the other available option, and works exactly like our code using `Thread.current[]`[^3]

```rb
ActiveSupport::IsolatedExecutionContext.isolation_level = :thread # or :fiber
```
The reason this abstraction matters is because of the introduction of Fiber-based servers.

## Careful with the Falcons
If you look into the `async` ecosystem, pretty quickly you’ll find a full-featured web server called [`falcon`](https://github.com/socketry/falcon).

![](https://raw.githubusercontent.com/socketry/falcon/main/assets/logo.webp)

&gt; Falcon is a multi-process, multi-fiber rack-compatible HTTP server built on top of async, async-container and async-http. Each request is executed within a lightweight fiber and can block on up-stream requests without stalling the entire server process. Falcon supports HTTP/1 and HTTP/2 natively.

`falcon` is Fiber-based, meaning each request is run in a new Fiber, rather than a new Thread. Effectively, as it accepts each socket connection it hands it off to a new Fiber:

	Async do # Fiber.new
	  socket.accept do
	    # Instead of Thread.new or a Thread pool
	    Async {} # Fiber.new
	  end
	end

By default, `falcon` will set the `IsolatedExecutionContext.isolation_level` for you to use Fibers:

	ActiveSupport::IsolatedExecutionContext.isolation_level = :fiber

Which means it localizes its state at the Fiber level. We’ve got our original problem again:

	# This is #&lt;User id=123&gt;
	AppContext.user
	Async do
	  # This is nil
	  AppContext.user
	  params[:files].each do |file|
	    Async {
	      # Still nil
	      AppContext.user
	      # Upload to S3 fails...
	    }
	  end
	end

What would happen if you set the `isolation_level` back to `:thread`?

```rb
ActiveSupport::IsolatedExecutionContext.isolation_level = :thread
```
```rb
class ImagesController
  before_action :set_user
	
  def create
    Async do
      # May be a user from a different fiber!
      # They all live in the same Thread, so they share
      #   thread-local state
      AppContext.app_user
      params[:files].each do |file|
        Async { S3Upload.new(file).call }
      end
    end
  end
end
```
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-consistent-state.drawio.png&#34; width=&#34;539&#34; height=&#34;299&#34; alt=&#34;&#34;&gt;

Bad things. We start running into issues with shared global state again, because all of our fibers share the same Thread! We _need_ to be Fiber-local when running on a Fiber-based server.

If Thread-locals are not safe on a Fiber-based server, how can we safely share this state?

## Resetting context
One less than ideal option is to just reset the state in each nested Fiber:

	# This is #&lt;User id=123&gt;
	user = AppContext.user
	Async do
	  AppContext.user = user
	  # This is #&lt;User id=123&gt;
	  AppContext.user
	  params[:files].each do |file|
	    Async {
	      AppContext.user = user
	      # This is #&lt;User id=123&gt;
	      AppContext.user
	      # Upload to S3...
	    }
	  end
	end

You don’t _have_ to set it on every level - just the level that actually needs it. But you probably want to set it on every level to not surprise yourself later.

This _works_, but it&#39;s manual and clunky.

## Fiber storage
There’s a new option available as of Ruby 3.2.

It’s a new kind of “local”, called [Fiber storage](https://docs.ruby-lang.org/en/master/Fiber.html#method-c-5B-5D)[^4], and it was designed to help with precisely this type of issue:

	Fiber[:app_user] = user
	
	# This is #&lt;User id=123&gt;
	Fiber[:app_user]
	Async do
	  # This is #&lt;User id=123&gt;
	  Fiber[:app_user]
	  params[:files].each do |file|
	    Async {
	      # This is #&lt;User id=123&gt;
	      Fiber[:app_user]
	      # Upload to S3...
	    }
	  end
	end

Fiber storage is a mechanism for inheriting state from a Fiber to any Ractors/Threads/Fibers started from that scope.

The best term for this type of storage is “request-local”. The definition of “request” I’m using here is very loose - it just means the context of some particular slice of execution. That might mean a web page request, a background job run, a cron job, etc.

So we _could_ create a new `AppContext` using this approach:

	class AppContext
	  class &lt;&lt; self
	    def user
	      Fiber[:app_user]
	    end
	
	    def user=(user)
	      Fiber[:app_user] = user
	      self.locale = user.locale
	    end
	
	    def locale
	      Fiber[:app_locale]
	    end
	
	    def locale=(locale)
	      Fiber[:app_locale] = locale
	    end
	  end
	end

With this new `AppContext`, things are working again!

	Async do
	  # This is #&lt;User id=123&gt;
	  AppContext.user
	  params[:files].each do |file|
	    Async {
	      # This is #&lt;User id=123&gt;
	      AppContext.user
	      # Upload to S3...
	    }
	  end
	end

Even more, this works for Threads too:

	Thread.new do
	  # This is #&lt;User id=123&gt;
	  AppContext.user
	  params[:files].each do |file|
	    Thread.new {
	      # This is #&lt;User id=123&gt;
	      AppContext.user
	      # Upload to S3...
	    }
	  end
	end

_And_ Ractors:

	Ractor.new do
	  # This is #&lt;User id=123&gt;
	  AppContext.user
	  Thread.new do
	    # This is #&lt;User id=123&gt;
	    AppContext.user
	  end.join
	end

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-inherited-fiber-storage.drawio.png&#34; width=&#34;599&#34; height=&#34;359&#34; alt=&#34;&#34;&gt;

&gt; 📝 If it works for Ractors, Threads and Fibers, why call it “Fiber storage”? 
&gt; 
&gt; Fibers are the innermost level of concurrency in Ruby (Process -\&gt; Ractor -\&gt; Thread -\&gt; Fiber). Since all code operates within a Fiber scope, it makes sense to have it be the storage layer for local state. 
&gt; 
&gt; Every time you create a new Ractor/Thread, a new Fiber is created within it. Which means the thing actually inheriting the Fiber storage is the Fiber that exists within that new context, not the Ractor/Thread itself.

An ideal future state of libraries is that everything handling “request” type state would move to using Fiber storage (CurrentAttributes, RequestStore, and friends).

## Caveats
Like with any new approach in an existing, robust ecosystem, there are some caveats to consider.

### It requires framework buy-in
Every request needs to be wrapped in a new Fiber with fresh storage, so it doesn’t accidentally inherit cross-request storage:

	# In a library like Puma/Sidekiq/Pitchfork
	def new_request
	  Fiber.new(storage: nil) do
	    # fresh storage for this request
	  end
	end

Falcon already handles this for you. But no other framework does (yet). If you aren’t using Falcon, this may not be a viable option for you yet.

### Each layer inherits a copy
Each layer copies the keys and values from the current Fiber storage scope into a new hash. You can’t override entries from the parent scope, so overrides only impact lower levels.

	Fiber[:value] = 1
	Fiber.new do
	  Fiber[:value] = 2
	end
	Fiber[:value] # =&gt; 1

This is intentional, but just something to keep in mind.

### Some things aren’t meant to be shared
As you fan out there are things you likely want to share, like the current user. But there are things you don’t want to, or can’t - like database connections or file handles.

That means existing solutions that try to share concepts (like `CurrentAttributes` and `IsolatedExecutionContext`) will need to split their approach. For a concept like `CurrentAttributes` you want inherited Fiber storage. For a database connection, you want things fiber-local so each Fiber has its own connection.
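
As a sketch of how that split might look (assuming a FiberScheduler is set, e.g. under Falcon or inside an `Async` block):

```rb
require &#34;async&#34;
	
Fiber[:request_id] = 123           # inherited Fiber storage
Thread.current[:db_conn] = &#34;conn&#34;  # fiber-local, not inherited
	
Async do
  puts Fiber[:request_id].inspect        # =&gt; 123, inherited
  puts Thread.current[:db_conn].inspect  # =&gt; nil, each fiber gets its own
end
```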

## Takeaways
- Use an abstraction, like `CurrentAttributes`. Over time, these abstractions should catch up with available options like Fiber storage and you’ll reap the benefits
- If you’re using Falcon, Fiber storage is a great option for sharing state across child Fibers/Threads/Ractors
- If you’re not using Falcon, there’s still some work for the community to catch up with this new option, so use it carefully or avoid it for now

## Onward!
Ruby 3+ has introduced interesting new options for concurrency like the FiberScheduler/Fiber storage and thanks to many community efforts, things are starting to catch up. But there’s still work to be done and new abstractions needed to handle things seamlessly.

Thanks to Samuel Williams[^5] ([@ioquatix](https://x.com/ioquatix), author of Async, Falcon, and the FiberScheduler) for suggesting the clarification and reviewing this post!

Ok, _now_ we’ll be heading over to “Ruby methods are colorless”. More soon 👋🏼.

[^1]:	Some readers may be thinking “oh thank god, finally a brief one” 😆 

[^2]:	It works the same but it’s not actually using that internally. It monkey-patches Thread and adds a new `attr_accessor`. If someone cares, I could talk about it more!

[^3]:	Also not actually using that internally, same as with `:thread`. It monkey-patches Fiber and adds a new `attr_accessor`. If someone cares, I could talk about it more as well!

[^4]:	There’s a bit more context in the original proposal [https://bugs.ruby-lang.org/issues/19078](https://bugs.ruby-lang.org/issues/19078) and in the docs for creating new Fibers [https://docs.ruby-lang.org/en/master/Fiber.html#method-c-new](https://docs.ruby-lang.org/en/master/Fiber.html#method-c-new)

[^5]:	Aka Mr Async, aka The Fiber Fella
</source:markdown>
    </item>
    
    <item>
      <title>Your Ruby programs are always multi-threaded: Part 2</title>
      <link>https://jpcamara.com/2024/06/26/your-ruby-programs-are-always.html</link>
      <pubDate>Wed, 26 Jun 2024 03:57:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/06/26/your-ruby-programs-are-always.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/2ffdf3bdb5.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into 12 main parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Your Ruby programs are always multi-threaded: Part 2
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “Your Ruby programs are always multi-threaded: Part 2”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;&lt;strong&gt;Part 1&lt;/strong&gt;&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#its-all-threaded&#34;&gt;It’s all threaded&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#how-threaded-is-it&#34;&gt;Ok but how threaded is it &lt;em&gt;really&lt;/em&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#threading-mistakes&#34;&gt;Threading mistakes&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#sharing-ivars&#34;&gt;Sharing module/class instance variables&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs&#34;&gt;Heisenbugs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#ruby-internals&#34;&gt;Ruby internals&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#back-in-reality&#34;&gt;Back in reality…&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;2&#34;&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#copying-state&#34;&gt;Copying state to instances&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#back-in-reality-two&#34;&gt;Back in reality… part two&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;3a. &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#cleaning-up&#34;&gt;Cleaning up thread state&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#using-sidekiq&#34;&gt;If you’re using Sidekiq&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html#using-activejob&#34;&gt;If you’re using ActiveJob&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;em&gt;&lt;strong&gt;Part 2&lt;/strong&gt;&lt;/em&gt;
&lt;ul&gt;
&lt;li&gt;3b. &lt;a href=&#34;#sharing-state-with-fibers&#34;&gt;Sharing state with fibers&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#wild-fiber&#34;&gt;A wild Fiber appeared!&lt;/a&gt; 🌾&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#ruby-concurrency-layers&#34;&gt;Ruby concurrency has layers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#actual-thread-locals&#34;&gt;Actual thread-locals&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#current-attributes&#34;&gt;CurrentAttributes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#request-store&#34;&gt;RequestStore&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#back-in-reality-three&#34;&gt;Back in reality… part three&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;4&#34;&gt;
&lt;li&gt;&lt;a href=&#34;#reusing-objects&#34;&gt;Reusing objects&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;5&#34;&gt;
&lt;li&gt;&lt;a href=&#34;#race-conditions&#34;&gt;Race conditions&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#threading-concepts&#34;&gt;Threading concepts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#check-then-act&#34;&gt;Memoization / check-then-act&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#read-modify-write&#34;&gt;read-modify-write&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#coordinating-jobs&#34;&gt;Coordinating jobs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#read-modify-write-for-days&#34;&gt;read-modify-write shows up in many ways&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#parting-thoughts&#34;&gt;Parting thoughts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#tips-for-gems&#34;&gt;Tips for auditing gems&lt;/a&gt; 💎&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#using-falcon&#34;&gt;I use Falcon - Fibers are safe from this right?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#using-jruby-truffle&#34;&gt;I use JRuby/TruffleRuby…&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#reforking&#34;&gt;Preforking / Reforking servers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#mutexes-for-globals&#34;&gt;What about using mutexes for globals?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#detect-and-correct&#34;&gt;It’s not doom and gloom, it’s detect and correct&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#takeaways&#34;&gt;Takeaways&lt;/a&gt; 🥡&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you haven’t read “&lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/a&gt;”, you’re better off starting there. You’ll still get value out of this post, but you’ll get the full context by reading both. All of the Table of Contents links under &lt;strong&gt;Part 1&lt;/strong&gt; link back to that previous article &lt;a href=&#34;https://jpcamara.com/2024/06/04/your-ruby-programs.html&#34;&gt;↩️&lt;/a&gt;.&lt;/p&gt;
&lt;h3 id=&#34;sharing-state-with-fibers&#34;&gt;3b. Sharing state with fibers&lt;/h3&gt;
&lt;p&gt;Now that you’re properly clearing out thread-local state between requests/jobs, you have a new feature to write. Up until this point you could only add an image from a URL - users want to upload images from their computer.&lt;/p&gt;
&lt;p&gt;It seems complex to take a file from the front end and somehow get it to a background job. So you’re going to do it in your controller - but you want to minimize the chance of timing out while uploading a bunch of files.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 Since this example app is built with Rails, it should instead be using ActiveStorage. But let’s just play along at home 😌&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You’ve heard of an interesting library called &lt;code&gt;async&lt;/code&gt; that makes parallelizing IO&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt; pretty easy and is more predictable than threads. It does it using this special Ruby 3+ FiberScheduler thing 🤔.&lt;/p&gt;
&lt;p&gt;As a test you try uploading some files to S3 in the controller. You create an &lt;code&gt;Async&lt;/code&gt; block, and inside of it you create as many &lt;code&gt;Async&lt;/code&gt; blocks as there are blocking operations to parallelize:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ImagesController
  before_action :set_user

  def create
    Async do
      params[:files].each do |file|
        Async { S3Upload.new(file).call }
      end
    end
  end
end

class S3Upload
  def initialize(file)
    @file = file
  end

  def call
    object.put(body: @file)
  end

  private

  def object
    client = Aws::S3::Resource.new
    bucket = client.bucket(ENV[&amp;quot;bucket&amp;quot;])
    bucket.object(object_path)
  end

  def object_path
    @file
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-page-5.drawio-1.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You test this and they all upload in parallel! Nice! Your request takes as long as the longest upload, and no longer than that. But uploading random file names at the top level of your S3 bucket is not ideal, so you prefix the files with the current user id:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# ...
Async do
  S3Upload.new(file, AppContext.user).call
end
# ...

class S3Upload
  def initialize(file, user)
    @file = file
    @user = user
  end

  # ...

  def object_path
    name = @file
    &amp;quot;#{@user.id}/#{name}&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You test this and… it raises a &lt;code&gt;NoMethodError&lt;/code&gt; on &lt;code&gt;NilClass&lt;/code&gt; when trying to get &lt;code&gt;user.id&lt;/code&gt;. Huh? You’re sure the user is defined so you print some information:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ImagesController
  before_action :set_user

  def create
    puts &amp;quot;user[#{AppContext.user}]&amp;quot;
    # ...
  end
end

# ...
def object_path
  name = @file
  puts &amp;quot;upload user[#{AppContext.user}]&amp;quot;
  puts &amp;quot;upload @user[#{@user}]&amp;quot;
  &amp;quot;#{@user.id}/#{name}&amp;quot;
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Which prints:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;user[#&amp;lt;User id=123&amp;gt;]
upload user[]
upload @user[]
NoMethodError...
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h4 id=&#34;wild-fiber&#34;&gt;A wild Fiber appeared!&lt;/h4&gt;
&lt;p&gt;&lt;code&gt;AppContext.user&lt;/code&gt; is nil when we’re inside that async block… I think it’s time to look at those &lt;code&gt;Thread.current[]&lt;/code&gt; &lt;a href=&#34;https://docs.ruby-lang.org/en/3.0/Thread.html#method-i-5B-5D&#34;&gt;docs again&lt;/a&gt;…&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Thread#[]&lt;/p&gt;
&lt;p&gt;Returns the value of a fiber-local variable&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Well that doesn’t seem right. Doesn’t everyone call them thread-locals? They’re not thread-local… they’re &lt;em&gt;fiber&lt;/em&gt;-local?&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/9c0c11900d.jpeg&#34; width=&#34;80%&#34; height=&#34;80%&#34; alt=&#34;&#34;&gt;
&lt;p&gt;What exactly does it mean for it to be fiber-local? And how can you fix it so you can use your &lt;code&gt;AppContext&lt;/code&gt; within &lt;code&gt;Async&lt;/code&gt;?&lt;/p&gt;
&lt;h4 id=&#34;ruby-concurrency-layers&#34;&gt;Ruby concurrency has layers&lt;/h4&gt;
&lt;p&gt;The concurrency model in Ruby is multi-layered. The best way to describe it is as a nesting doll - as you pull away the outer layers, you find new layers of concurrency inside.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/7fee80c6bc.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In your own Ruby code, you can easily inspect each layer directly:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;puts &amp;quot;Process #{Process.pid}&amp;quot;
puts &amp;quot;  Ractor #{Ractor.current}&amp;quot;
puts &amp;quot;    Thread #{Thread.current}&amp;quot;
puts &amp;quot;      Fiber #{Fiber.current}&amp;quot;

# Process 123
#   Ractor #&amp;lt;Ractor:#1 running&amp;gt;
#     Thread #&amp;lt;Thread:0x0... run&amp;gt;
#       Fiber #&amp;lt;Fiber:0x0... (resumed)&amp;gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Your code is always operating in the context of a specific instance of each layer. Mostly you can access those instances with a call to &lt;code&gt;current&lt;/code&gt;. Your code runs within a Fiber, which is inside a Thread, which is inside a Ractor, which is inside a Process. The next posts in this series will dig into every layer, but for now you’ve got a fiber problem&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Internally, every &lt;code&gt;Async&lt;/code&gt; block is run in a new &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Fiber.html&#34;&gt;Fiber&lt;/a&gt;. That’s the innermost layer of Ruby concurrency. So that means if you have something that is stored as a &lt;em&gt;fiber&lt;/em&gt;-local, every time you create a new fiber, you get an empty, fresh set of locals.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Fiber.current #&amp;lt;Fiber&amp;gt;
puts &amp;quot;Main Fiber #{Fiber.current}&amp;quot;
puts Thread.current[:app_user]
Fiber.new do
  puts &amp;quot;Fiber.new #{Fiber.current}&amp;quot;
  puts Thread.current[:app_user]
end.resume
Async do
  puts &amp;quot;Async #{Fiber.current}&amp;quot;
  puts Thread.current[:app_user]
  Async do
    puts &amp;quot;  Async #{Fiber.current}&amp;quot;
    puts Thread.current[:app_user]
    Async do
      puts &amp;quot;    Async #{Fiber.current}&amp;quot;
      puts Thread.current[:app_user]
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Running the above code you get:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Main Fiber #&amp;lt;Fiber:0x00007a77a0b5bbb0 (resumed)&amp;gt;
#&amp;lt;User id=123&amp;gt;
  Fiber.new #&amp;lt;Fiber:0x00007a77a01766b8 main.rb:940 (resumed)&amp;gt;
	
    Async #&amp;lt;Fiber:0x00007a77a0175858 task.rb:326 (resumed)&amp;gt;
	
      Async #&amp;lt;Fiber:0x00007a77a0174fc0 task.rb:326 (resumed)&amp;gt;
	
        Async #&amp;lt;Fiber:0x00007a77a0174cf0 task.rb:326 (resumed)&amp;gt;
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;That’s well and good to know, you think, but how can you fix it? You don’t care about being fiber-local, you’re trying to be thread-local so these fibers can use your &lt;code&gt;AppContext&lt;/code&gt;🧵.&lt;/p&gt;
&lt;h4 id=&#34;actual-thread-locals&#34;&gt;Actual thread-locals&lt;/h4&gt;
&lt;p&gt;Those same &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Thread.html#method-i-5B-5D&#34;&gt;Thread.current docs&lt;/a&gt; point us in the right direction:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;For thread-local variables, see &lt;code&gt;thread_variable_get&lt;/code&gt; and &lt;code&gt;thread_variable_set&lt;/code&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Let’s update &lt;code&gt;AppContext&lt;/code&gt; to use that:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  class &amp;lt;&amp;lt; self
    def user
      Thread.current.thread_variable_get(:app_user)
    end

    def user=(user)
      Thread.current.thread_variable_set(:app_user, user)
      self.locale = user.locale
    end

    def locale
      Thread.current.thread_variable_get(:app_locale)
    end

    def locale=(locale)
      Thread.current.thread_variable_set(:app_locale, locale)
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’s a bit more verbose, but otherwise it feels the same. Run our upload code again and…&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;user[#&amp;lt;User id=123&amp;gt;]
upload user[#&amp;lt;User id=123&amp;gt;]
upload @user[#&amp;lt;User id=123&amp;gt;]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Success 🙌🏼. The data is now actually thread-local. It is still present inside of fibers because those fibers all belong to the current thread&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
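&lt;p&gt;To see the difference in isolation, here’s a tiny script you can run on its own - it just contrasts the two storage APIs:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;Thread.current[:app_user] = &amp;quot;fiber-local user&amp;quot;
Thread.current.thread_variable_set(:app_user, &amp;quot;thread-local user&amp;quot;)

Fiber.new do
  # A new fiber starts with empty fiber-locals...
  puts Thread.current[:app_user].inspect
  # =&amp;gt; nil

  # ...but thread-locals are shared with every fiber on the thread
  puts Thread.current.thread_variable_get(:app_user).inspect
  # =&amp;gt; &amp;quot;thread-local user&amp;quot;
end.resume
&lt;/code&gt;&lt;/pre&gt;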
&lt;p&gt;There are actually &lt;em&gt;two more&lt;/em&gt; core Ruby ways&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt; to handle this situation 😵‍💫. But we’ve covered the two most common options, and they solve the problem at hand. We’ll dig into those other options later in the series.&lt;/p&gt;
&lt;p&gt;This seems like it should be a solved problem, you think. Does everyone write this from scratch and encounter these same issues? Is there an easier way?&lt;/p&gt;
&lt;p&gt;Yes, thankfully.&lt;/p&gt;
&lt;h4 id=&#34;current-attributes&#34;&gt;CurrentAttributes&lt;/h4&gt;
&lt;p&gt;If you’re in Rails, or you use ActiveSupport, you should be using &lt;a href=&#34;https://api.rubyonrails.org/classes/ActiveSupport/CurrentAttributes.html&#34;&gt;&lt;code&gt;CurrentAttributes&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;CurrentAttributes&lt;/code&gt; provides the same basic behavior we built (plus more), and integrates seamlessly into both Rails controllers and jobs. It also has first class support from Sidekiq, even without Rails. Here’s the &lt;code&gt;AppContext&lt;/code&gt; code, now using &lt;code&gt;CurrentAttributes&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext &amp;lt; ActiveSupport::CurrentAttributes
  attribute :user, :locale

  def user=(user)
    super
    self.locale = user.locale
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;At first glance, it’s already a much simpler and clearer setup. But the real benefit is now we don’t have to worry about anything else.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;It hooks into the Rails executor lifecycle, so whenever a web request/job begins or completes, &lt;code&gt;AppContext.reset&lt;/code&gt; is called&lt;/li&gt;
&lt;li&gt;It internally uses &lt;code&gt;IsolatedExecutionState&lt;/code&gt;, which is an abstraction around thread and fiber-local state. It is configurable to use thread or fiber-local state, and by default it uses thread-local state so it works out of the box for your example Rails app&lt;/li&gt;
&lt;li&gt;Sidekiq offers &lt;a href=&#34;https://www.mikeperham.com/2022/07/29/sidekiq-and-request-specific-context/&#34;&gt;first class support for it&lt;/a&gt;, so even if you don’t use Rails you can opt-in to resetting behavior, and Sidekiq will automatically load it at the start of your job based on the execution state present when the job was enqueued&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Interestingly, &lt;code&gt;CurrentAttributes&lt;/code&gt; doesn’t use the &lt;code&gt;Thread.current[]&lt;/code&gt; or &lt;code&gt;Thread.thread_variable_get&lt;/code&gt; approaches. We’ll talk about that more in a later post.&lt;/p&gt;
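&lt;p&gt;For completeness, here’s roughly what wiring it up looks like - a minimal sketch, assuming the usual &lt;code&gt;session[:user_id]&lt;/code&gt; style of lookup:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ApplicationController &amp;lt; ActionController::Base
  before_action :set_app_context

  private

  def set_app_context
    # Set once per request - the Rails executor calls AppContext.reset
    # automatically when the request completes
    AppContext.user = User.find(session[:user_id])
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;And if you serve requests from fibers, Rails lets you flip &lt;code&gt;IsolatedExecutionState&lt;/code&gt; to fiber-level isolation with &lt;code&gt;config.active_support.isolation_level = :fiber&lt;/code&gt;.&lt;/p&gt;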
&lt;h4 id=&#34;request-store&#34;&gt;RequestStore&lt;/h4&gt;
&lt;p&gt;I would not recommend using this library. It only supports &lt;code&gt;Thread.current&lt;/code&gt;, and it only automatically clears itself for Rails controllers - you’re still on the hook for managing job middleware/callbacks! It will also fail the same way our example did inside an &lt;code&gt;Async&lt;/code&gt; block. You can learn more about it from its &lt;a href=&#34;https://github.com/steveklabnik/request_store&#34;&gt;GitHub repository&lt;/a&gt;.&lt;/p&gt;
&lt;h4 id=&#34;back-in-reality-three&#34;&gt;Back in reality, part three…&lt;/h4&gt;
&lt;p&gt;If this example seemed oddly specific again, that’s because this issue is present in some change tracking gems.&lt;/p&gt;
&lt;p&gt;In terms of &lt;a href=&#34;https://www.ruby-toolbox.com/projects/paper_trail&#34;&gt;monthly downloads&lt;/a&gt;, &lt;a href=&#34;https://github.com/paper-trail-gem/paper_trail&#34;&gt;&lt;code&gt;paper_trail&lt;/code&gt;&lt;/a&gt; is by far the most popular gem for change tracking. And internally it uses &lt;code&gt;request_store&lt;/code&gt; which means:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;It uses &lt;code&gt;Thread.current[]&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;It &lt;em&gt;does&lt;/em&gt; take care of Rails controller cleanup for you&lt;/li&gt;
&lt;li&gt;It &lt;em&gt;does not&lt;/em&gt; take care of job cleanup for you. If you set tracking information anywhere in your jobs, you are susceptible to leaking data between job runs 🙅🏻‍♂️.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you’re going to use &lt;code&gt;paper_trail&lt;/code&gt;, &lt;em&gt;make sure you clear out your state in jobs&lt;/em&gt;. Since it uses &lt;code&gt;request_store&lt;/code&gt; internally, you can clear out state using that.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# If using Sidekiq
class PaperTrailClearMiddleware
  include Sidekiq::ServerMiddleware

  def call(_worker, _job, _queue)
    yield
  ensure
    RequestStore.clear!
  end
end

Sidekiq.configure_server do |config|
  config.server_middleware do |chain|
    chain.add PaperTrailClearMiddleware
  end
end

# If using only ActiveJob
class ApplicationJob &amp;lt; ActiveJob::Base
  after_perform :clear_paper_trail_context

  def clear_paper_trail_context
    RequestStore.clear!
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;There is a gem that does this for Sidekiq only for you, called &lt;a href=&#34;https://github.com/madebylotus/request_store-sidekiq&#34;&gt;request_store-sidekiq&lt;/a&gt;. It hasn’t been updated in several years, but it does what my middleware example does, automatically for you.&lt;/p&gt;
&lt;p&gt;All change tracking gems use some type of thread/fiber state, so here’s the rundown on the remaining popular gems:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/collectiveidea/audited&#34;&gt;audited&lt;/a&gt; is the next most popular but it uses &lt;code&gt;CurrentAttributes&lt;/code&gt; internally, so you’re good!&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/palkan/logidze&#34;&gt;Logidze&lt;/a&gt; uses a block, so it’s basically impossible to leak data between threads. But, it uses &lt;code&gt;Thread.current[]&lt;/code&gt; internally so it will break if you nest fibers (like &lt;code&gt;Async&lt;/code&gt; blocks) inside of it&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/public-activity/public_activity&#34;&gt;PublicActivity&lt;/a&gt; uses &lt;code&gt;Thread.current[]&lt;/code&gt; but it’s unclear if it is cleaning it up appropriately. You’ll have to do your own due diligence there&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;reusing-objects&#34;&gt;4. Reusing objects&lt;/h3&gt;
&lt;p&gt;You know that single-threaded servers can still leak state between requests - like our fiber-local issue. But can they hit real threading issues? Let’s ask GitHub:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Us&lt;/strong&gt;: Hey GitHub!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: Ummm, hi?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Us&lt;/strong&gt;: You run Unicorn as your web server don’t you?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: Who are you? How did you get in here??&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Us&lt;/strong&gt;: Can a single-threaded server like Unicorn encounter threading issues?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: …&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: …yes 😔 &lt;a href=&#34;https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/&#34;&gt;https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;strong&gt;TL;DR&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Someone wrote threaded code to improve performance:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;after_action :run_expensive_op_in_thread
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It held a reference to a request environment object:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def run_expensive_op_in_thread
  Thread.new do
    env = request.env
    # some background logic
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;That request object got reused between requests - it just cleared its data out instead of creating a new instance:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def after_request
  request.env.clear
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In certain edge cases, sessions would get reused between users:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def run_expensive_op_in_thread
  Thread.new do
    if not_logged_in
      login(request.env)
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;So yes… it still involved &lt;em&gt;explicit&lt;/em&gt; threading, but in the context of a single-threaded server. This is the crux of that “Ruby is always multi-threaded” thing. Threads pop up all over the place even when you &lt;em&gt;think&lt;/em&gt; you’re not using them. If you defensively code assuming threads could eventually be involved, you reduce the risk of these types of issues popping up later.&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/race-condition-diagram.webp&#34; width=&#34;600&#34; height=&#34;284&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source:&lt;/strong&gt; &lt;a href=&#34;https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/&#34;&gt;https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The fix for the problem was to stop reusing environment objects between requests. Each new request created a new environment object, which means a thread could safely hold onto that environment information without anything leaking in from subsequent requests.&lt;/p&gt;
&lt;p&gt;Create new instances. Unless you know they are &lt;em&gt;incredibly&lt;/em&gt; expensive, it is not worth the risk of sharing them if it can be avoided.&lt;/p&gt;
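&lt;p&gt;To make that concrete, here’s a contrived sketch with the same shape as the bug - the names are hypothetical, but the structure mirrors the GitHub issue:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# Risky: one mutable env object, reused and cleared between requests
SHARED_ENV = {}

def handle_request(user_id)
  SHARED_ENV[:user_id] = user_id
  Thread.new do
    sleep 0.01 # simulate background work that outlives the request
    # May print nil, or a *later* request value
    puts SHARED_ENV[:user_id].inspect
  end
  SHARED_ENV.clear
end

# Safer: each request builds a fresh object for the thread to capture
def handle_request_safely(user_id)
  env = { user_id: user_id }
  Thread.new do
    sleep 0.01
    puts env[:user_id] # always the user from this request
  end
end
&lt;/code&gt;&lt;/pre&gt;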
&lt;h3 id=&#34;race-conditions&#34;&gt;5. Race conditions&lt;/h3&gt;
&lt;p&gt;You’ve been having a tough time with these bugs, yeesh. It’s as if some kind of external force is controlling you to intentionally introduce each of these issues for other people’s benefit. But that obviously can’t be true 🤫.&lt;/p&gt;
&lt;p&gt;You’ve got a new requirement. And this one seems pretty… concurrent. You need to coordinate some of your jobs - you split your work into several pieces, run them in individual jobs, then perform one final piece once every job has finished.&lt;/p&gt;
&lt;p&gt;How can you coordinate a set of jobs? Maybe there’s a concurrency concept you can try out!&lt;/p&gt;
&lt;p&gt;You read about something called a “Countdown Latch”. You tell it what to count down from, multiple threads can safely count down using it, and you can have another thread wait until it finishes:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;latch = CountdownLatch.new(10)
10.times.map do |x|
  Thread.new do
    puts &amp;quot;hello #{x}&amp;quot;
    latch.count_down
  end
end

latch.wait
puts &amp;quot;world!&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This prints out&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;hello 0
hello 1
hello 2
hello 3
hello 4
hello 5
hello 6
hello 7
hello 8
hello 9
world!
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;The results look ordered, but that’s just a coincidence of how it ran. The &lt;code&gt;CountdownLatch&lt;/code&gt; only coordinates completion - it does nothing to control the ordering&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h4 id=&#34;threading-concepts&#34;&gt;Threading concepts&lt;/h4&gt;
&lt;p&gt;When you call &lt;code&gt;latch.wait&lt;/code&gt;, the thread waits until the &lt;code&gt;count_down&lt;/code&gt; is complete. What would implementing a &lt;code&gt;CountdownLatch&lt;/code&gt; look like?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class CountdownLatch
  def initialize(count)
    @count = count
    @mutex = Mutex.new
    @cond = ConditionVariable.new
    puts &amp;quot;CountdownLatch initialized with a count of #{@count}&amp;quot;
  end

  def wait
    @mutex.synchronize do
      @cond.wait(@mutex) while @count &amp;gt; 0
    end
  end

  def count
    @mutex.synchronize { @count }
  end

  def count_down
    @mutex.synchronize do
      @count -= 1
      if @count == 0
        @cond.broadcast
      end
      @count
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 The &lt;a href=&#34;https://github.com/ruby-concurrency/concurrent-ruby&#34;&gt;&lt;code&gt;concurrent-ruby&lt;/code&gt;&lt;/a&gt; gem comes with its own &lt;code&gt;CountdownLatch&lt;/code&gt;. I’ve written my own here for demonstration - but if you have a need for one, use theirs, not mine&lt;/p&gt;
&lt;/blockquote&gt;
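&lt;p&gt;For reference, the &lt;code&gt;concurrent-ruby&lt;/code&gt; version has a nearly identical interface. A quick sketch of swapping it into the earlier example:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;concurrent&amp;quot;

latch = Concurrent::CountDownLatch.new(10)
10.times do |x|
  Thread.new do
    puts &amp;quot;hello #{x}&amp;quot;
    latch.count_down
  end
end

latch.wait # blocks until the count reaches zero
puts &amp;quot;world!&amp;quot;
&lt;/code&gt;&lt;/pre&gt;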
&lt;p&gt;A &lt;code&gt;CountdownLatch&lt;/code&gt; is a special form of a computer science concept called a &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Barrier_(computer_science)&#34;&gt;barrier&lt;/a&gt;. Barriers are used to help coordinate execution of multiple threads&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt; - a &lt;code&gt;CountdownLatch&lt;/code&gt; is a one-time use form of a barrier where a thread can wait until the barrier counts down to zero.&lt;/p&gt;
&lt;p&gt;To achieve that, you need a &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Thread/Mutex.html&#34;&gt;&lt;code&gt;Mutex&lt;/code&gt;&lt;/a&gt; and a &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Thread/ConditionVariable.html&#34;&gt;&lt;code&gt;ConditionVariable&lt;/code&gt;&lt;/a&gt;. &lt;code&gt;Mutex&lt;/code&gt;s and &lt;code&gt;ConditionVariable&lt;/code&gt;s are low-level tools available for coordinating access to resources in Ruby code. We’ll dig more into those in “Concurrent, colorless Ruby”.&lt;/p&gt;
&lt;p&gt;For now, know that a &lt;code&gt;Mutex&lt;/code&gt; can be used to safely lock a section of Ruby code. While you are in a &lt;code&gt;synchronize&lt;/code&gt; block, any other thread using the same &lt;code&gt;Mutex&lt;/code&gt; will wait until that block finishes. As far as the coordinating threads are concerned, anything that happens in that block happens atomically - individual pieces which would not normally be thread-safe are treated like a single, safe operation.&lt;/p&gt;
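&lt;p&gt;If “treated like a single, safe operation” feels abstract, here’s the classic demonstration - a shared counter, where each increment is secretly a read, a modify and a write:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;counter = 0
mutex = Mutex.new

threads = 10.times.map do
  Thread.new do
    1_000.times do
      # The mutex makes the three steps of += atomic
      # relative to the other threads
      mutex.synchronize { counter += 1 }
    end
  end
end

threads.each(&amp;amp;:join)
puts counter # =&amp;gt; 10000, every time
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;On CRuby the GVL often hides this particular race, but it’s still a race - JRuby and TruffleRuby will lose updates without the mutex.&lt;/p&gt;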
&lt;p&gt;Ok. We’ve got our coordinator ✅. Now let’s try applying it to our jobs.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class TickerJob
  include Sidekiq::Job

  def perform
    # do some work
    latch = CountdownLatch.new(10)
    remaining = latch.count_down
    puts &amp;quot;Tick! [#{remaining}]&amp;quot;
  end
end

class CoordinatorJob
  include Sidekiq::Job

  def perform
    latch = CountdownLatch.new(10)
    10.times do
      TickerJob.perform_async
    end
    latch.wait
    puts &amp;quot;All jobs finished!&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Which gives us…!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;CountdownLatch initialized with a count of 10
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Uhhhh… hmmm… well that’s not working properly. It never finishes, and every &lt;code&gt;count_down&lt;/code&gt; gives us &lt;code&gt;9&lt;/code&gt;. Oh right. You can’t create new countdowns everywhere you need them… you need to &lt;em&gt;gulp&lt;/em&gt;… &lt;em&gt;share&lt;/em&gt; them 😱.&lt;/p&gt;
&lt;p&gt;You know the &lt;code&gt;CountdownLatch&lt;/code&gt; class is thread safe, so that means multiple threads can use an instance of it together safely. You decide to use a global for now to try it out in your jobs… a class-level ivar 😅. Those are problematic, &lt;em&gt;but&lt;/em&gt; the countdown code is thread safe so it should be ok, right? This is your time to shine!&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/goldeneye.webp&#34; width=&#34;470&#34; height=&#34;200&#34; alt=&#34;&#34;&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  def self.latch
    @latch ||= CountdownLatch.new(10)
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Just to be safe you try it on some simple threads first…&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;10.times.map do
  Thread.new do
    latch = AppContext.latch
    # do some work
    puts &amp;quot;Count down! [#{latch.count_down}]&amp;quot;
  end
end

AppContext.latch.wait
&lt;/code&gt;&lt;/pre&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;CountdownLatch initialized with a count of 10
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Well… So much for thread safety? What’s the deal?&lt;/p&gt;
&lt;p&gt;You’re seeing &lt;code&gt;CountdownLatch initialized with a count&lt;/code&gt; repeatedly, so it seems that the instance gets created over and over. Is the &lt;code&gt;latch&lt;/code&gt; instance different every time?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;10.times.map do |_x|
  Thread.new do
    latch = AppContext.latch
    # do some work
    puts &amp;quot;Count down! [#{latch.count_down}]&amp;quot;
    puts &amp;quot;latch id #{latch.object_id}&amp;quot;
  end
end

AppContext.latch.wait
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;🫠&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;...
latch id 2700
...
latch id 2720
...
latch id 2740
...
latch id 2760
...
latch id 2780
...
latch id 2800
...
latch id 2820
...
latch id 2840
...
latch id 2860
...
latch id 2880
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You get a different latch every time.&lt;/p&gt;
&lt;h4 id=&#34;check-then-act&#34;&gt;Memoization / check-then-act&lt;/h4&gt;
&lt;p&gt;You’re hitting a race condition - multiple threads are racing to finish an operation. And this particular situation is so common it’s got a name: “check-then-act”.&lt;/p&gt;
&lt;p&gt;In Ruby, the most common case of this is with memoization.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 If you don’t know - memoization is one of those funny looking terms, and sounds a bit like if &lt;a href=&#34;https://youtu.be/nEe1cTDbXHU?si=OjyohItd-GTD8gdq&#34;&gt;the priest from princess bride&lt;/a&gt; tried to say “memorization”.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/mawwiage.webp&#34; width=&#34;480&#34; height=&#34;260&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;Mem-mwah-zatioooon&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Memoization is just a means for lazy-loading code, or sharing an expensive resource. The first time you need it, you initialize it and store it to a variable you can reuse, like an instance variable.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Memoization is ok within a single thread, but if you’re sharing an object then memoization can lead to “check-then-act” conditions. The &lt;code&gt;AppContext&lt;/code&gt; code is using memoization to share the &lt;code&gt;CountdownLatch&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;@latch ||= CountdownLatch.new(10)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It looks like a one-line call, but is actually multiple statements.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;if !@latch
  @latch = CountdownLatch.new(10)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You “check” if the &lt;code&gt;@latch&lt;/code&gt; is present, “then” you “act” on that information. If it’s present, you don’t do anything. If it isn’t, you create a new &lt;code&gt;CountdownLatch&lt;/code&gt; and set it to &lt;code&gt;@latch&lt;/code&gt;. Multiple threads cannot run this code in parallel, but they can swap between each other at inopportune times.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Thread 1 checks, and finds that &lt;code&gt;@latch&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;. It instantiates &lt;code&gt;CountdownLatch&lt;/code&gt; to set it to &lt;code&gt;@latch&lt;/code&gt;. The &lt;code&gt;initialize&lt;/code&gt; method in &lt;code&gt;CountdownLatch&lt;/code&gt; has a &lt;code&gt;puts&lt;/code&gt; statement. That’s blocking IO, so Thread 1 yields and…&lt;/li&gt;
&lt;li&gt;Thread 2 checks, and finds that &lt;code&gt;@latch&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;. As it instantiates &lt;code&gt;CountdownLatch&lt;/code&gt;, it yields due to the blocking IO from &lt;code&gt;puts&lt;/code&gt;…&lt;/li&gt;
&lt;li&gt;Thread 3 checks, and finds that &lt;code&gt;@latch&lt;/code&gt; is &lt;code&gt;nil&lt;/code&gt;… seeing a pattern?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The &lt;code&gt;puts&lt;/code&gt; statement exists very intentionally to force this condition. Without it, the race condition would happen much less frequently. But it could be a logger call, or it could open a connection to an external resource, or create a Tempfile&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;. Many things can easily cause the &lt;code&gt;initialize&lt;/code&gt; to yield to other available threads.&lt;/p&gt;
&lt;p&gt;If you want to use memoization safely between threads, you need a &lt;code&gt;Mutex&lt;/code&gt;. Just like &lt;code&gt;CountdownLatch&lt;/code&gt; uses a &lt;code&gt;Mutex&lt;/code&gt; internally to &lt;code&gt;count_down&lt;/code&gt; safely between threads, we need a &lt;code&gt;Mutex&lt;/code&gt; to safely share our lazy-loaded instance:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  @latch_mutex = Mutex.new

  def self.latch
    return @latch if @latch
    @latch_mutex.synchronize do
      @latch ||= CountdownLatch.new(10)
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;💰 My personal metric is that the right amount of mutexes in my code is zero. If I am using a mutex, I think hard to figure out a way to avoid it because it means I’m opening up myself and future devs to a lot of cognitive overhead: you need to think critically anytime you make a change relating to mutex code.&lt;/p&gt;
&lt;p&gt;If you’re a library or framework author, they may be unavoidable at some point to do interesting or useful things. In my own application code, I can pretty much &lt;em&gt;always&lt;/em&gt; avoid them.&lt;/p&gt;
&lt;p&gt;To be perfectly honest, I’m not confident it’s ok to return the &lt;code&gt;@latch&lt;/code&gt; early if it’s present. I’m pretty sure it is… but is there a memory visibility implication? Am I creating a new type of edge case or race condition? These are the questions you have to ask yourself when you introduce mutexes.&lt;/p&gt;
&lt;/blockquote&gt;
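&lt;p&gt;If you’d rather not reason about that at all, &lt;code&gt;concurrent-ruby&lt;/code&gt; also ships primitives for safe lazy initialization. A sketch using &lt;code&gt;Concurrent::Delay&lt;/code&gt;, which evaluates its block once, thread-safely, on first access:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;concurrent&amp;quot;

class AppContext
  LATCH = Concurrent::Delay.new { CountdownLatch.new(10) }

  def self.latch
    LATCH.value # the first caller initializes it; everyone else reuses it
  end
end
&lt;/code&gt;&lt;/pre&gt;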
&lt;p&gt;Ok, &lt;code&gt;AppContext.latch&lt;/code&gt; memoization, take three:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;10.times.map do
  Thread.new do
    latch = AppContext.latch
    # do some work
    puts &amp;quot;Count down! [#{latch.count_down}]&amp;quot;
    puts &amp;quot;latch id #{latch.object_id}&amp;quot;
  end
end

AppContext.latch.wait
puts &amp;quot;All threads finished!&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;Finally&lt;/em&gt;, you get a correct countdown!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;CountdownLatch initialized with a count of 10
Count down! [9]
latch id 2700
Count down! [6]
latch id 2700
Count down! [3]
latch id 2700
Count down! [0]
latch id 2700
Count down! [7]
latch id 2700
Count down! [1]
latch id 2700
Count down! [4]
latch id 2700
Count down! [8]
latch id 2700
Count down! [5]
latch id 2700
Count down! [2]
latch id 2700
All threads finished!
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Much better! We should be ready for our jobs now. But is this really the right direction? You’ve resolved the “check-then-act” issue, but you realize there are some pretty obvious limits to this approach:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Right now we can have one instance of the &lt;code&gt;CountdownLatch&lt;/code&gt; to share, and it’s hard-coded to 10… What would be the best way to share an object, but also make it configurable?&lt;/li&gt;
&lt;li&gt;What if you have more than one job server? Multiple job servers can pull jobs from the same queue 🤔. You can only synchronize across threads if they live in the same server process. Outside of that, your mutex’s ability to lock anything is just a wish in the breeze.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You need something to share, that’s independent of what’s in memory and can represent multiple coordinating sets of jobs.&lt;/p&gt;
&lt;h4 id=&#34;read-modify-write&#34;&gt;read-modify-write&lt;/h4&gt;
&lt;p&gt;You’re going to branch a bit outside of threads for concurrency, and get distributed. You rewrite the &lt;code&gt;CountdownLatch&lt;/code&gt; to:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Allow specifying an id to key off of, so we can support more than one &lt;code&gt;CountdownLatch&lt;/code&gt; at a time&lt;/li&gt;
&lt;li&gt;Store the countdown value in &lt;a href=&#34;https://redis.io/&#34;&gt;Redis&lt;/a&gt;, so it can be accessed and count down from anywhere&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;Independently create &lt;code&gt;CountdownLatch&lt;/code&gt; instances so you don’t need to share the instance itself. You are sharing the Redis database instead.&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;redis&amp;quot;

class DistributedCountdownLatch
  def initialize(id, count = nil)
    @id = id
    @redis = Redis.new
    @redis.set(key, count) if count
  end

  def wait
    while current &amp;gt; 0
      puts &amp;quot;Remaining in countdown: [#{current}]&amp;quot;
      sleep(1)
    end
  end

  def current
    @redis.get(key).to_i
  end

  def count_down
    new_current = current - 1
    @redis.set(key, new_current)
    new_current
  end

  def key
    &amp;quot;latch_#{@id}&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Running it across some threads looks promising.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;latch = DistributedCountdownLatch.new(&amp;quot;hello&amp;quot;, 10)
10.times.map do |x|
  Thread.new do
    puts &amp;quot;hello #{latch.count_down}&amp;quot;
  end
end

latch.wait
puts &amp;quot;world&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;So far so good!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Remaining in countdown: [10]
hello 9
hello 8
hello 7
hello 6
hello 5
hello 4
hello 3
hello 2
hello 1
Remaining in countdown: [1]
hello 0
world
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You run it several times and feel ready to use it in your Sidekiq jobs!&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class TickerJob
  include Sidekiq::Job

  def perform
    # do some work
    latch = DistributedCountdownLatch.new(&amp;quot;tick&amp;quot;)
    remaining = latch.count_down
    puts &amp;quot;Tick! [#{remaining}]&amp;quot;
  end
end

class CoordinatorJob
  include Sidekiq::Job

  def perform
    latch = DistributedCountdownLatch.new(&amp;quot;tick&amp;quot;, 10)
    10.times do
      TickerJob.perform_async
    end
    latch.wait
    puts &amp;quot;All jobs finished!&amp;quot;
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Let’s see how it goes:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Tick! [9]
Tick! [9]
Tick! [8]
Tick! [7]
Tick! [7]
Tick! [6]
Tick! [6]
Remaining in countdown: [6]
Tick! [5]
Tick! [5]
Tick! [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining...
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Poorly. It went poorly 🥺.&lt;/p&gt;
&lt;p&gt;If we walk through the code again:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Thread 1 reads the value from Redis. That’s blocking IO, so Thread 1 yields and…&lt;/li&gt;
&lt;li&gt;Thread 2 reads the value from Redis. That’s blocking IO, so Thread 2 yields and…&lt;/li&gt;
&lt;li&gt;Thread 3 reads the value from Redis. That’s blocking IO, so Thread 3 yields and…&lt;/li&gt;
&lt;li&gt;Thread 1 gets the value back, which is 10. It subtracts 1 from it and writes 9 to Redis. That’s blocking IO, so Thread 1 yields and…&lt;/li&gt;
&lt;li&gt;Thread 2 gets the value back, which is 10. It subtracts 1 from it and writes 9 to Redis. That’s blocking IO, so Thread 2 yields and…&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;🙅🏻‍♂️👈🏼👉🏼👇🏼👆🏼🙅🏻‍♂️&lt;/p&gt;
&lt;p&gt;You’re experiencing another race condition, and once again it’s common enough to have a name: “read-modify-write”. You’re “read”ing a value in its current state from Redis, “modify”ing it, then “write”ing it back to Redis. The problem is that with a shared resource, multiple clients can each “read” the value in the same state, so their “write”s overwrite each other. We have to coordinate the way each client pulls data from Redis.&lt;/p&gt;
&lt;p&gt;Race conditions happen on shared resources - those shared resources don’t need to live on the same machine.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 Redis is a single-threaded database, meaning it can only ever execute one thing at a time, no matter how many clients are connected. But being single-threaded does not do anything to fix race conditions. It may only execute one operation at a time, but nothing about that guarantees you won’t execute those operations out of order.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;What can you do?&lt;/p&gt;
&lt;p&gt;We need something like a &lt;code&gt;Mutex&lt;/code&gt;, but a &lt;code&gt;Mutex&lt;/code&gt; that can work across independent servers. A &lt;code&gt;Mutex&lt;/code&gt; is just a lock you acquire on a resource - can you get that with Redis?&lt;/p&gt;
&lt;p&gt;You can simulate a lock using the Redis operation &lt;code&gt;SET&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class DistributedCountdownLatch
  class WatchError &amp;lt; StandardError; end

  # ... same code as before

  def count_down
    lock_key = &amp;quot;lock_#{key}&amp;quot;
    loop do
      # `nx` stands for `not exists`
      # `ex` means the lock key will expire in 10 seconds
      if @redis.set(lock_key, 1, nx: true, ex: 10)
        new_count = current - 1
        # `multi` allows every operation to run atomically
        @redis.multi do |transaction|
          transaction.set(key, new_count)
          transaction.del(lock_key)
        end
        return new_count
      end
      sleep 0.1
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;First you set a key using the &lt;code&gt;nx&lt;/code&gt;, or “not exists” flag. If a key already exists, the operation fails. In that case you &lt;code&gt;sleep&lt;/code&gt; for &lt;code&gt;0.1&lt;/code&gt; seconds, then try again.&lt;/li&gt;
&lt;li&gt;Second you get the current value.&lt;/li&gt;
&lt;li&gt;Third, you perform a Redis &lt;code&gt;MULTI&lt;/code&gt; operation, which is a kind of Redis database transaction. We will &lt;code&gt;SET&lt;/code&gt; the new count, and &lt;code&gt;DEL&lt;/code&gt;ete our lock key, in one atomic call to Redis. It all works, or it all fails.&lt;/li&gt;
&lt;/ul&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 This type of lock is called a “Pessimistic lock”. Pessimistic locking means you must obtain exclusive access to the lock before running an operation, and retry or raise an error if that is not possible. The alternative is “optimistic” locking, which means you attempt to run the operation, and before committing verify that no other operations have taken place. If they have, you either retry or raise an error.&lt;/p&gt;
&lt;p&gt;📝 If you’re going to use Redis as a distributed lock, you should use the &lt;a href=&#34;https://github.com/leandromoreira/redlock-rb&#34;&gt;redlock-rb gem&lt;/a&gt;, which implements the Redis team’s recommended locking algorithm, &lt;a href=&#34;https://redis.io/docs/latest/develop/use/patterns/distributed-locks/&#34;&gt;“Redlock”&lt;/a&gt;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Creating a simple form of distributed &lt;code&gt;Mutex&lt;/code&gt; fixes our race condition! Now we will only attempt the update once we acquire the lock, and we’ll retry until we can acquire it:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Tick! [9]
Tick! [8]
Remaining in countdown [8]
Tick! [7]
Tick! [6]
Tick! [5]
Tick! [4]
Tick! [3]
Tick! [2]
Tick! [1]
Remaining in countdown [1]
Tick! [0]
All jobs finished!
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h4 id=&#34;coordinating-jobs&#34;&gt;Coordinating jobs&lt;/h4&gt;
&lt;p&gt;Even having fixed the “check-then-act” and “read-modify-write” issues, trying to coordinate your jobs using this approach is a lot of work to get right. I wouldn’t recommend it, even though the code is slightly more correct now.&lt;/p&gt;
&lt;p&gt;If you’re interested in more appropriate ways to coordinate jobs:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Some job servers offer a feature for running and coordinating jobs in batches, with optional success/finish/error callbacks.
&lt;ul&gt;
&lt;li&gt;Sidekiq offers this in their &lt;a href=&#34;https://sidekiq.org/products/pro.html&#34;&gt;paid pro tier&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/bensheldon/good_job?tab=readme-ov-file#batches&#34;&gt;GoodJob&lt;/a&gt; offers batch support&lt;/li&gt;
&lt;li&gt;I am &lt;a href=&#34;https://github.com/rails/solid_queue/pull/142&#34;&gt;working on batch support&lt;/a&gt; for &lt;a href=&#34;https://github.com/rails/solid_queue&#34;&gt;SolidQueue&lt;/a&gt; - feel free to leave feedback!&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;If you still are interested in a &lt;code&gt;CountdownLatch&lt;/code&gt; approach:
&lt;ul&gt;
&lt;li&gt;Redis has built-in &lt;code&gt;INCR&lt;/code&gt; and &lt;code&gt;DECR&lt;/code&gt; commands, which handle these updates for you atomically (see the sketch after this list)&lt;/li&gt;
&lt;li&gt;ActiveRecord has an &lt;code&gt;increment&lt;/code&gt; and &lt;code&gt;decrement&lt;/code&gt; method you could use to &lt;code&gt;decrement&lt;/code&gt; a value on a record atomically.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
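&lt;p&gt;To make the &lt;code&gt;DECR&lt;/code&gt; option concrete: Redis executes it as a single atomic operation, so the whole lock dance in &lt;code&gt;count_down&lt;/code&gt; collapses into one call. A sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def count_down
  # DECR reads, decrements and writes in one atomic Redis operation,
  # and returns the new value - no lock key required
  @redis.decr(key)
end
&lt;/code&gt;&lt;/pre&gt;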
&lt;h4 id=&#34;read-modify-write-for-days&#34;&gt;read-modify-write shows up in many ways&lt;/h4&gt;
&lt;p&gt;There’s a very good chance you have code susceptible to read-modify-write issues right now. Aside from reading a value and then modifying it, the most common situation I see is around enforcing uniqueness constraints:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You expect a record in a database to be unique, but you don’t have uniqueness constraints at the database-level.&lt;/li&gt;
&lt;li&gt;You “read” the database and find the record doesn’t exist.&lt;/li&gt;
&lt;li&gt;Another request comes in at the same time and also “read”s the database and finds the record doesn’t exist.&lt;/li&gt;
&lt;li&gt;In this case the “modify” is creating the record. Each request now attempts to create the record.&lt;/li&gt;
&lt;li&gt;With no database-level uniqueness constraint, the result is that each request thinks the record doesn’t exist, and you “write” two (or more) records, violating your uniqueness requirement. The sketch after this list shows the standard fix.&lt;/li&gt;
&lt;/ul&gt;
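&lt;p&gt;The standard fix is to let the database enforce uniqueness and treat losing the race as an expected error. A sketch, assuming a unique index on &lt;code&gt;users.email&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def find_or_create_user(email)
  User.create!(email: email)
rescue ActiveRecord::RecordNotUnique
  # Another request won the race - the record exists now, so load it
  User.find_by!(email: email)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Rails ships this exact pattern as &lt;code&gt;User.create_or_find_by(email: email)&lt;/code&gt;.&lt;/p&gt;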
&lt;h3 id=&#34;parting-thoughts&#34;&gt;Parting thoughts&lt;/h3&gt;
&lt;p&gt;Phew! That was a lot. We’ve gone deep into 5 different threading mistakes, ways to identify them, and ways to fix them. Let’s finish up with a few topics addressing how to look out for threading issues in your future endeavors, as well as answering some possible remaining questions!&lt;/p&gt;
&lt;h3 id=&#34;tips-for-gems&#34;&gt;Tips for auditing gems 💎 &lt;/h3&gt;
&lt;p&gt;Here are some notes on what to look for when auditing a gem for thread safety.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;If the gem interface is a static method, check the source. It might be an indicator of class-level ivars or thread/fiber-local data&lt;/li&gt;
&lt;li&gt;If the amount of code is small, take some time to understand it. Keep the “threading mistakes” we’ve discussed in mind and see if anything sticks out to you&lt;/li&gt;
&lt;li&gt;If there is a lot of code, look through and make sure it has active maintenance and maintainers. You’re not likely to learn a larger code base, so at least know it’s being actively maintained. There have been studies suggesting that the longer-lived a codebase is, the likelier it is that developers and users have found and removed its threading issues.&lt;/li&gt;
&lt;li&gt;If there’s low activity, a lot of code, and you can’t take the time to understand it, you probably want to avoid it&lt;/li&gt;
&lt;li&gt;Make sure &lt;code&gt;main&lt;/code&gt; has been released into a gem version. The repository can sometimes be misleading - it may have a fix explaining an issue you’re hitting that hasn’t been released&lt;/li&gt;
&lt;li&gt;If a gem is unmaintained or inactive, it doesn’t mean you can’t use it. &lt;em&gt;But&lt;/em&gt;, effectively you own it. If you hit issues, you’ll be the one to fix them&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;using-falcon&#34;&gt;I use &lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;Falcon&lt;/a&gt; - Fibers are safe from this right?&lt;/h3&gt;
&lt;p&gt;The short answer is “assume not”.&lt;/p&gt;
&lt;p&gt;We’ll dig more into the nuances of fibers in “Concurrent, colorless Ruby: Fibers”, but you are always safer to run as if threads will eventually be involved. Falcon can run using Threads in addition to Fibers, for instance.&lt;/p&gt;
&lt;p&gt;While Fibers &lt;em&gt;are&lt;/em&gt; more deterministic, even with the FiberScheduler, they are still susceptible to some of the same issues as Threads.&lt;/p&gt;
&lt;p&gt;Quick question: Can race conditions happen with Fibers? We’ll walk through that in “Concurrent, colorless Ruby: Fibers”. Or see the handy answer key below.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;.seY: rewsnA&lt;/strong&gt;&lt;/p&gt;
&lt;h3 id=&#34;using-jruby-truffle&#34;&gt;I use JRuby/TruffleRuby…&lt;/h3&gt;
&lt;p&gt;Good news! These examples will break really fast for you! Congrats!&lt;/p&gt;
&lt;p&gt;We’ll talk more about this later in the series. But know for now that JRuby and TruffleRuby run Threads without any implicit locking (i.e. no GVL). You will hit threading issues much faster in those environments than in CRuby.&lt;/p&gt;
&lt;h3 id=&#34;reforking&#34;&gt;Preforking / Reforking servers&lt;/h3&gt;
&lt;p&gt;Preforking and reforking servers &lt;em&gt;do&lt;/em&gt; share resources. They use &lt;code&gt;fork&lt;/code&gt;, which initially shares the entire memory space between the two processes using something called Copy-On-Write memory. The child also inherits things which are not safe to share, like open file handles - those &lt;em&gt;are&lt;/em&gt; shared by default.&lt;/p&gt;
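&lt;p&gt;Here’s a tiny illustration of the file handle side - after a &lt;code&gt;fork&lt;/code&gt;, the parent and the child hold the same underlying descriptor:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;file = File.open(&amp;quot;shared.log&amp;quot;, &amp;quot;w&amp;quot;)
file.sync = true # write straight through, no userland buffering

pid = fork do
  # The child inherits the open descriptor from the parent
  file.write(&amp;quot;from the child\n&amp;quot;)
end

file.write(&amp;quot;from the parent\n&amp;quot;)
Process.wait(pid)
file.close
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Both writes land in the same file. That inheritance is why preforking servers reopen sockets and database connections after forking - two processes writing into one connection is a recipe for a corrupted stream.&lt;/p&gt;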
&lt;p&gt;Thankfully many members of the Ruby community have worked to make that as safe as possible for most popular libraries. We’ll talk more on this topic later in the series, discussing areas where it is handled automatically and areas where it isn’t.&lt;/p&gt;
&lt;h3 id=&#34;mutexes-for-globals&#34;&gt;What about using mutexes for globals?&lt;/h3&gt;
&lt;p&gt;You may have seen some of the global examples and thought: “couldn’t you keep them global the way they are and keep them safe using a mutex?”&lt;/p&gt;
&lt;p&gt;Good question!&lt;/p&gt;
&lt;p&gt;That would mean using a lock to provide coordinated access to your global state. Like taking our Fibonacci example and using a lock here:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;monitor&amp;quot;

class Result
  attr_accessor :value
end

class Fibonacci
  @fib_monitor = Monitor.new

  class &amp;lt;&amp;lt; self
    def result=(value)
      @fib_monitor.synchronize { @result = value }
    end

    def result
      @fib_monitor.synchronize { @result }
    end

    def calculate(n)
      @fib_monitor.synchronize do
        self.result = Result.new
        result.value = fib(n)
      end
    end

    def fib(n)
      return n if n &amp;lt;= 1

      fib(n - 1) + fib(n - 2)
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 We actually use a &lt;code&gt;Monitor&lt;/code&gt; here instead of a &lt;code&gt;Mutex&lt;/code&gt; because it supports reentrancy. That means we can re-enter the synchronize block from the same thread, which is necessary in our example. Otherwise, it is &lt;em&gt;largely&lt;/em&gt; the same as a &lt;code&gt;Mutex&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;📝 You may have noticed I removed the &lt;code&gt;attr_accessor&lt;/code&gt; and I both set and get using the &lt;code&gt;Monitor&lt;/code&gt;. I did this out of caution for a concept of “visibility” of shared resources between Threads. We’ll discuss that more later.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Now we’ll run it again using &lt;code&gt;run_forever&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]

run_forever do |iteration|
  n = iteration % answers.size
  Fibonacci.calculate(n)
  answer = answers[n]
  result = Fibonacci.result.value

  if result != answer
    raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
  end
rescue =&amp;gt; e
  puts &amp;quot;Iteration[#{iteration}] #{e.message}&amp;quot;
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It works. We don’t get any errors. The mutex allows the code to treat multiple operations as a single operation. Are there downsides?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;If you’re going to use a &lt;code&gt;Monitor&lt;/code&gt;/&lt;code&gt;Mutex&lt;/code&gt; on a piece of global state - keep your mutex scope small to reduce locking overhead&lt;/li&gt;
&lt;li&gt;Don’t hog the GVL. Make sure you don’t wrap your GVL-releasing operations like blocking IO in a &lt;code&gt;Mutex&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;Don’t get your mutex scope wrong - incorrect scoping can still leave you with issues like memory inconsistency between Threads.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We’ll dig into GVL hogs and surprising behaviors in “Concurrent, colorless Ruby”. For now the short answer is: you can, but avoid it if possible. It may hurt your ability to parallelize your code.&lt;/p&gt;
&lt;h3 id=&#34;detect-and-correct&#34;&gt;It’s not doom and gloom, it’s detect and correct&lt;/h3&gt;
&lt;p&gt;When I highlight a bunch of pitfalls you might hit, it feels like I’m being a virtual storm cloud, just trying to ruin your sunny coding day. In my post &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html&#34;&gt;PgBouncer is useful, important, and fraught with peril&lt;/a&gt;, I highlight the tricky parts to inform but I still use the tool happily every day!&lt;/p&gt;
&lt;p&gt;It’s the same for Ruby and threading concerns. Almost every language has these same problems, or worse&lt;sup id=&#34;fnref:8&#34;&gt;&lt;a href=&#34;#fn:8&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;8&lt;/a&gt;&lt;/sup&gt;. Threads are easy to get wrong, so you want to take some golden paths and stay simple when using them.&lt;/p&gt;
&lt;p&gt;Your code is safer if you assume you’re always using threads. You are. And if by some oddity you &lt;em&gt;really&lt;/em&gt; &lt;em&gt;aren’t&lt;/em&gt;, there’s a strong chance you will be eventually.&lt;/p&gt;
&lt;h3 id=&#34;takeaways&#34;&gt;Takeaways 🥡&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Your Ruby code is threaded. Just assume it is and forget semantics.&lt;/li&gt;
&lt;li&gt;The GVL can make threading bugs harder to produce - if you see suspicious code that &lt;em&gt;seems&lt;/em&gt; to work, try again using &lt;code&gt;run_forever&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Think thread safety&lt;/li&gt;
&lt;li&gt;If you write any explicitly threaded code, coordinate threads or share data - use existing vetted tools like job batches and &lt;code&gt;concurrent-ruby&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Other runtimes like JRuby/TruffleRuby will produce bugs much faster&lt;/li&gt;
&lt;li&gt;As much as possible, don’t share resources between threads. Forget your lessons from childhood: sharing is bad.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;next-up&#34;&gt;Next up&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/7fee80c6bc.jpeg&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This is the second step into this Ruby concurrency series. Next up we’ll discuss the concept of “colorless” programming, and how it enhances the Ruby experience. We’ll also take our first dive into how the layers of Ruby work, starting with Threads.&lt;/p&gt;
&lt;p&gt;More soon 👋🏼!&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;And any blocking operation&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;💩&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;There is a short section in the thread docs that describes this behavior as well &lt;a href=&#34;https://docs.ruby-lang.org/en/3.3/Thread.html#class-Thread-label-Fiber-local+vs.+Thread-local&#34;&gt;https://docs.ruby-lang.org/en/3.3/Thread.html#class-Thread-label-Fiber-local+vs.+Thread-local&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;code&gt;Fiber#store&lt;/code&gt; and adding attr accessors to Thread/Fiber&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The concept can also be used for other units of concurrency, like fibers&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Or use a file mutex &lt;a href=&#34;https://github.com/yegor256/futex&#34;&gt;https://github.com/yegor256/futex&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Other databases would also work, but the Redis data model is the simplest to get going for a simple value. Sidekiq works off of Redis as well, so you already would have it in your stack&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:8&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;So don’t let your Python or Java friends try to bully you about it 😉. JavaScript can suffer similar issues as well.&lt;/p&gt;
&lt;p&gt;I’ve never supported a production Rust or Elixir application so I can’t speak from experience, but those are the only two languages I can think of that might fare a lot better in this regard (and similar languages like F#, Gleam, Clojure, etc).&lt;/p&gt;
&lt;p&gt;Elixir because of its pure functional approach. Rust because it’s one of the first non-functional languages I’ve ever encountered that has strong consistency baked into its core. They eliminate a class of threading bugs by their nature, but can still experience issues unique to how they work.&amp;#160;&lt;a href=&#34;#fnref:8&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/2ffdf3bdb5.png)

&gt; 👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - [Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)
&gt; - Your Ruby programs are always multi-threaded: Part 2
&gt;   - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my! (Concurrent, colorless Ruby)](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “Your Ruby programs are always multi-threaded: Part 2”. I’ll update the links as each part is released, and include these links in each post.

- _**Part 1**_
- [It’s all threaded](https://jpcamara.com/2024/06/04/your-ruby-programs.html#its-all-threaded)
- [Ok but how threaded is it _really_](https://jpcamara.com/2024/06/04/your-ruby-programs.html#how-threaded-is-it)
- [Threading mistakes](https://jpcamara.com/2024/06/04/your-ruby-programs.html#threading-mistakes)
	- 1. [Sharing module/class instance variables](https://jpcamara.com/2024/06/04/your-ruby-programs.html#sharing-ivars)
		- [Heisenbugs](https://jpcamara.com/2024/06/04/your-ruby-programs.html#heisenbugs)
		- [Ruby internals](https://jpcamara.com/2024/06/04/your-ruby-programs.html#ruby-internals)
		- [Back in reality…](https://jpcamara.com/2024/06/04/your-ruby-programs.html#back-in-reality)
	- 2. [Copying state to instances](https://jpcamara.com/2024/06/04/your-ruby-programs.html#copying-state)
		- [Back in reality… part two](https://jpcamara.com/2024/06/04/your-ruby-programs.html#back-in-reality-two)
	- 3a. [Cleaning up thread state](https://jpcamara.com/2024/06/04/your-ruby-programs.html#cleaning-up)
		- [If you’re using Sidekiq](https://jpcamara.com/2024/06/04/your-ruby-programs.html#using-sidekiq)
		- [If you’re using ActiveJob](https://jpcamara.com/2024/06/04/your-ruby-programs.html#using-activejob)
- _**Part 2**_
	- 3b. [Sharing state with fibers](#sharing-state-with-fibers)
		- [A wild Fiber appeared!](#wild-fiber) 🌾 
		- [Ruby concurrency has layers](#ruby-concurrency-layers)
		- [Actual thread-locals](#actual-thread-locals)
		- [CurrentAttributes](#current-attributes)
		- [RequestStore](#request-store)
		- [Back in reality… part three](#back-in-reality-three)
	- 4. [Reusing objects](#reusing-objects)
	- 5. [Race conditions](#race-conditions)
		- [Threading concepts](#threading-concepts)
		- [Memoization / check-then-act](#check-then-act)
		- [read-modify-write](#read-modify-write)
		- [Coordinating jobs](#coordinating-jobs)
		- [read-modify-write shows up in many ways](#read-modify-write-for-days)
- [Parting thoughts](#parting-thoughts)
- [Tips for auditing gems](#tips-for-gems) 💎 
- [I use Falcon - Fibers are safe from this right?](#using-falcon)
- [I use JRuby/TruffleRuby…](#using-jruby-truffle)
- [Preforking / Reforking servers](#reforking)
- [What about using mutexes for globals?](#mutexes-for-globals)
- [It’s not doom and gloom, it’s detect and correct](#detect-and-correct)
- [Takeaways](#takeaways) 🥡

If you haven’t read “[Your Ruby programs are always multi-threaded: Part 1](https://jpcamara.com/2024/06/04/your-ruby-programs.html)”, you’re better off starting there. You’ll still get value out of this post, but you’ll get the full context by reading both. All of the Table of Contents links under **Part 1** link back to that previous article [↩️](https://jpcamara.com/2024/06/04/your-ruby-programs.html).

&lt;h3 id=&#34;sharing-state-with-fibers&#34;&gt;3b. Sharing state with fibers&lt;/h3&gt;

Now that you’re properly clearing out thread-local state between requests/jobs, you have a new feature to write. Up until this point you could only add an image from a url - users want to upload images from their computer.

It seems complex to take a file from the front end and somehow get it to a background job. So you’re going to do it in your controller - but you want to minimize the chance of timing out while uploading a bunch of files.

&gt; 📝 Since this example app is built with rails, it should instead be using ActiveStorage. But let’s just play along at home 😌

You’ve heard of an interesting library called `async` that makes parallelizing IO[^1] pretty easy and is more predictable than threads. It does it using this special Ruby 3+ FiberScheduler thing 🤔.

As a test you try uploading some files to S3 in the controller. You create an `Async` block, and inside of it you create as many `Async` blocks as there are blocking operations to parallelize:

	class ImagesController
	  before_action :set_user
	
	  def create
	    Async do
	      params[:files].each do |file|
	        Async { S3Upload.new(file).call }
	      end
	    end
	  end
	end
	
	class S3Upload
	  def initialize(file)
	    @file = file
	  end
	
	  def call
	    object.put(body: @file)
	  end
	
	  private
	
	  def object
	    client = Aws::S3::Resource.new
	    bucket = client.bucket(ENV[&#34;bucket&#34;])
	    bucket.object(object_path)
	  end
	
	  def object_path
	    @file
	  end
	end

![](https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-page-5.drawio-1.png)

&gt; **source**: jpcamara.com

You test this and they all upload in parallel! Nice! Your request takes as long as the longest upload, and no longer than that. But uploading random file names at the top level of your S3 bucket is not ideal so you prefix the files with the current user id:

	# ...
	Async do
	  S3Upload.new(file, AppContext.user).call
	end
	# ...
	
	class S3Upload
	  def initialize(file, user)
	    @file = file
	    @user = user
	  end
	
	  # ...
	
      def object_path
        name = @file
        &#34;#{@user.id}/#{name}&#34;
      end
	end

You test this and… it raises a `NoMethodError` on `NilClass` when trying to get `user.id`. Huh? You’re sure the user is defined so you print some information:

	class ImagesController
	  before_action :set_user
	
	  def create
	    puts &#34;user[#{AppContext.user}]&#34;
	    # ...
	  end
	end
	
	# ...
	def object_path
	  name = @file
	  puts &#34;upload user[#{AppContext.user}]&#34;
	  puts &#34;upload @user[#{@user}]&#34;
	  &#34;#{@user.id}/#{name}&#34;
	end

Which prints:

```text
user[#&lt;User id=123&gt;]
upload user[]
upload @user[]
NoMethodError...
```
&lt;h4 id=&#34;wild-fiber&#34;&gt;A wild Fiber appeared!&lt;/h4&gt;

`AppContext.user` is nil when we’re inside that async block… I think it’s time to look at those `Thread.current[]` [docs again](https://docs.ruby-lang.org/en/3.0/Thread.html#method-i-5B-5D)…

&gt; Thread\#[]
&gt; 
&gt; Returns the value of a fiber-local variable 

Well that doesn’t seem right. Doesn’t everyone call them thread-locals? They’re not thread-local… they’re _fiber_-local?

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/9c0c11900d.jpeg&#34; width=&#34;80%&#34; height=&#34;80%&#34; alt=&#34;&#34;&gt;

What exactly does it mean for it to be fiber-local? And how can you fix it so you can use your `AppContext` within `Async`?

&lt;h4 id=&#34;ruby-concurrency-layers&#34;&gt;Ruby concurrency has layers&lt;/h4&gt;

The concurrency model in Ruby is multi-layered. The best way to describe it is as a nesting doll - as you pull away the outer layers, you find new layers of concurrency inside.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/7fee80c6bc.jpeg&#34; width=&#34;70%&#34; height=&#34;70%&#34; alt=&#34;&#34;&gt;

&gt; **source**: jpcamara.com

In your own Ruby code, you can easily inspect each layer directly:

	puts &#34;Process #{Process.pid}&#34;
	puts &#34;  Ractor #{Ractor.current}&#34;
	puts &#34;    Thread #{Thread.current}&#34;
	puts &#34;      Fiber #{Fiber.current}&#34;
	
	# Process 123
	#   Ractor #&lt;Ractor:#1 running&gt;
	#     Thread #&lt;Thread:0x0... run&gt;
	#       Fiber #&lt;Fiber:0x0... (resumed)&gt;

Your code is always operating in the context of a specific instance of each layer. Mostly you can access those instances with a call to `current`. Your code runs within a Fiber, which is inside a Thread, which is inside a Ractor, which is inside a Process. The next posts in this series will dig into every layer, but for now you’ve got a fiber problem[^2].

Internally, every `Async` block is run in a new [Fiber](https://docs.ruby-lang.org/en/3.3/Fiber.html). That’s the innermost layer of Ruby concurrency. So that means if you have something that is stored as a _fiber_-local, every time you create a new fiber, you get an empty, fresh set of locals.

	# Fiber.current #&lt;Fiber&gt;
	puts &#34;Main Fiber #{Fiber.current}&#34;
	puts Thread.current[:app_user]
	Fiber.new do
	  puts &#34;Fiber.new #{Fiber.current}&#34;
	  puts Thread.current[:app_user]
	end.resume
	Async do
	  puts &#34;Async #{Fiber.current}&#34;
	  puts Thread.current[:app_user]
	  Async do
	    puts &#34;  Async #{Fiber.current}&#34;
	    puts Thread.current[:app_user]
	    Async do
	      puts &#34;    Async #{Fiber.current}&#34;
	      puts Thread.current[:app_user]
	    end
	  end
	end

Running the above code you get:

```text
Main Fiber #&lt;Fiber:0x00007a77a0b5bbb0 (resumed)&gt;
#&lt;User id=123&gt;
Fiber.new #&lt;Fiber:0x00007a77a01766b8 main.rb:940 (resumed)&gt;

Async #&lt;Fiber:0x00007a77a0175858 task.rb:326 (resumed)&gt;

  Async #&lt;Fiber:0x00007a77a0174fc0 task.rb:326 (resumed)&gt;

    Async #&lt;Fiber:0x00007a77a0174cf0 task.rb:326 (resumed)&gt;

```
That’s well and good to know, you think, but how can you fix it? You don’t care about being fiber-local, you’re trying to be thread-local so these fibers can use your `AppContext`🧵.

&lt;h4 id=&#34;actual-thread-locals&#34;&gt;Actual thread-locals&lt;/h4&gt;

Those same [Thread.current docs](https://docs.ruby-lang.org/en/3.3/Thread.html#method-i-5B-5D) point us in the right direction:

&gt; For thread-local variables, see `thread_variable_get` and `thread_variable_set`

Let’s update `AppContext` to use that:

	class AppContext
	  class &lt;&lt; self
	    def user
	      Thread.current.thread_variable_get(:app_user)
	    end
	
	    def user=(user)
          Thread.current.thread_variable_set(:app_user, user)
	      self.locale = user.locale
	    end
	
	    def locale
	      Thread.current.thread_variable_get(:app_locale)
	    end
	
	    def locale=(locale)
	      Thread.current.thread_variable_set(:app_locale, locale)
	    end
	  end
	end

It’s a bit more verbose, but otherwise it feels the same. Run our upload code again and…

```text
user[#&lt;User id=123&gt;]
upload user[#&lt;User id=123&gt;]
upload @user[#&lt;User id=123&gt;]
```
Success 🙌🏼. The data is now actually thread-local. It is still present inside of fibers because those fibers all belong to the current thread[^3].
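
To see the difference in isolation, here’s a minimal sketch (the variable names are just for illustration) contrasting a fiber-local with a true thread variable inside a fresh fiber:

	Thread.current[:local] = &#34;fiber-local&#34;
	Thread.current.thread_variable_set(:variable, &#34;thread-local&#34;)
	
	Fiber.new do
	  # A new fiber starts with a fresh, empty set of fiber-locals...
	  puts Thread.current[:local].inspect # =&gt; nil
	  # ...but thread variables belong to the thread, so they survive
	  puts Thread.current.thread_variable_get(:variable) # =&gt; thread-local
	end.resume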

There are actually _two more_ core Ruby ways[^4] you could handle this situation 😵‍💫. But we’ve covered the two most common options, and it solves the problem at hand. We’ll dig into those other options later in the series.

This seems like it should be a solved problem, you think. Does everyone write this from scratch and encounter these same issues? Is there an easier way? 

Yes, thankfully.

&lt;h4 id=&#34;current-attributes&#34;&gt;CurrentAttributes&lt;/h4&gt;

If you’re in Rails, or you use ActiveSupport, you should be using [`CurrentAttributes`](https://api.rubyonrails.org/classes/ActiveSupport/CurrentAttributes.html).

`CurrentAttributes` provides the same basic behavior we built (plus more), and integrates seamlessly into both Rails controllers and jobs. It also has first class support from Sidekiq, even without Rails. Here’s the `AppContext` code, now using `CurrentAttributes`:

	class AppContext &lt; ActiveSupport::CurrentAttributes
	  attribute :user, :locale
	
	  def user=(user)
	    super
	    self.locale = user.locale
	  end
	end

At first glance, it’s already a much simpler and clearer setup. But the real benefit is that we no longer have to worry about anything else (a usage sketch follows this list).

- It hooks into the Rails executor lifecycle, so whenever a web request/job begins or completes, `AppContext.reset` is called
- It internally uses `IsolatedExecutionState`, which is an abstraction around thread and fiber-local state. It is configurable to use thread or fiber-local state, and by default it uses thread-local state so it works out of the box for your example Rails app
- Sidekiq offers [first class support for it](https://www.mikeperham.com/2022/07/29/sidekiq-and-request-specific-context/), so even if you don’t use Rails you can opt-in to resetting behavior, and Sidekiq will automatically load it at the start of your job based on the execution state present when the job was enqueued
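
As a rough sketch of day-to-day usage (assuming a typical Rails app where `current_user` comes from your authentication layer), setting the context once per request is all that’s left to do - the resets are handled for you:

	class ApplicationController &lt; ActionController::Base
	  before_action :set_app_context
	
	  private
	
	  def set_app_context
	    # CurrentAttributes resets AppContext when the request completes
	    AppContext.user = current_user
	  end
	end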

Interestingly, `CurrentAttributes` doesn’t use the `Thread.current[]` or `Thread.thread_variable_get` approaches. We’ll talk about that more in a later post.

&lt;h4 id=&#34;request-store&#34;&gt;RequestStore&lt;/h4&gt;

I would not recommend using this library. It only supports `Thread.current`, and it only automatically clears itself for Rails controllers - you’re still on the hook for managing job middleware/callbacks! It also will fail the same way as our example in an `Async` block. You can learn more about it from its [GitHub repository](https://github.com/steveklabnik/request_store).

&lt;h4 id=&#34;back-in-reality-three&#34;&gt;Back in reality, part three…&lt;/h4&gt;

If this example seemed oddly specific again, that’s because this issue is present in some change tracking gems.

In terms of [monthly downloads](https://www.ruby-toolbox.com/projects/paper_trail), [`paper_trail`](https://github.com/paper-trail-gem/paper_trail) is by far the most popular gem for change tracking. And internally it uses `request_store` which means:

- It uses `Thread.current[]`
- It _does_ take care of Rails controller cleanup for you
- It _does not_ take care of job cleanup for you. If you set tracking information anywhere in your jobs, you are susceptible to leaking data between job runs 🙅🏻‍♂️.

If you’re going to use `paper_trail`, _make sure you clear out your state in jobs_. Since it uses `request_store` internally, you can clear out state using that.

	# If using Sidekiq
	class PaperTrailClearMiddleware
	  include Sidekiq::ServerMiddleware
	
	  def call(_worker, _job, _queue)
	    yield
	  ensure
	    RequestStore.clear!
	  end
	end
	
	Sidekiq.configure_server do |config|
	  config.server_middleware do |chain|
	    chain.add PaperTrailClearMiddleware
	  end
	end
	
	# If using only ActiveJob
	class ApplicationJob &lt; ActiveJob::Base
	  after_perform :clear_paper_trail_context
	
	  def clear_paper_trail_context
	    RequestStore.clear!
	  end
	end

There is a gem that does this for Sidekiq only for you, called [request\_store-sidekiq](https://github.com/madebylotus/request_store-sidekiq). It hasn’t been updated in several years, but it does what my middleware example does, automatically for you.

All change tracking gems use some type of thread/fiber state, so here’s the rundown on the remaining popular gems:

- [audited](https://github.com/collectiveidea/audited) is the next most popular but it uses `CurrentAttributes` internally, so you’re good!
- [Logidze](https://github.com/palkan/logidze) uses a block, so it’s basically impossible to leak data between threads. But, it uses `Thread.current[]` internally so it will break if you nest fibers (like `Async` blocks) inside of it
- [PublicActivity](https://github.com/public-activity/public_activity) uses `Thread.current[]` but it’s unclear if it is cleaning it up appropriately. You’ll have to do your own due diligence there

&lt;h3 id=&#34;reusing-objects&#34;&gt;4. Reusing objects&lt;/h3&gt;

You now know that single-threaded servers can still have issues leaking state between requests - like our fiber-local issue. But can they hit real threading issues? Let’s ask GitHub:

&gt; **Us**: Hey GitHub! 
&gt; 
&gt; **GitHub**: Ummm, hi?
&gt; 
&gt; **Us**: You run Unicorn as your web server don’t you? 
&gt; 
&gt; **GitHub**: Who are you? How did you get in here??
&gt; 
&gt;  **Us**: Can a single-threaded server like Unicorn encounter threading issues?
&gt; 
&gt; **GitHub**: …
&gt; 
&gt; **GitHub**: …yes 😔 [https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/](https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/) 

**TL;DR**

Someone wrote threaded code to save on performance:

	after_action :run_expensive_op_in_thread

It held a reference to a request environment object:

	def run_expensive_op_in_thread
	  Thread.new do
	    env = request.env
	    # some background logic
	  end
	end

That request object got reused between requests - it just cleared its data out instead of creating a new instance:

	def after_request
	  request.env.clear
	end

In certain edge cases, sessions would get reused between users:

	def run_expensive_op_in_thread
	  Thread.new do
	    if not_logged_in
	      login(request.env)
	    end
	  end
	end

So yes… it still involved _explicit_ threading, but in the context of a single-threaded server. This is the crux of that “Ruby is always multi-threaded” thing. Threads pop up all over the place even when you _think_ you’re not using them. If you defensively code assuming threads could eventually be involved, you reduce the risk of these types of issues popping up later.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/race-condition-diagram.webp&#34; width=&#34;600&#34; height=&#34;284&#34; alt=&#34;&#34;&gt;

&gt; **source:** [https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/](https://github.blog/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling/)

The fix for the problem was to stop reusing environment objects between requests. Each new request created a new environment object, which means a thread could safely hold onto that environment information without anything leaking in from subsequent requests.

Create new instances. Unless you know they are _incredibly_ expensive, it is not worth the risk of sharing them if it can be avoided.
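
Here’s a tiny sketch (not GitHub’s actual code) of why a background thread holding a reference to a reused, cleared object is dangerous:

	env = { user_id: 123 } # the environment for request 1
	
	thread = Thread.new do
	  sleep 0.1          # simulate &#34;some background logic&#34;
	  puts env[:user_id] # may print 456 - the shared env was mutated
	end
	
	env.clear            # &#34;after_request&#34; cleanup reuses the same object
	env[:user_id] = 456  # request 2 now leaks into the thread
	thread.join

With a fresh hash per request, the thread’s reference would still see `123` no matter what later requests do.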

&lt;h3 id=&#34;race-conditions&#34;&gt;5. Race conditions&lt;/h3&gt;

You’ve been having a tough time with these bugs, yeesh. It’s as if some kind of external force is controlling you to intentionally introduce each of these issues for other people’s benefit. But that obviously can’t be true 🤫.

You’ve got a new requirement. And this one seems pretty… concurrent. You need to coordinate some of your jobs - you split your work into several pieces, run them in individual jobs, then perform one final piece once every job has finished.

How can you coordinate a set of jobs? Maybe there’s a concurrency concept you can try out! 

You read about something called a “Countdown Latch”. You tell it what to count down from, multiple threads can safely count down using it, and you can have another thread wait until it finishes:

	latch = CountdownLatch.new(10)
	10.times.map do |x|
	  Thread.new do
	    puts &#34;hello #{x}&#34;
	    latch.count_down
	  end
	end
	
	latch.wait
	puts &#34;world!&#34;

This prints out

```text
hello 0
hello 1
hello 2
hello 3
hello 4
hello 5
hello 6
hello 7
hello 8
hello 9
world!
```
&gt; The results look ordered, but that’s just a coincidence of how it ran. The `CountdownLatch` only coordinates completion - it guarantees nothing about ordering

&lt;h4 id=&#34;threading-concepts&#34;&gt;Threading concepts&lt;/h4&gt;

When you call `latch.wait`, the thread waits until the `count_down` is complete. What would implementing a `CountdownLatch` look like?

	class CountdownLatch
	  def initialize(count)
	    @count = count
	    @mutex = Mutex.new
	    @cond = ConditionVariable.new
	    puts &#34;CountdownLatch initialized with a count of #{@count}&#34;
	  end
	
	  def wait
	    @mutex.synchronize do
	      @cond.wait(@mutex) while @count &gt; 0
	    end
	  end
	
	  def count
	    @mutex.synchronize { @count }
	  end
	
	  def count_down
	    @mutex.synchronize do
	      @count -= 1
	      if @count == 0
	        @cond.broadcast
	      end
	      @count
	    end
	  end
	end

&gt; 📝 The [`concurrent-ruby`](https://github.com/ruby-concurrency/concurrent-ruby) gem comes with its own `CountdownLatch`. I’ve written my own here for demonstration - but if you have a need for one, use theirs, not mine

A `CountdownLatch` is a special form of a computer science concept called a [barrier](https://en.m.wikipedia.org/wiki/Barrier_(computer_science)). Barriers are used to help coordinate execution of multiple threads[^5] - a `CountdownLatch` is a one-time use form of a barrier where a thread can wait until the barrier counts down to zero.

To achieve that, you need a [`Mutex`](https://docs.ruby-lang.org/en/3.3/Thread/Mutex.html) and a [`ConditionVariable`](https://docs.ruby-lang.org/en/3.3/Thread/ConditionVariable.html). `Mutex`s and `ConditionVariable`s are low-level tools available for coordinating access to resources in Ruby code. We’ll dig more into those in “Concurrent, colorless Ruby”. 

For now, know that a `Mutex` can be used to safely lock a section of Ruby code. While you are in a `synchronize` block, any other thread using the same `Mutex` will wait until that block finishes. As far as the coordinating threads are concerned, anything that happens in that block happens atomically - individual pieces which would not normally be thread-safe are treated like a single, safe operation.
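
As a tiny standalone illustration (separate from the latch), here’s a mutex turning an unsafe read-modify-write into a single atomic step:

	mutex = Mutex.new
	counter = 0
	
	threads = 10.times.map do
	  Thread.new do
	    1_000.times do
	      # counter += 1 is a read-modify-write; the mutex makes it atomic
	      mutex.synchronize { counter += 1 }
	    end
	  end
	end
	
	threads.each(&amp;:join)
	puts counter # =&gt; 10000, every increment accounted for

Without the `synchronize`, some increments could interleave and be lost - exactly the read-modify-write race we keep bumping into.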

Ok. We’ve got our coordinator ✅. Now let’s try applying it to our jobs.

	class TickerJob
	  include Sidekiq::Job
	
	  def perform
	    # do some work
	    latch = CountdownLatch.new(10)
	    remaining = latch.count_down
	    puts &#34;Tick! [#{remaining}]&#34;
	  end
	end
	
	class CoordinatorJob
	  include Sidekiq::Job
	
	  def perform
	    latch = CountdownLatch.new(10)
	    10.times do
	      TickerJob.perform_async
	    end
	    latch.wait
	    puts &#34;All jobs finished!&#34;
	  end
	end

Which gives us…!

```text
CountdownLatch initialized with a count of 10
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
CountdownLatch initialized with a count of 10
Tick! [9]
```
Uhhhh… hmmm… well that’s not working properly. It never finishes, and every `count_down` gives us `9`. Oh right. You can’t create new countdowns everywhere you need them… you need to _gulp_… _share_ them 😱.

You know the `CountdownLatch` class is thread safe, so that means multiple threads can use an instance of it together safely. You decide to use a global for now to try it out in your jobs… a class-level ivar 😅. Those are problematic, _but_ the countdown code is thread safe so it should be ok, right? This is your time to shine!

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/goldeneye.webp&#34; width=&#34;470&#34; height=&#34;200&#34; alt=&#34;&#34;&gt;

	class AppContext
	  def self.latch
	    @latch ||= CountdownLatch.new(10)
	  end
	end

Just to be safe you try it on some simple threads first…

	10.times.map do
	  Thread.new do
	    latch = AppContext.latch
	    # do some work
	    puts &#34;Count down! [#{latch.count_down}]&#34;
	  end
	end
	
	AppContext.latch.wait

```text
CountdownLatch initialized with a count of 10
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
CountdownLatch initialized with a count of 10
Count down! [9]
```
Well… So much for thread safety? What’s the deal?

You’re seeing `CountdownLatch initialized with a count` repeatedly, so it seems that the instance gets created over and over. Is the `latch` instance different every time?

	10.times.map do |_x|
	  Thread.new do
	    latch = AppContext.latch
	    # do some work
	    puts &#34;Count down! [#{latch.count_down}]&#34;
	    puts &#34;latch id #{latch.object_id}&#34;
	  end
	end
	
	AppContext.latch.wait

🫠 

```text
...
latch id 2700
...
latch id 2720
...
latch id 2740
...
latch id 2760
...
latch id 2780
...
latch id 2800
...
latch id 2820
...
latch id 2840
...
latch id 2860
...
latch id 2880
```
You get a different latch every time. 

&lt;h4 id=&#34;check-then-act&#34;&gt;Memoization / check-then-act&lt;/h4&gt;

You’re hitting a race condition - multiple threads are racing to finish an operation. And this particular situation is so common it’s got a name: “check-then-act”.

In Ruby, the most common case of this is with memoization. 

&gt; 📝 If you don’t know - memoization is one of those funny looking terms, and sounds a bit like if [the priest from The Princess Bride](https://youtu.be/nEe1cTDbXHU?si=OjyohItd-GTD8gdq) tried to say “memorization”.

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/mawwiage.webp&#34; width=&#34;480&#34; height=&#34;260&#34; alt=&#34;&#34;&gt;

&gt; Mem-mwah-zatioooon

&gt; Memoization is just a means for lazy-loading code, or sharing an expensive resource. The first time you need it, you initialize it and store it to a variable you can reuse, like an instance variable.

Memoization is ok within a single thread, but if you’re sharing an object then memoization can lead to “check-then-act” race conditions. The `AppContext` code is using memoization to share the `CountdownLatch`:

	@latch ||= CountdownLatch.new(10)

It looks like a one-line call, but is actually multiple statements.

	if !@latch
	  @latch = CountdownLatch.new(10)
	end

You “check” if the `@latch` is present, “then” you “act” on that information. If it’s present, you don’t do anything. If it isn’t, you create a new `CountdownLatch` and set it to `@latch`. Multiple threads cannot run this code in parallel, but they can swap between each other at inopportune times.

- Thread 1 checks, and finds that `@latch` is `nil`. It instantiates `CountdownLatch` to set it to `@latch`. The `initialize` method in `CountdownLatch` has a `puts` statement. That’s blocking IO, so Thread 1 yields and…
- Thread 2 checks, and finds that `@latch` is `nil`. As it instantiates `CountdownLatch`, it yields due to the blocking IO from `puts`…
- Thread 3 checks, and finds that `@latch` is `nil`… seeing a pattern?

The `puts` statement exists very intentionally to force this condition. Without it, the race condition would happen much less frequently. But it could be a logger call, or it could open a connection to an external resource, or create a Tempfile[^6]. Many things can easily cause the `initialize` to yield to other available threads.

If you want to use memoization safely between threads, you need a `Mutex`. Just like `CountdownLatch` uses a `Mutex` internally to `count_down` safely between threads, we need a `Mutex` to safely share our lazy-loaded instance:

	class AppContext
	  @latch_mutex = Mutex.new
	
	  def self.latch
	    return @latch if @latch
	    @latch_mutex.synchronize do
	      @latch ||= CountdownLatch.new(10)
	    end
	  end
	end

&gt; 💰 My personal metric is that the right amount of mutexes in my code is zero. If I am using a mutex, I think hard to figure out a way to avoid it because it means I’m opening up myself and future devs to a lot of cognitive overhead: you need to think critically anytime you make a change relating to mutex code.
&gt; 
&gt; If you’re a library or framework author they may be unavoidable at some point to do interesting or useful things. In my own application code, I can pretty much _always_ avoid them.
&gt; 
&gt; To be perfectly honest, I’m not confident it’s ok to return the `@latch` early if it’s present. I’m pretty sure it is… but is there a memory visibility implication? Am I creating a new type of edge case or race condition? These are the questions you have to ask yourself when you introduce mutexes.

Ok, `AppContext.latch` memoization take three:

	10.times.map do
	  Thread.new do
	    latch = AppContext.latch
	    # do some work
	    puts &#34;Count down! [#{latch.count_down}]&#34;
	    puts &#34;latch id #{latch.object_id}&#34;
	  end
	end
	
	AppContext.latch.wait
	puts &#34;All threads finished!&#34;

_Finally_, you get a correct countdown!

```text
CountdownLatch initialized with a count of 10
Count down! [9]
latch id 2700
Count down! [6]
latch id 2700
Count down! [3]
latch id 2700
Count down! [0]
latch id 2700
Count down! [7]
latch id 2700
Count down! [1]
latch id 2700
Count down! [4]
latch id 2700
Count down! [8]
latch id 2700
Count down! [5]
latch id 2700
Count down! [2]
latch id 2700
All threads finished!
```
Much better! We should be ready for our jobs now. But is this really the right direction? You’ve resolved the “check-then-act” issue, but you realize there are some pretty obvious limits to this approach:

- Right now we only have one instance of the `CountdownLatch` to share, and it’s hard-coded to 10… What would be the best way to share an object, but also make it configurable?
- What if you have more than one job server? Multiple job servers can pull jobs from the same queue 🤔. You can only synchronize across threads if they live in the same server process. Outside of that, your mutex’s ability to lock anything is just a wish in the breeze.

You need something to share that’s independent of what’s in memory and can represent multiple coordinating sets of jobs.

&lt;h4 id=&#34;read-modify-write&#34;&gt;read-modify-write&lt;/h4&gt;

You’re going to branch a bit outside of threads for concurrency, and get distributed. You rewrite the `CountdownLatch` to:

- Allow specifying an id to key off of, so we can support more than one `CountdownLatch` at a time
- Store the countdown value in [Redis](https://redis.io/), so it can be accessed and counted down from anywhere[^7]
- Independently create `CountdownLatch` instances so you don’t need to share the instance itself. You are sharing the Redis database instead.

```ruby
class DistributedCountdownLatch
  def initialize(id, count = nil)
    @id = id
    @redis = Redis.new
    @redis.set(key, count) if count
  end
	
  def wait
    while current &gt; 0
      puts &#34;Remaining in countdown: [#{current}]&#34;
      sleep(1)
    end
  end
	
  def current
    @redis.get(key).to_i
  end
	
  def count_down
    new_current = current - 1
    @redis.set(key, new_current)
    new_current
  end
	
  def key
    &#34;latch_#{@id}&#34;
  end
end
```
Running it across some threads looks promising.

	latch = DistributedCountdownLatch.new(&#34;hello&#34;, 10)
	10.times.map do |x|
	  Thread.new do
	    puts &#34;hello #{latch.count_down}&#34;
	  end
	end
	
	latch.wait
	puts &#34;world&#34;

So far so good!

```text
Remaining in countdown: [10]
hello 9
hello 8
hello 7
hello 6
hello 5
hello 4
hello 3
hello 2
hello 1
Remaining in countdown: [1]
hello 0
world
```
You run it several times and feel ready to use it in your Sidekiq jobs!

	class TickerJob
	  include Sidekiq::Job
	
	  def perform
	    # do some work
	    latch = DistributedCountdownLatch.new(&#34;tick&#34;)
	    remaining = latch.count_down
	    puts &#34;Tick! [#{remaining}]&#34;
	  end
	end
	
	class CoordinatorJob
	  include Sidekiq::Job
	
	  def perform
	    latch = DistributedCountdownLatch.new(&#34;tick&#34;, 10)
	    10.times do
	      TickerJob.perform_async
	    end
	    latch.wait
	    puts &#34;All jobs finished!&#34;
	  end
	end

Let’s see how it goes:

```text
Tick! [9]
Tick! [9]
Tick! [8]
Tick! [7]
Tick! [7]
Tick! [6]
Tick! [6]
Remaining in countdown: [6]
Tick! [5]
Tick! [5]
Tick! [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining in countdown: [4]
Remaining...
```
Poorly. It went poorly 🥺.

If we walk through the code again:

- Thread 1 reads the value from Redis. That’s blocking IO, so Thread 1 yields and…
- Thread 2 reads the value from Redis. That’s blocking IO, so Thread 2 yields and…
- Thread 3 reads the value from Redis. That’s blocking IO, so Thread 3 yields and…
- Thread 1 gets the value back, which is 10. It subtracts 1 and writes 9 to Redis. That’s blocking IO, so Thread 1 yields and…
- Thread 2 gets the value back, which is also 10. It subtracts 1 and writes 9 to Redis. That’s blocking IO, so Thread 2 yields and…

🙅🏻‍♂️👈🏼👉🏼👇🏼👆🏼🙅🏻‍♂️

You’re experiencing another race condition, and once again it’s common enough to have a name: “read-modify-write”. You’re “read”ing in a value in its current state from Redis, “modify”ing it, then “write”ing it back to Redis. The problem is that for shared resources, each reader “read”s the value in the same state. We have to coordinate the way each client is pulling data from Redis. 

Race conditions happen on shared resources - those shared resources don’t need to live on the same machine.

&gt; 📝 Redis is a single-threaded database, meaning it can only ever execute one thing at a time, no matter how many clients are connected. But being single-threaded does not do anything to fix race conditions. It may only execute one operation at a time, but nothing about that guarantees you won’t execute those operations out of order.

What can you do?

We need something like a `Mutex`, but a `Mutex` that can work across independent servers. A `Mutex` is just a lock you acquire on a resource - can you get that with Redis?

You can simulate a lock using the Redis operation `SET`:

	class DistributedCountdownLatch
	  class WatchError &lt; StandardError; end
	
	  # ... same code as before
	
	  def count_down
	    lock_key = &#34;lock_#{key}&#34;
	    loop do
	      # `nx` stands for `not exists`
	      # `ex` means the lock key will expire in 10 seconds
	      if @redis.set(lock_key, 1, nx: true, ex: 10)
	        new_count = current - 1
	        # `multi` allows every operation to run atomically
	        @redis.multi do |transaction|
	          transaction.set(key, new_count)
	          transaction.del(lock_key)
	        end
	        return new_count
	      end
	      sleep 0.1
	    end
	  end
	end

- First you set a key using the `nx`, or “not exists” flag. If the key already exists, the operation fails. In that case you `sleep` for `0.1` seconds, then try again.
- Second you get the current value.
- Third, you perform a Redis `MULTI` operation, which is a kind of Redis database transaction. We will `SET` the new count, and `DEL`ete our lock key, in one atomic call to Redis. It all works, or it all fails.

&gt; 📝 This type of lock is called a “Pessimistic lock”. Pessimistic locking means you must obtain exclusive access to the lock before running an operation, and retry or raise an error if that is not possible. The alternative is “optimistic” locking, which means you attempt to run the operation, and before committing verify that no other operations have taken place. If they have, you either retry or raise an error.
&gt; 
&gt; 📝 If you’re going to use Redis as a distributed lock, you should use the [redlock-rb gem](https://github.com/leandromoreira/redlock-rb), which implements the Redis team’s recommended algorithm for locking, the [“Redlock” algorithm](https://redis.io/docs/latest/develop/use/patterns/distributed-locks/).

Creating a simple form of a distributed `Mutex` fixes our race condition! Now we only attempt the update once we acquire the lock, retrying until we can:

```text
Tick! [9]
Tick! [8]
Remaining in countdown: [8]
Tick! [7]
Tick! [6]
Tick! [5]
Tick! [4]
Tick! [3]
Tick! [2]
Tick! [1]
Remaining in countdown: [1]
Tick! [0]
All jobs finished!
```
&lt;h4 id=&#34;coordinating-jobs&#34;&gt;Coordinating jobs&lt;/h4&gt;

Even having fixed the “check-then-act” and “read-modify-write” issues, trying to coordinate your jobs using this approach is a lot of work to get right. I wouldn’t recommend it, even though the code is slightly more accurate now.

If you’re interested in more appropriate ways to coordinate jobs:

- Some job servers offer a feature for running and coordinating jobs in batches, with optional success/finish/error callbacks. 
	- Sidekiq offers this in their [paid pro tier](https://sidekiq.org/products/pro.html)
	- [GoodJob](https://github.com/bensheldon/good_job?tab=readme-ov-file#batches) offers batch support
	- I am [working on batch support](https://github.com/rails/solid_queue/pull/142) for [SolidQueue](https://github.com/rails/solid_queue) - feel free to leave feedback!
- If you still are interested in a `CountdownLatch` approach:
	- Redis has built-in `INCR` and `DECR` commands, which handle these updates for you atomically (sketched below)
	- ActiveRecord has an `increment` and `decrement` method you could use to `decrement` a value on a record atomically.
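
For instance, with `DECR` the lock-based `count_down` from earlier could collapse into a single command - a hedged sketch, reusing the `key` helper from `DistributedCountdownLatch`:

	def count_down
	  # DECR performs the read-modify-write inside Redis, atomically -
	  # no lock key, no retry loop
	  @redis.decr(key)
	end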

&lt;h4 id=&#34;read-modify-write-for-days&#34;&gt;read-modify-write shows up in many ways&lt;/h4&gt;

There’s a very good chance you have code susceptible to read-modify-write issues right now. Aside from reading a value and then modifying it, the most common situation I see is around enforcing uniqueness constraints (sketched in code after this list):

- You expect a record in a database to be unique, but you don’t have uniqueness constraints at the database-level.
- You “read” the database and find the record doesn’t exist.
- Another request comes in at the same time and also “read”s the database and finds the record doesn’t exist.
- In this case the “modify” is creating the record. Each request now attempts to create the record.
- With no database-level uniqueness constraint, the result is that each request thinks the record doesn’t exist, and you “write” two (or more) records, violating your uniqueness requirement.
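
In code, the classic shape looks something like this (the model and column are hypothetical):

	# Both requests can pass the &#34;read&#34; before either performs the &#34;write&#34;
	unless User.exists?(email: email) # read: not found in both requests
	  User.create!(email: email)     # write: both create, duplicates result
	end
	
	# The fix is a database-level constraint:
	#   add_index :users, :email, unique: true
	# With it, the losing request raises ActiveRecord::RecordNotUnique
	# instead of silently writing a duplicate.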

&lt;h3 id=&#34;parting-thoughts&#34;&gt;Parting thoughts&lt;/h3&gt;

Phew! That was a lot. We’ve gone deep into 5 different threading mistakes, ways to identify them, and ways to fix them. Let’s finish up with a few topics addressing how to look out for threading issues in your future endeavors, as well as answering some possible remaining questions!

&lt;h3 id=&#34;tips-for-gems&#34;&gt;Tips for auditing gems 💎 &lt;/h3&gt;

Here are some notes on what to look for when auditing a gem for Thread safety.

- If the gem interface is a static method, check the source. It might be an indicator of class-level ivars or thread/fiber-local data (example after this list)
- If the amount of code is small, take some time to understand it. Keep the “threading mistakes” we’ve discussed in mind and see if anything sticks out to you
- If there is a lot of code, look through and make sure it has active maintenance and maintainers. You’re unlikely to learn a large codebase in full, so at least know it’s being actively maintained. The longer-lived and more active a codebase is, the likelier it is that developers and users have already found and removed threading issues.
- If there’s low activity, a lot of code, and you can’t take the time to understand it, you probably want to avoid it
- Make sure `main` has been released into a gem version. The repository can sometimes be misleading - it may contain a fix for the issue you’re hitting that hasn’t been released yet
- If a gem is unmaintained or inactive, it doesn’t mean you can’t use it. _But_, effectively you own it. If you hit issues, you’ll be the one to fix them
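
As an example of that first point about static interfaces, here’s the kind of (hypothetical) pattern worth pausing on when you find it in a gem’s source:

	module SomeGem
	  class &lt;&lt; self
	    # One ivar for the entire process - every thread reads and
	    # writes the same shared state
	    def current_account=(account)
	      @current_account = account
	    end
	
	    attr_reader :current_account
	  end
	end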

&lt;h3 id=&#34;using-falcon&#34;&gt;I use &lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;Falcon&lt;/a&gt; - Fibers are safe from this right?&lt;/h3&gt;

The short answer is “assume not”.

We’ll dig more into the nuances of fibers in “Concurrent, colorless Ruby: Fibers”, but you are always safer to run as if threads will eventually be involved. Falcon can run using Threads in addition to Fibers, for instance.

While Fibers _are_ more deterministic, even with the FiberScheduler, they are still susceptible to some of the same issues as Threads. 

Quick question: Can race conditions happen with Fibers? We’ll walk through that in “Concurrent, colorless Ruby: Fibers”. Or see the handy answer key below.

**.seY: rewsnA**

&lt;h3 id=&#34;using-jruby-truffle&#34;&gt;I use JRuby/TruffleRuby…&lt;/h3&gt;

Good news! These examples will break really fast for you! Congrats!

We’ll talk more about this later in the series. But know for now that JRuby and TruffleRuby use Threads without any implicit locks (ie, the GVL). You will hit threading issues much faster in those environments than in CRuby.

&lt;h3 id=&#34;reforking&#34;&gt;Preforking / Reforking servers&lt;/h3&gt;

Preforking and Reforking servers _do_ share resources. They use `fork`, which initially shares the entire memory space between two processes using something called Copy-On-Write memory. That means things which are not safe to share, like file handles, _are_ shared by default.
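
A minimal sketch of that sharing (Unix only, since `fork` isn’t available everywhere) - after `fork`, the parent and child write through the same underlying file descriptor:

	file = File.open(&#34;shared.log&#34;, &#34;a&#34;)
	
	pid = fork do
	  file.puts &#34;child writing&#34; # same file descriptor as the parent
	  file.flush
	end
	
	file.puts &#34;parent writing&#34;
	file.flush
	Process.wait(pid)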

Thankfully many members of the Ruby community have worked to make that as safe as possible for most popular libraries. We’ll talk more on this topic later in the series, discussing areas where it is handled automatically and areas where it isn’t.

&lt;h3 id=&#34;mutexes-for-globals&#34;&gt;What about using mutexes for globals?&lt;/h3&gt;

You may have seen some of the global examples and thought: “couldn’t you keep them global the way they are and keep them safe using a mutex?”

Good question!

That would mean using a lock to provide coordinated access to your global state. Like taking our Fibonacci example and using a lock here:

	require &#34;monitor&#34;
	
	class Result
	  attr_accessor :value
	end
	
	class Fibonacci
	  @fib_monitor = Monitor.new
	
	  class &lt;&lt; self
	    def result=(value)
	      @fib_monitor.synchronize { @result = value }
	    end
	
	    def result
	      @fib_monitor.synchronize { @result }
	    end
	
	    def calculate(n)
	      @fib_monitor.synchronize do
	        self.result = Result.new
	        result.value = fib(n)
	      end
	    end
	
	    def fib(n)
	      return n if n &lt;= 1
	
	      fib(n - 1) + fib(n - 2)
	    end
	  end
	end

&gt; 📝 We actually use a `Monitor` here instead of a `Mutex` because it supports reentrancy. That means we can re-enter the synchronize block from the same thread, which is necessary in our example (a quick demonstration follows below). Otherwise, it is _largely_ the same as a `Mutex`.
&gt; 
&gt; 📝 You may have noticed I removed the `attr_accessor` and I both set and get using the `Monitor`. I did this out of caution for a concept of “visibility” of shared resources between Threads. We’ll discuss that more later.
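
A quick demonstration of that reentrancy difference:

	require &#34;monitor&#34;
	
	monitor = Monitor.new
	monitor.synchronize do
	  monitor.synchronize { puts &#34;fine - Monitor is reentrant&#34; }
	end
	
	mutex = Mutex.new
	mutex.synchronize do
	  mutex.synchronize { } # raises ThreadError - Mutex is not reentrant
	end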

Now we’ll run it again using `run_forever`:

	answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]
	
	run_forever do |iteration|
	  n = iteration % answers.size
	  Fibonacci.calculate(n)
	  answer = answers[n]
	  result = Fibonacci.result.value
	
	  if result != answer
	    raise &#34;[#{result}] != [#{answer}]&#34;
	  end
	rescue =&gt; e
	  puts &#34;Iteration[#{iteration}] #{e.message}&#34;
	end

It works. We don’t get any errors. The mutex allows the code to treat multiple operations as a single operation. Are there downsides?

- If you’re going to use a `Monitor`/`Mutex` on a piece of global state - keep your mutex scope small to reduce locking overhead
- Don’t hog the GVL. Make sure you don’t wrap your GVL-releasing operations like blocking IO in a `Mutex` (see the sketch after this list).
- Don’t get your mutex scope wrong - incorrect scoping can still leave you with issues like memory inconsistency between Threads.
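
To make the GVL-hogging point concrete, here’s the shape to avoid (the URL is just an example):

	require &#34;net/http&#34;
	
	MUTEX = Mutex.new
	
	MUTEX.synchronize do
	  # The HTTP call releases the GVL, but the mutex stays held - every
	  # other thread waiting on MUTEX waits out the entire network call
	  Net::HTTP.get(URI(&#34;https://example.com&#34;))
	end

While that call is in flight, other threads are free to run Ruby code - unless they need `MUTEX`, in which case your parallelism quietly disappears.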

We’ll dig into GVL hogs and surprising behaviors in “Concurrent, colorless Ruby”. For now the short answer is: you can, but avoid it if possible. It may hurt your ability to parallelize your code.

&lt;h3 id=&#34;detect-and-correct&#34;&gt;It’s not doom and gloom, it’s detect and correct&lt;/h3&gt;

When I highlight a bunch of pitfalls you might hit, it feels like I’m being a virtual storm cloud, just trying to ruin your sunny coding day. In my post [PgBouncer is useful, important, and fraught with peril](https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html), I highlight the tricky parts to inform but I still use the tool happily every day!

It’s the same for Ruby and threading concerns. Almost every language has these same problems, or worse[^8]. Threads are easy to get wrong, so you want to take some golden paths and stay simple when using them. 

Your code is safer if you assume you’re always using threads. You are. And if by some oddity you _really_ _aren’t_, there’s a strong chance you will be eventually.

&lt;h3 id=&#34;takeaways&#34;&gt;Takeaways 🥡&lt;/h3&gt;

- Your Ruby code is threaded. Just assume it is and forget semantics.
- The GVL can make threading bugs harder to produce - if you see suspicious code that _seems_ to work, try again using `run_forever`
- Think thread safety
- If you write any explicitly threaded code, coordinate threads or share data - use existing vetted tools like job batches and `concurrent-ruby`
- Other runtimes like JRuby/TruffleRuby will produce bugs much faster
- As much as possible, don’t share resources between threads. Forget your lessons from childhood: sharing is bad.

&lt;h3 id=&#34;next-up&#34;&gt;Next up&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/7fee80c6bc.jpeg&#34; width=&#34;50%&#34; height=&#34;50%&#34; alt=&#34;&#34;&gt;

&gt; **source**: jpcamara.com

This is the second step into this Ruby concurrency series. Next up we’ll discuss the concept of “colorless” programming, and how it enhances the Ruby experience. We’ll also take our first dive into how the layers of Ruby work, starting with Threads.

More soon 👋🏼!

[^1]:	And any blocking operation

[^2]:	💩 

[^3]:	There is a short section in the thread docs that describes this behavior as well [https://docs.ruby-lang.org/en/3.3/Thread.html#class-Thread-label-Fiber-local+vs.+Thread-local](https://docs.ruby-lang.org/en/3.3/Thread.html#class-Thread-label-Fiber-local+vs.+Thread-local)

[^4]:	`Fiber#store` and adding attr accessors to Thread/Fiber

[^5]:	The concept can also be used for other units of concurrency, like fibers

[^6]:	Or use a file mutex [https://github.com/yegor256/futex](https://github.com/yegor256/futex)

[^7]:	Other databases would also work, but the Redis data model is the simplest to get going for a simple value. Sidekiq works off of Redis as well, so you already would have it in your stack

[^8]:	So don’t let your Python or Java friends try to bully you about it 😉. JavaScript can suffer similar issues as well.

    I’ve never supported a production Rust or Elixir application so I can’t speak from experience, but those are the only two languages I can think of that might fare a lot better in this regard (and similar languages like F#, Gleam, Clojure, etc). 

    Elixir because of its pure functional approach. Rust because it’s one of the first non-functional languages I’ve ever encountered that has strong consistency baked into its core. They eliminate a class of threading bugs by their nature, but can still experience issues unique to how they work.
</source:markdown>
    </item>
    
    <item>
      <title>Your Ruby programs are always multi-threaded: Part 1</title>
      <link>https://jpcamara.com/2024/06/04/your-ruby-programs-are-always.html</link>
      <pubDate>Tue, 04 Jun 2024 05:00:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2024/06/04/your-ruby-programs-are-always.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/image-5-13-24-7-52pm.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Your Ruby programs are always multi-threaded: Part 1&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html&#34;&gt;Your Ruby programs are always multi-threaded: Part 2&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html&#34;&gt;Consistent, request-local state&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/07/15/ruby-methods-are.html&#34;&gt;Ruby methods are colorless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/08/26/the-thread-api.html&#34;&gt;The Thread API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html&#34;&gt;Bitmasks, Ruby Threads and Interrupts, oh my!&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html&#34;&gt;When good threads go bad&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Thread and its MaNy friends&lt;/li&gt;
&lt;li&gt;Fibers&lt;/li&gt;
&lt;li&gt;Processes, Ractors and alternative runtimes&lt;/li&gt;
&lt;li&gt;Scaling concurrency with streaming&lt;/li&gt;
&lt;li&gt;Abstracted, concurrent Ruby&lt;/li&gt;
&lt;li&gt;Closing thoughts, kicking the tires and tangents&lt;/li&gt;
&lt;li&gt;How I dive into CRuby concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You’re reading “Your Ruby programs are always multi-threaded: Part 1”. I’ll update the links as each part is released, and include these links in each post.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;&lt;strong&gt;Part 1&lt;/strong&gt;&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#its-all-threaded&#34;&gt;It’s all threaded&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#how-threaded-is-it&#34;&gt;Ok but how threaded is it &lt;em&gt;really&lt;/em&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#threading-mistakes&#34;&gt;Threading mistakes&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href=&#34;#sharing-ivars&#34;&gt;Sharing module/class instance variables&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#heisenbugs&#34;&gt;Heisenbugs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#ruby-internals&#34;&gt;Ruby internals&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#back-in-reality&#34;&gt;Back in reality…&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;2&#34;&gt;
&lt;li&gt;&lt;a href=&#34;#copying-state&#34;&gt;Copying state to instances&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#back-in-reality-two&#34;&gt;Back in reality… part two&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;3a. &lt;a href=&#34;#cleaning-up&#34;&gt;Cleaning up thread state&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#using-sidekiq&#34;&gt;If you’re using Sidekiq&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#using-activejob&#34;&gt;If you’re using ActiveJob&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;em&gt;&lt;strong&gt;Part 2&lt;/strong&gt;&lt;/em&gt;
&lt;ul&gt;
&lt;li&gt;3b. &lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#sharing-state-with-fibers&#34;&gt;Sharing state with fibers&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#wild-fiber&#34;&gt;A wild Fiber appeared!&lt;/a&gt; 🌾&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#ruby-concurrency-layers&#34;&gt;Ruby concurrency has layers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#actual-thread-locals&#34;&gt;Actual thread-locals&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#current-attributes&#34;&gt;CurrentAttributes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#request-store&#34;&gt;RequestStore&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#back-in-reality-three&#34;&gt;Back in reality… part three&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;4&#34;&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#reusing-objects&#34;&gt;Reusing objects&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;ol start=&#34;5&#34;&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#race-conditions&#34;&gt;Race conditions&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#threading-concepts&#34;&gt;Threading concepts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#check-then-act&#34;&gt;Memoization / check-then-act&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write&#34;&gt;read-modify-write&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#coordinating-jobs&#34;&gt;Coordinating jobs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write-for-days&#34;&gt;read-modify-write shows up in many ways&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#parting-thoughts&#34;&gt;Parting thoughts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#tips-for-gems&#34;&gt;Tips for auditing gems&lt;/a&gt; 💎&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#using-falcon&#34;&gt;I use Falcon - Fibers are safe from this right?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#using-jruby-truffle&#34;&gt;I use JRuby/TruffleRuby…&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#reforking&#34;&gt;Preforking / Reforking servers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#mutexes-for-globals&#34;&gt;What about using mutexes for globals?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#detect-and-correct&#34;&gt;It’s not doom and gloom, it’s detect and correct&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jpcamara.com/2024/06/23/your-ruby-programs.html#takeaways&#34;&gt;Takeaways&lt;/a&gt; 🥡&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;its-all-threaded&#34;&gt;It&#39;s all threaded&lt;/h2&gt;
&lt;p&gt;If you run a web server using &lt;a href=&#34;https://github.com/puma/puma&#34;&gt;Puma&lt;/a&gt;, &lt;a href=&#34;https://github.com/socketry/falcon&#34;&gt;Falcon&lt;/a&gt;&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;, &lt;a href=&#34;https://github.com/ruby/webrick&#34;&gt;Webrick&lt;/a&gt;, or &lt;a href=&#34;https://github.com/ohler55/agoo&#34;&gt;Agoo&lt;/a&gt;&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;, you use threads.&lt;/p&gt;
&lt;p&gt;If you run background jobs using &lt;a href=&#34;https://github.com/sidekiq/sidekiq&#34;&gt;Sidekiq&lt;/a&gt;, &lt;a href=&#34;https://github.com/bensheldon/good_job&#34;&gt;GoodJob&lt;/a&gt;, &lt;a href=&#34;https://github.com/rails/solid_queue&#34;&gt;SolidQueue&lt;/a&gt;&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;, or &lt;a href=&#34;https://github.com/que-rb/que&#34;&gt;Que&lt;/a&gt;, you use threads.&lt;/p&gt;
&lt;p&gt;If you use &lt;a href=&#34;https://github.com/rails/rails&#34;&gt;ActiveRecord&lt;/a&gt;, or &lt;a href=&#34;https://github.com/rails/rails&#34;&gt;Rails&lt;/a&gt;, you use threads.&lt;/p&gt;
&lt;p&gt;If you use the &lt;a href=&#34;https://github.com/newrelic/newrelic-ruby-agent&#34;&gt;NewRelic&lt;/a&gt;, &lt;a href=&#34;https://github.com/DataDog/dd-trace-rb&#34;&gt;DataDog&lt;/a&gt;, &lt;a href=&#34;https://github.com/getsentry/sentry-ruby&#34;&gt;Sentry&lt;/a&gt;, &lt;a href=&#34;https://github.com/launchdarkly/ruby-server-sdk&#34;&gt;LaunchDarkly&lt;/a&gt;, or &lt;a href=&#34;https://github.com/honeybadger-io/honeybadger-ruby&#34;&gt;Honeybadger&lt;/a&gt; gems, you use threads.&lt;/p&gt;
&lt;p&gt;If you use &lt;a href=&#34;https://github.com/ruby/net-http&#34;&gt;net/http&lt;/a&gt; (or any gem internally using it like &lt;a href=&#34;https://github.com/jnunemaker/httparty&#34;&gt;HTTParty&lt;/a&gt; or &lt;a href=&#34;https://github.com/rest-client/rest-client&#34;&gt;RestClient&lt;/a&gt;), &lt;a href=&#34;https://gitlab.com/os85/httpx&#34;&gt;httpx&lt;/a&gt;, or the &lt;a href=&#34;https://github.com/ruby/timeout&#34;&gt;Timeout&lt;/a&gt;&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt; gem, you use threads 🧵.&lt;/p&gt;
&lt;p&gt;And because of the above, even if you use a forked, single-threaded server like &lt;a href=&#34;https://github.com/Shopify/pitchfork&#34;&gt;Pitchfork&lt;/a&gt; or &lt;a href=&#34;https://github.com/defunkt/unicorn&#34;&gt;Unicorn&lt;/a&gt;, or run &lt;a href=&#34;https://github.com/puma/puma&#34;&gt;Puma&lt;/a&gt; in worker mode with no additional threads, or run jobs with &lt;a href=&#34;https://github.com/resque/resque&#34;&gt;Resque&lt;/a&gt;, or just use a basic Ruby rake task, you are using threads!&lt;/p&gt;
&lt;p&gt;The point isn’t to be exhaustive. There are plenty more gems which use threads internally - you’re probably using some I haven’t listed. I’m also intentionally avoiding nuance - there are varying levels of threaded-ness.&lt;/p&gt;
&lt;p&gt;The point &lt;em&gt;is&lt;/em&gt; to demonstrate that threads are &lt;strong&gt;everywhere&lt;/strong&gt; in Ruby, even if you are not working with them explicitly. Always write your code to be as thread safe as possible. Act as if you’re in highly threaded code and you’ll live a much happier life.&lt;/p&gt;
&lt;h2 id=&#34;how-threaded-is-it&#34;&gt;Ok but how threaded is it &lt;em&gt;really&lt;/em&gt;&lt;/h2&gt;
&lt;p&gt;The degree of thread safety concern you need to have &lt;em&gt;does&lt;/em&gt; vary depending on how threaded your Ruby environment is.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Sidekiq, Puma, SolidQueue, or GoodJob: very threaded. If there are threading bugs in code you use they will come up, eventually.&lt;/li&gt;
&lt;li&gt;Falcon: Can be configured to be threaded. If it is, threading bugs will come up eventually. By default it only uses Fibers, which can still be susceptible to some concurrency issues.&lt;/li&gt;
&lt;li&gt;Pitchfork, Unicorn or Resque: not threaded, but you’re probably using threaded gems or writing threaded code. I’ll discuss later how these gems can still hit threading issues on process based servers, and other concurrency adjacent issues you might encounter.&lt;/li&gt;
&lt;li&gt;A one-off Ruby script with no gems: you still exist in a thread, but if it’s just something like processing files sequentially, it’s practically not threaded. You’re probably fine, but YMMV 😬&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It’s not just about writing explicitly threaded code that is safe to run. If you’re using &lt;code&gt;Thread.new&lt;/code&gt;, you know you’re working with threaded code, and you know you need to proceed carefully. If you don’t know that, be careful! Threading has many sharp edges - it’s easy to get wrong.&lt;/p&gt;
&lt;p&gt;Threads introduce non-determinism into your code - what works on one execution may not work on the next, even with identical inputs. It can lull you into a false sense of security, because that failure state may not appear for a long time. Writing safe threaded code takes careful analysis and planning.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/24c9850ebd.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: Eli and JP Camara, &lt;a href=&#34;https://x.com/logicalcomic&#34;&gt;https://x.com/logicalcomic&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Writing safe threaded code can be difficult, but writing safe non-threaded Ruby code in a threaded environment doesn’t have to be. There are some top mistakes I’ve seen (or personally made) to keep an eye out for in your own code, and code in gems you bring into your projects. Let’s take a look at them, and we’ll learn some core Ruby/CRuby principles along the way as well.&lt;/p&gt;
&lt;h2 id=&#34;threading-mistakes&#34;&gt;Threading mistakes&lt;/h2&gt;
&lt;h3 id=&#34;sharing-ivars&#34;&gt;Sharing module/class instance variables&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 For brevity, I’ll be referring to instance variables as “ivars”&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;If there’s a classic example of what &lt;em&gt;not&lt;/em&gt; to do for thread safety, it&amp;rsquo;s maintaining and modifying class-level ivars&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Not knowing that, you embark on implementing some file processing code. Finding yourself in need of code that splits a file into chunks by newline, you write the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;tempfile&amp;quot;

class FileSplitter
  def self.split(file_path, max_lines)
    lines = []
    files = []
    File.open(file_path) do |file|
      file.each_line do |line|
        lines &amp;lt;&amp;lt; line
        if lines.size &amp;gt; max_lines
          output = Tempfile.new
          lines.each do |line|
            output.write(line)
          end
          lines = []
          files &amp;lt;&amp;lt; output
        end
      end

      # handle any remaining lines
      if lines.size &amp;gt; 0
        output = Tempfile.new
        lines.each do |line|
          output.write(line)
        end
        files &amp;lt;&amp;lt; output
      end

      files
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The &lt;code&gt;split&lt;/code&gt; method takes a file path, a number of max lines per file, and returns an array of Tempfiles. Each Tempfile contains a chunk of the original file. In some cases you upload each chunk to a file service like S3, in other cases you hand them off to methods for further processing&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;The method is kind of hefty, and there’s some clear duplication so you decide to refactor.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/6d3eb93e-bff9-485f-b91f-1c929ca1ba77.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: &lt;a href=&#34;https://theycantalk.com/post/710363232083361793/plans&#34;&gt;https://theycantalk.com/post/710363232083361793/plans&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You decide the logical split in the code is around writing to the temp files, a new method you’ll call &lt;code&gt;write_lines&lt;/code&gt;. There are so many variables that it just seems easier to use ivars instead of passing everything around.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class FileSplitter
  def self.split(file_path, max_lines)
    @lines = []
    @files = []
    File.open(file_path) do |file|
      file.each_line do |line|
        @lines &amp;lt;&amp;lt; line
        write_lines(max_lines)
      end
      # force remaining to write
      write_lines
      @files
    end
  end

  def self.write_lines(max = nil)
    return if @lines.empty?

    if max.nil? || @lines.size &amp;gt; max
      @output = Tempfile.new
      @lines.each do |line|
        @output.write(line)
      end
      @lines = []
      @files &amp;lt;&amp;lt; @output
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Since the method is at the class level, the ivars are class level as well. For any code calling these methods, they only exist once. Every thread shares the same class object.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;del&gt;📝 If this doesn’t &lt;em&gt;look&lt;/em&gt; like class-level ivars, you may be used to the more explicit syntax using double at-signs. For example, &lt;code&gt;@@lines&lt;/code&gt;. These are equivalent, but when you’re inside a class method, class-level ivars only need one at-sign. When inside a class method, the instance is the class itself.&lt;/del&gt;&lt;/p&gt;
&lt;p&gt;&lt;del&gt;Another format of this would be a class level &lt;code&gt;attr_accessor&lt;/code&gt;. So if you see a class method with &lt;code&gt;self.lines = []&lt;/code&gt; or see code like &lt;code&gt;FileSplitter.lines = []&lt;/code&gt;, these can &lt;em&gt;also&lt;/em&gt; be class level ivars!&lt;/del&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Some commenters have pointed out that the above is not accurate - until I update it more accurately, see the &lt;a href=&#34;https://www.ruby-lang.org/en/documentation/faq/8/&#34;&gt;ruby lang docs&lt;/a&gt; for more information about class level instance variable differences&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
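&lt;p&gt;As a quick sketch of the distinction (illustrative class names, not from the linked docs): a class-level ivar lives on a single class object and is not inherited, while a double at-sign class variable is shared across the whole hierarchy:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Parent
  @setting = &amp;quot;parent ivar&amp;quot;    # class-level ivar: lives on the Parent class object only
  @@shared = &amp;quot;class variable&amp;quot; # class variable: shared down the hierarchy

  def self.setting
    @setting
  end

  def self.shared
    @@shared
  end
end

class Child &amp;lt; Parent
end

puts Parent.setting.inspect # =&amp;gt; &amp;quot;parent ivar&amp;quot;
puts Child.setting.inspect  # =&amp;gt; nil, class-level ivars are not inherited
puts Child.shared.inspect   # =&amp;gt; &amp;quot;class variable&amp;quot;, @@ variables are shared
&lt;/code&gt;&lt;/pre&gt;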
&lt;p&gt;You are running this code in a background job using Sidekiq, running with multiple threads on CRuby 3.3. Most of the time the job runs fine, but you’ve received a few reports from people missing data, and some people even seeing data that wasn’t in their files! Uh oh, what is happening?!&lt;/p&gt;
&lt;p&gt;The problem is that every thread is sharing the same data, and modifying it concurrently. If multiple jobs are running and calling &lt;code&gt;FileSplitter.split&lt;/code&gt;, they are all trying to write to the same piece of memory. We can follow along with the code line by line to see where things break down:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;### Thread 1
# FileSplitter.split(path, max) -&amp;gt;
@lines = []
@files = []
# ...
@output = Tempfile.new
@output.write(line) # `write` causes the thread to sleep so Thread 2 starts

### Thread 2
# Calling `split` resets the instance variables
# Now each thread is using the same variables!
# FileSplitter.split(path, max) -&amp;gt;
@lines = []
@files = []
# ...
# Next time it wakes up, Thread 1 will be appending to the same array as Thread 2 💀 
@lines &amp;lt;&amp;lt; line
# ...
# Thread 1&#39;s Tempfile is overwritten and lost, along with anything written to it 👋🏼
@output = Tempfile.new
@output.write(line) # Now `write` causes Thread 2 to sleep

### Thread 1 wakes up and continues processing
# The next line from Thread 1 gets written to the output file from Thread 2
# We are now 100% off the rails...
# Pour one out for our mixed-and-matched
# user data ☠️☠️☠️
@lines.each do |line|
  @output.write(line)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Or the same flow, visualized:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-page-1.drawio-3.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The simplest advice about the above code?&lt;/p&gt;
&lt;p&gt;Don’t use shared instance variables in classes, ever 🙅🏻‍♂️. Assume that a class level ivar is a &lt;a href=&#34;https://en.m.wikipedia.org/wiki/Code_smell&#34;&gt;code smell&lt;/a&gt;, until proven otherwise.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 There is a caveat to this, and it’s configuration and initialization class-level ivars. It is common to set global configuration values and macros&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt; using class-level ivars. The difference is that these are written once on load of your application, and then only ever &lt;em&gt;read&lt;/em&gt; past that point. If they are not properly frozen you &lt;em&gt;could&lt;/em&gt; still corrupt them by modifying them during program execution, which is why things like Rails middleware &lt;code&gt;freeze&lt;/code&gt; the middleware array to try and avoid this issue.&lt;/p&gt;
&lt;/blockquote&gt;
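&lt;p&gt;Here’s a minimal sketch of that write-once pattern (the class and keys are made up for illustration): configure at boot, &lt;code&gt;freeze&lt;/code&gt;, and only ever read afterwards:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppConfig
  class &amp;lt;&amp;lt; self
    attr_reader :settings
  end

  def self.configure
    @settings = {}
    yield @settings
    # Freeze so later writes raise FrozenError instead of silently
    # corrupting shared state. Note that freeze is shallow - nested
    # objects need their own freeze.
    @settings.freeze
  end
end

# Run once at boot, before any threads are serving work
AppConfig.configure do |settings|
  settings[:algorithm] = &amp;quot;AES-256-GCM&amp;quot;
end

# Threads only ever read past this point
AppConfig.settings[:algorithm] # =&amp;gt; &amp;quot;AES-256-GCM&amp;quot;
&lt;/code&gt;&lt;/pre&gt;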
&lt;p&gt;How can we make this code thread safe? Use regular Ruby instances. We don’t have to change much.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class FileSplitter
  def initialize(file_path, max_lines)
    @file_path = file_path
    @max_lines = max_lines
    @lines = []
  end

  def split
    @files = []

    File.open(@file_path) do |file|
      file.each_line do |line|
        @lines &amp;lt;&amp;lt; line
        write_lines(@max_lines)
      end
      # force remaining to write
      write_lines
      @files
    end
  end

  private

  def write_lines(max = nil)
    return if @lines.empty?

    if max.nil? || @lines.size &amp;gt; max
      # Also remove @output as an ivar,
      # we only use it right here so it
      # being an ivar wasn&#39;t necessary 
      output = Tempfile.new
      @lines.each do |line|
        output.write(line)
      end
      @lines = []
      @files &amp;lt;&amp;lt; output
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now when you use this code, each thread creates its own &lt;code&gt;FileSplitter&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;### Thread 1
splitter = FileSplitter.new(path, max)
splitter.split

### Thread 2
splitter = FileSplitter.new(path, max)
splitter.split
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Each thread now safely uses its own independent, isolated instance 😌.&lt;/p&gt;
&lt;h4 id=&#34;heisenbugs&#34;&gt;Heisenbugs&lt;/h4&gt;
&lt;p&gt;&lt;img src=&#34;https://pbmo.files.wordpress.com/2012/11/walter-white1.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: &lt;a href=&#34;https://thedailyomnivore.net/2012/11/30/heisenbug/&#34;&gt;https://thedailyomnivore.net/2012/11/30/heisenbug/&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;A heisenbug is a software bug that seems to disappear or alter its behavior when one attempts to study it.&lt;/p&gt;
&lt;p&gt;The term is a pun on the name of Werner Heisenberg, the physicist who first asserted the observer effect of quantum mechanics, which states that the act of observing a system inevitably alters its state.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The effects of sharing class-level ivars can be deceptive. For instance, try running the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Result
  attr_accessor :value
end

class Fibonacci
  class &amp;lt;&amp;lt; self
    attr_accessor :result

    def calculate(n)
      self.result = Result.new
      self.result.value = fib(n)
    end

    def fib(n)
      return n if n &amp;lt;= 1
      fib(n - 1) + fib(n - 2)
    end
  end
end

answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]

answers.size.times.map do |n|
  Thread.new do
    Fibonacci.calculate(n)
    answer = answers[n]
    result = Fibonacci.result.value

    if result != answer
      raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
    end
  end
end.map(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Here we have a naive&lt;sup id=&#34;fnref:8&#34;&gt;&lt;a href=&#34;#fn:8&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;8&lt;/a&gt;&lt;/sup&gt;, basic Fibonacci generator, and we verify each result against an array of precomputed results. We spawn as many threads as there are precomputed results to calculate against. We’re storing the result in a class-level ivar that is being shared across threads - that shouldn’t end well! If any results don’t match, we raise an error.&lt;/p&gt;
&lt;p&gt;Run this example and… it doesn’t fail in CRuby 🤔. This might lull you into thinking it isn’t an issue.&lt;/p&gt;
&lt;p&gt;Is CRuby somehow thread safe? &lt;strong&gt;No&lt;/strong&gt;, we’ve already established with the &lt;code&gt;FileSplitter&lt;/code&gt; example that it is not inherently thread safe.&lt;/p&gt;
&lt;p&gt;Maybe that Global VM Lock (GVL) you’ve heard about is protecting you from this issue? &lt;strong&gt;Unintentionally&lt;/strong&gt;, yes.&lt;/p&gt;
&lt;p&gt;In CRuby, only one thread running Ruby code can run at a time. That means for our example you can’t get the class data overwritten without what’s called a “context switch”: something that causes the Ruby runtime to swap between our threads, even mid-operation, like during a method call or variable assignment.&lt;/p&gt;
&lt;p&gt;There are two common reasons context gets switched between threads in CRuby, which can result in operations only partially completing (i.e., setting the proper result, then checking that result):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;~100ms of Ruby processing have elapsed&lt;/li&gt;
&lt;li&gt;A blocking operation has been invoked&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Neither of these cases is met by our example. You’re benefiting from low overhead and a lack of blocking operations - there isn’t enough going on to cause a context switch. Without (1) or (2), each thread runs and completes in its entirety before the next thread runs.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 If you don’t know what the GVL is - or only know it from a high level - you’ll learn all about it in “Concurrent, colorless Ruby” later on&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I titled this section “Heisenbugs” because you can get the bug to happen by altering things in what would appear to be innocuous ways. We’ll try two things:&lt;/p&gt;
&lt;p&gt;For (1), we’ll try with more calculations:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610, 987, 1597, 2584, 4181, 6765, 10946, 17711, 28657, 46368, 75025, 121393, 196418, 317811, 514229, 832040, 1346269, 2178309, 3524578, 5702887, 9227465, 14930352, 24157817]

answers.size.times.map do |n|
  Thread.new do
    Fibonacci.calculate(n)
    answer = answers[n]
    result = Fibonacci.result.value

    if result != answer
      raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
    end
  end
end.map(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;💥! We get an error raised! So what happened?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;[] != [832040] (RuntimeError)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;In the recursive Fibonacci solution, the code gets exponentially slower the more numbers we try to generate. And we now know when pure Ruby code is being run, the CRuby thread scheduler only allows each thread to run in 100ms time slices. This is so each thread can share processing time.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 CPU intensive work doesn’t run more efficiently with threads in CRuby, but you will still encounter it on threaded servers.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We can see this time slicing in action with the &lt;code&gt;gvl-tracing&lt;/code&gt; gem, which creates a timeline of thread context switching using the &lt;a href=&#34;https://bugs.ruby-lang.org/issues/18339&#34;&gt;CRuby GVL Instrumentation API&lt;/a&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;gvl-tracing&amp;quot;

GvlTracing.start(&amp;quot;timeline.json&amp;quot;) do
  answers.size.times.map do |n|
    Thread.new do
      Fibonacci.calculate(n)
      answer = answers[n]
      result = Fibonacci.result.value

      if result != answer
        raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
      end
    end
  end.map(&amp;amp;:join)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/1d4007c2a5.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Our threads’ &lt;code&gt;running&lt;/code&gt; slices form a sort of waterfall because of the GVL. They spend most of their time in the &lt;code&gt;wants_gvl&lt;/code&gt; state, which means they’re ready for work but are waiting to acquire the GVL. When they run, they can only hold the GVL for around 100ms of Ruby processing before the CRuby scheduler passes control to another thread. You see each thread in a &lt;code&gt;running&lt;/code&gt; state for small slices of time - the highlighted thread ran 97ms before having control pass back. As the Fibonacci numbers get larger, the threads need more 100ms slices to finish, which is why each thread has so many &lt;code&gt;running&lt;/code&gt; blocks.&lt;/p&gt;
&lt;p&gt;It’s also why we start to hit issues with our class level ivars. The scheduler can stop a thread mid operation, or in between variable assignments. It takes more iterations than shown, but we can visualize the issue like this:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-fibonacci.drawio-1.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Can we try a bit harder and get our original example to break as well? Even though it has low throughput?&lt;/p&gt;
&lt;p&gt;Let’s create a helper to simulate some load.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;concurrent-ruby&amp;quot;

def run_forever(pool_size: 10)
  pool = Concurrent::FixedThreadPool.new(
    pool_size
  )
  i = Concurrent::AtomicFixnum.new

  loop do
    pool.post do
      yield i.increment
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In the &lt;code&gt;run_forever&lt;/code&gt; method we utilize a gem called &lt;code&gt;concurrent-ruby&lt;/code&gt; to endlessly run our code across multiple threads. It uses a pool of 10 threads, and each time a thread runs it maintains a thread-safe counter which it yields as the current iteration. We can now use it to run our original Fibonacci example:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]

run_forever do |iteration|
  n = iteration % answers.size
  Fibonacci.calculate(n)
  answer = answers[n]
  result = Fibonacci.result.value

  if result != answer
    raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
  end
rescue =&amp;gt; e
  puts &amp;quot;Iteration[#{iteration}] #{e.message}&amp;quot;
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;💥. This time, we start seeing errors!&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Iteration[36981] [13] != [34]
Iteration[154569] [89] != [144]
Iteration[173571] [89] != [21]
Iteration[197573] [] != [144]
Iteration[199483] [] != [89]
Iteration[201395] [] != [144]
Iteration[203330] [] != [55]
Iteration[207180] [5] != [144]
Iteration[209117] [5] != [144]
Iteration[211039] [5] != [55]
Iteration[214849] [5] != [89]
Iteration[234986] [8] != [89]
Iteration[221961] [1] != [144]
Iteration[223872] [1] != [144]
Iteration[225767] [1] != [34]
Iteration[244243] [0] != [144]
Iteration[218269] [1] != [144]
Iteration[236807] [8] != [144]
Iteration[240446] [55] != [89]
Iteration[231318] [1] != [34]
Iteration[246175] [0] != [13]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;For (2), let’s try it again but call &lt;code&gt;puts&lt;/code&gt;, a blocking operation:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;answers.size.times.map do |n|
  Thread.new do
    Fibonacci.calculate(n)
    puts &amp;quot;Get ready to check for #{n}!&amp;quot;
    answer = answers[n]
    result = Fibonacci.result.value

    if result != answer
      raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
    end
  end
end.map(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;code&gt;puts&lt;/code&gt; gives us a pretty reliable failure:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;[144] != [0] (RuntimeError)
Get ready to check for 1!
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;The same basic idea happens - we perform the calculation, but before checking the result we print the message &lt;code&gt;Get ready to check for #{n}!&lt;/code&gt;. This is a blocking operation in CRuby&lt;sup id=&#34;fnref:9&#34;&gt;&lt;a href=&#34;#fn:9&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;9&lt;/a&gt;&lt;/sup&gt; - it causes our thread to yield and another thread to get scheduled. When the original thread regains control there’s a good chance the value has been swapped out and the comparison will fail. This is the main reason our original &lt;code&gt;FileSplitter&lt;/code&gt; code failed so reliably - the &lt;code&gt;Tempfile&lt;/code&gt; &lt;code&gt;write&lt;/code&gt; method is a blocking operation and caused a context switch.&lt;/p&gt;
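&lt;p&gt;You don’t have to rely on &lt;code&gt;puts&lt;/code&gt; to surrender control, either. As a sketch of the same failure (a variation on the example above), &lt;code&gt;Thread.pass&lt;/code&gt; explicitly hints the scheduler to run another thread, widening the gap between calculating and checking:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;answers.size.times.map do |n|
  Thread.new do
    Fibonacci.calculate(n)
    # Explicitly offer to switch threads before reading the result
    Thread.pass
    answer = answers[n]
    result = Fibonacci.result.value

    if result != answer
      raise &amp;quot;[#{result}] != [#{answer}]&amp;quot;
    end
  end
end.map(&amp;amp;:join)
&lt;/code&gt;&lt;/pre&gt;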
&lt;h4 id=&#34;ruby-internals&#34;&gt;Ruby internals&lt;/h4&gt;
&lt;p&gt;Interesting side note, this was my original code for this section, which I initially tried on CRuby 3.2.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Repeater
  class &amp;lt;&amp;lt; self
    attr_accessor :result

    def repeat_by(content, n)
      self.result = [content] * n
    end
  end
end

100.times do
  100.times.map do |n|
    Thread.new do
      Repeater.repeat_by(&amp;quot;hello&amp;quot;, n)
      array_size = Repeater.result.size
      if array_size != n
        raise &amp;quot;[#{array_size}] should be [#{n}]?&amp;quot;
      end
    end
  end.map(&amp;amp;:join)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;[88] should be [98]? (RuntimeError)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;That code failed reliably on Mac and Linux on Ruby 3.2. But on Ruby 3.3, I couldn’t get it to fail no matter how many times I ran it. Even between minor Ruby versions the thread internals can change enough to invalidate an evaluation of unsafe code. In this case the code benefited from the change, but there are no guarantees the next version won’t fail even more frequently than 3.2&lt;sup id=&#34;fnref:10&#34;&gt;&lt;a href=&#34;#fn:10&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;10&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Using our &lt;code&gt;run_forever&lt;/code&gt; helper, however, we can eventually get this to fail on CRuby 3.3:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;run_forever do |iteration|
  n = rand(100)
  Repeater.repeat_by(&amp;quot;hello&amp;quot;, n)
  array_size = Repeater.result.size
  if array_size != n
    raise &amp;quot;[#{array_size}] should be [#{n}]?&amp;quot;
  end
rescue StandardError =&amp;gt; e
  puts &amp;quot;Iteration[#{iteration}] #{e.message}&amp;quot;
end
&lt;/code&gt;&lt;/pre&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Iteration[323650] [52] should be [35]?
Iteration[408269] [59] should be [18]?
Iteration[563087] [51] should be [16]?
Iteration[623992] [8] should be [91]?
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;It takes hundreds of thousands of attempts but it does ultimately fail. But why did running it so many times cause it to fail? There’s no blocking operation. It shouldn’t take 100ms to run. In “Concurrent, colorless Ruby” we’ll dig even further into other reasons your thread context switches.&lt;/p&gt;
&lt;h4 id=&#34;back-in-reality&#34;&gt;Back in reality...&lt;/h4&gt;
&lt;p&gt;The community has a &lt;em&gt;much&lt;/em&gt; better understanding of this type of issue now vs 10+ years ago. But it still happens and I’ve seen it recently in gems, even ones that are well maintained.&lt;/p&gt;
&lt;p&gt;Watch out for this in your own code. Watch out for this in code reviews. Watch out for this in gems. If you see it and it &lt;em&gt;seems&lt;/em&gt; to run ok, use &lt;code&gt;run_forever&lt;/code&gt; and you’ll likely see it fail.&lt;/p&gt;
&lt;p&gt;Assume shared class instance variables are a bad idea. Knowing that, you try something new…&lt;/p&gt;
&lt;h3 id=&#34;copying-state&#34;&gt;Copying state to instances&lt;/h3&gt;
&lt;p&gt;You’re maintaining the &lt;code&gt;please_encrypt&lt;/code&gt; gem (a very polite encryption layer), and you have a class level macro that defines which attributes to encrypt:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Member
  include PleaseEncrypt
  please_encrypt :home_address, :phone
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Learning from your &lt;code&gt;FileSplitter&lt;/code&gt; issue, you’ve removed all of your shared class level ivars in the gem. But you’ve just added a new class level ivar that you’re &lt;em&gt;sure&lt;/em&gt; is safe.&lt;/p&gt;
&lt;p&gt;You know not to &lt;em&gt;share&lt;/em&gt; class-level instance variables, but you are now using some for default values. The macro defines class level ivars containing metadata about the encryption. It’s only defined once on class load, so you won’t be contending with threads overwriting each other&lt;sup id=&#34;fnref:11&#34;&gt;&lt;a href=&#34;#fn:11&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;11&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;When an instance uses the &lt;code&gt;encryptables&lt;/code&gt; metadata, you &lt;code&gt;dup&lt;/code&gt; the original value so nothing gets shared by accident:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# member.rb
class Member
  include PleaseEncrypt
  please_encrypt :home_address, :phone
end

# please_encrypt.rb
module PleaseEncrypt
  # ...
  def pleasant_options
    # `dup` to avoid sharing across threads
    @pleasant_options ||= self.class.encryptables.dup
  end

  def pleasant_attributes
    @pleasant_attributes ||= # ...
  end

  module ClassMethods
    attr_accessor :encryptables

    def please_encrypt(*fields)
      self.encryptables = {}
      fields.each do |field|
        self.encryptables[field] = {
          algorithm: &amp;quot;AES-256-GCM&amp;quot;,
          key_method: :generate_key
        }
        
        define_method(&amp;quot;#{field}=&amp;quot;) do |value|
          result = encrypt(field, value)
          pleasant_attributes[field][:encrypted] = result
        end

        define_method(field) do
          decrypt(
            field,
            pleasant_attributes[field][:encrypted]
          )
        end
      end
    end
  end

  def encrypt(field, value)
    options = pleasant_options[field]
    cipher = OpenSSL::Cipher.new(
      options[:algorithm]
    )
    
    options[:key] = send(options[:key_method])
    cipher.encrypt
    # ...
  end

  def decrypt(field, value)
    options = pleasant_options[field]
    # ...
  end
  # ...
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We’ll look at the full code later, but this highlights the major points. When &lt;code&gt;please_encrypt&lt;/code&gt; is called, it turns each symbol into a reader and a writer. Setting the field encrypts the value; getting the field decrypts it. Pleasant!&lt;/p&gt;
&lt;p&gt;You release a new version of your gem and a few weeks after release you start seeing issues open up!&lt;/p&gt;
&lt;p&gt;😥&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/2e63bce4-3922-4351-8644-16918633ce0c.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;😰&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/ddf874fe-c21a-463d-a12b-7808a78d7c3a.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;😱&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/b32e5d6b-c913-4c0e-b3fd-7079239004b9.jpeg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;You haven’t seen anything like this in your testing and can’t manage to reproduce it. What is going on?&lt;/p&gt;
&lt;p&gt;A user has reported the only way they can reliably reproduce it: running code using &lt;code&gt;please_encrypt&lt;/code&gt; inside Sidekiq jobs for 3-5 minutes straight.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class EncryptionFailureJob
  include Sidekiq::Job

  def perform
    m = Member.new
    m.phone = &amp;quot;123-456-7890&amp;quot;
    m.phone
  end
end

# sidekiq -c 10
loop do
  EncryptionFailureJob.perform_async
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The user can “fix” the error by forcing a retry in the code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def perform
  m = Member.new
  m.phone = &amp;quot;123-456-7890&amp;quot;
  m.phone
rescue OpenSSL::Cipher::CipherError
  puts &amp;quot;retrying...&amp;quot;
  retry
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You can use this code to reproduce the issue, and it always succeeds within one &lt;code&gt;retry&lt;/code&gt; 👀.&lt;/p&gt;
&lt;p&gt;Running this way, after a few minutes you usually see an &lt;code&gt;OpenSSL::Cipher::CipherError&lt;/code&gt;. Yay…&lt;/p&gt;
&lt;p&gt;But why does it take so long? Maybe the overhead of Sidekiq interacting with Redis and managing the job server is reducing load. You know how you can get some load - &lt;code&gt;run_forever&lt;/code&gt;!&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;run_forever do |iteration|
  m = Member.new
  m.phone = &amp;quot;123-456-7890&amp;quot;
  m.phone
rescue =&amp;gt; e
  puts &amp;quot;Iteration[#{iteration}] #{e.class}&amp;quot; 
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Doing that, you’re able to raise an &lt;code&gt;OpenSSL::Cipher::CipherError&lt;/code&gt; pretty quickly:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;Iteration[3591] OpenSSL::Cipher::CipherError
Iteration[8093] OpenSSL::Cipher::CipherError
Iteration[12229] OpenSSL::Cipher::CipherError
Iteration[13368] OpenSSL::Cipher::CipherError
Iteration[18023] OpenSSL::Cipher::CipherError
Iteration[38276] OpenSSL::Cipher::CipherError
Iteration[63144] OpenSSL::Cipher::CipherError
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;You look through recent changes. Luckily, in your case the most recent change you made was your attribute &lt;code&gt;dup&lt;/code&gt;, so you look deeper into that code. Is something wrong with the &lt;code&gt;||=&lt;/code&gt; maybe?&lt;/p&gt;
&lt;p&gt;Here’s the full source code. Can you find the issue?&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;⚠️ This is working code so you can try it to replicate the issue. It will properly encrypt and decrypt. But it is not meant for real use! Use real, vetted libraries for anything encryption/security related 🔐&lt;/p&gt;
&lt;/blockquote&gt;
&lt;pre&gt;&lt;code&gt;require &amp;quot;base64&amp;quot;
require &amp;quot;openssl&amp;quot;

module PleaseEncrypt
  def self.included(base)
    base.extend ClassMethods
  end
  
  module ClassMethods
    attr_accessor :encryptables

    def please_encrypt(*fields)
      self.encryptables = {}
      fields.each do |field|
        self.encryptables[field] = {
          algorithm: &amp;quot;AES-256-GCM&amp;quot;,
          key_method: :generate_key
        }
        
        define_method(&amp;quot;#{field}=&amp;quot;) do |value|
          result = encrypt(field, value)
          pleasant_attributes[field][:encrypted] = result
        end

        define_method(field) do
          decrypt(
            field,
            pleasant_attributes[field][:encrypted]
          )
        end
      end
    end
  end

  def encrypt(field, value)
    options = pleasant_options[field]
    cipher = OpenSSL::Cipher.new(
      options[:algorithm]
    )
    
    options[:key] = send(options[:key_method])
    cipher.encrypt
    options[:iv] = cipher.random_iv
    cipher.key = options[:key]
    encrypted = cipher.update(value) + cipher.final
    options[:auth_tag] = cipher.auth_tag
    Base64.encode64(encrypted)
  end

  def decrypt(field, value)
    options = pleasant_options[field]
    cipher = OpenSSL::Cipher.new(
      options[:algorithm]
    )
    
    data = Base64.decode64(value)
    cipher.decrypt
    cipher.iv = options[:iv]
    cipher.auth_tag = options[:auth_tag]
    cipher.key = options[:key]
    cipher.update(data) + cipher.final
  end

  def pleasant_options
    @pleasant_options ||= self.class.encryptables.dup
  end

  def pleasant_attributes
    @pleasant_attributes ||= hash_defaulter
  end

  def as_json
    pleasant_attributes.inject(hash_defaulter) do |hash, (k, v)|
      hash[k][:encrypted] = v[:encrypted]
      hash[k][:iv] = pleasant_options[k][:iv]
      hash[k][:auth_tag] = pleasant_options[k][:auth_tag]
      hash
    end
  end

  private

  def generate_key
    OpenSSL::Random.random_bytes(32)
  end

  def hash_defaulter
    Hash.new do |h, k|
      h[k] = {}
    end
  end
end

class Member
  include PleaseEncrypt
  please_encrypt :home_address, :phone
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You print the options and attributes, which is when you see the following:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;m = Member.new
m.phone = &amp;quot;123\n456\nok!&amp;quot;
m.home_address = &amp;quot;...&amp;quot;
puts m.pleasant_attributes
puts m.pleasant_options

{
  home_address: { ... },
  phone: { ... }
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Oh no… you’re &lt;code&gt;dup&lt;/code&gt;’ing the class-level data (good!), but because &lt;code&gt;dup&lt;/code&gt; only performs a shallow copy, you are sharing nested objects (bad!).&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;members = [
  Member.new,
  Member.new,
  Member.new
]
puts &#39;options id&#39;.ljust(12) +
     &#39; | home_address id&#39;.ljust(18) +
     &#39; | phone id&#39;.ljust(12)
puts &#39;-----------------------------------------&#39;

members.each do |member|
  opts = member.pleasant_options
  puts opts.object_id.to_s.ljust(15) +
       opts[:home_address].object_id.to_s.ljust(18) +
       opts[:phone].object_id.to_s.ljust(12)
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This means that per class, per attribute, every thread is modifying the same hash 😩, even across what &lt;em&gt;seemed&lt;/em&gt; like independent instances. When we print the &lt;code&gt;object_id&lt;/code&gt; on each of the hashes, we see that the &lt;em&gt;top-level&lt;/em&gt; hash is unique, but the nested hashes are all the same object.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-text&#34; data-lang=&#34;text&#34;&gt;options id   | home_address id | phone id 
-----------------------------------------
2000           1920              1960        
2020           1920              1960        
2040           1920              1960 
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;But why does it take high load to break? You have your suspicions…&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/8p7giv.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Threading issues in CRuby can be ticking time-bombs. The GVL blocks parallel execution of Ruby code, so it takes more effort to force the right timing for unsafe context switches. If you see code that seems like it’s not thread safe but “works” - assume it will eventually break.&lt;/p&gt;
&lt;p&gt;The fix is frustratingly simple:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def pleasant_options
  @pleasant_options ||= begin
    duplicate = {}
    self.class.encryptables.each do |k, v|
      duplicate[k] = v.dup
    end
    duplicate
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now you are duplicating the nested hashes as well. With this change, you no longer share any data between the class and the instances. Rerunning the &lt;code&gt;run_forever&lt;/code&gt; example there are no more errors 🙌🏼.&lt;/p&gt;
&lt;p&gt;This does leave you susceptible to deeply nested hashes being shared. If you’re using ActiveSupport or Rails, I’d recommend using &lt;code&gt;deep_dup&lt;/code&gt;, which handles nested hashes. If not, you could write a recursive method to handle it, or check for a gem (there seem to be many).&lt;/p&gt;
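&lt;p&gt;If you did want to hand-roll it, a recursive copy is only a few lines. A rough sketch, handling just hashes and arrays (&lt;code&gt;deep_dup&lt;/code&gt; covers more cases):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def deep_dup(object)
  case object
  when Hash
    object.each_with_object({}) do |(key, value), copy|
      copy[key] = deep_dup(value)
    end
  when Array
    object.map { |element| deep_dup(element) }
  else
    # On Ruby 3.x, dup is safe even for integers, symbols and nil
    object.dup
  end
end

def pleasant_options
  @pleasant_options ||= deep_dup(self.class.encryptables)
end
&lt;/code&gt;&lt;/pre&gt;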
&lt;h4 id=&#34;back-in-reality-two&#34;&gt;Back in reality... part two&lt;/h4&gt;
&lt;p&gt;If this whole scenario seemed oddly specific, it’s because it is based on a real gem issue. This exact problem remained an open issue for years in the &lt;a href=&#34;https://github.com/attr-encrypted/attr_encrypted&#34;&gt;&lt;code&gt;attr_encrypted&lt;/code&gt;&lt;/a&gt;&lt;sup id=&#34;fnref:12&#34;&gt;&lt;a href=&#34;#fn:12&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;12&lt;/a&gt;&lt;/sup&gt; gem without anyone identifying the issue. All of the GitHub issue images are from real issues - running in Sidekiq and usually under load they saw intermittent encryption errors.&lt;/p&gt;
&lt;p&gt;How could this have been avoided? The simplest answer will sound familiar: don’t share class-level state. Even though in this case the sharing was unintentional, the macro metadata could have been read separately rather than mixed into the instance state:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def encrypt(field, value)
  options = pleasant_options[field]
  cipher = OpenSSL::Cipher.new(
    options[:algorithm]
  )
    
  options[:key] = send(self.class.encryptables[field][:key_method])
  # ...
end

def decrypt(field, value)
  options = pleasant_options[field]
  cipher = OpenSSL::Cipher.new(
    self.class.encryptables[field][:algorithm]
  )
    
  # ...
end

def pleasant_options
  @pleasant_options ||= hash_defaulter # a plain {} would return nil for unseen fields
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is easy to see in hindsight but is very easy to overlook in a larger code base.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;attr_encrypted&lt;/code&gt; is not Rails specific, technically. But if you’re in the Rails world using ActiveRecord you should &lt;a href=&#34;https://guides.rubyonrails.org/active_record_encryption.html&#34;&gt;use the built-in encryption&lt;/a&gt; instead.&lt;/p&gt;
&lt;h3 id=&#34;cleaning-up&#34;&gt;Cleaning up thread state&lt;/h3&gt;
&lt;p&gt;😮‍💨&lt;/p&gt;
&lt;p&gt;Now you’re thinking “is there a way I can avoid all of these headaches?”. You’re so tired of these class level ivars and their surprising behaviors. You’ve heard about something called “thread-locals” that sound really promising. They stay silo’d to your thread and can even act “global”&lt;sup id=&#34;fnref:13&#34;&gt;&lt;a href=&#34;#fn:13&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;13&lt;/a&gt;&lt;/sup&gt; without any chance of sharing&lt;sup id=&#34;fnref:14&#34;&gt;&lt;a href=&#34;#fn:14&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;14&lt;/a&gt;&lt;/sup&gt; 😍.&lt;/p&gt;
&lt;p&gt;As a test, you use them on the Fibonacci example from earlier and can’t reproduce any issues 😌. Within the thread it’s global, but each thread has its own local state, isolated from the others.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class Fibonacci
  class &amp;lt;&amp;lt; self
    def result
      Thread.current[:fib_result]
    end

    def result=(value)
      Thread.current[:fib_result] = value
    end

    def calculate(n)
      self.result = Result.new
      self.result.value = fib(n)
    end

    def fib(n)
      return n if n &amp;lt;= 1
      fib(n - 1) + fib(n - 2)
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/9bebfe3546.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;No class level ivars ✅ , no &lt;code&gt;dup&lt;/code&gt;ing ✅. Feeling confident that it’s a good approach when you need something global, you put this solution in your mental back pocket.&lt;/p&gt;
&lt;p&gt;Now you’re writing a new application - it’s a Rails app for sharing articles (because we don’t have enough of those), with a &lt;code&gt;User&lt;/code&gt; and &lt;code&gt;Article&lt;/code&gt; model:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class User &amp;lt; ApplicationRecord
  has_many :articles
end

class Article &amp;lt; ApplicationRecord
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’s a typical CRUD web application, and a couple requirements come up:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Track the origin of a change. If an article is added, updated or deleted, a record is created to keep track of who did it and when.&lt;/li&gt;
&lt;li&gt;Add some I18N. You’ve got Japanese and Polish users going wild for this highly original site.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;There’s so many places you could need to access this information, you decide to store it in a thread-safe, global way using thread-locals 🙌🏼.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  class &amp;lt;&amp;lt; self
    def user
      Thread.current[:app_user]
    end

    def user=(user)
      Thread.current[:app_user] = user
      self.locale = user.locale
    end

    def locale
      Thread.current[:app_locale]
    end

    def locale=(locale)
      Thread.current[:app_locale] = locale
    end
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 Not to spoil the ending, but ultimately I will recommend you avoid directly using &lt;code&gt;Thread.current[]&lt;/code&gt; in favor of other solutions, particularly since we’ll find later that it being “thread-local” is not really accurate.&lt;/p&gt;
&lt;p&gt;Just in case you get so excited at the idea of &lt;code&gt;Thread.current[]&lt;/code&gt; you start feverishly writing code using it and never finish reading part 2 🏃‍♂️&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;You use the thread-local data to pass the user into the Change we save.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# app/controllers/application_controller.rb
class ApplicationController
  def set_user
    AppContext.user = User.find(cookies.encrypted[&amp;quot;user_id&amp;quot;])
  end
end

# app/controllers/articles_controller.rb
class ArticlesController &amp;lt; ApplicationController
  before_action :set_user

  def create
    Article.create!(article_params)
  end
end

# app/models/change.rb
class Change &amp;lt; ApplicationRecord
  belongs_to :trackable, polymorphic: true
  belongs_to :user
end

# app/models/article.rb
class Article
  include Trackable
end

# app/models/concerns/trackable.rb
module Trackable
  extend ActiveSupport::Concern

  included do
    before_save :track_changes
    before_destroy :track_destroy
    has_many :changes, as: :trackable
  end

  private

  def track_changes
    action = if new_record?
      &amp;quot;create&amp;quot;
    else
      &amp;quot;update&amp;quot;
    end

    track_change(action)
  end

  def track_destroy
    track_change(&amp;quot;delete&amp;quot;)
  end

  def track_change(action)
    Change.create(
      trackable: self,
      action:,
      changes: attributes,
      user: AppContext.user
    )
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This seems to work really well. Well enough that you decide to expose a change history to your users so they can see edits that have been made to an article, and by who.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ArticlesController &amp;lt; ApplicationController
  # ...

  def show
    @article = Article.find(params[:id])
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-erb&#34; data-lang=&#34;erb&#34;&gt;&amp;lt;!-- app/views/articles/show.html.erb --&amp;gt;
	
&amp;lt;h1&amp;gt;&amp;lt;%= t(&#39;articles.changes.title&#39;) %&amp;gt;&amp;lt;/h1&amp;gt;
	
&amp;lt;table&amp;gt;
  &amp;lt;thead&amp;gt;
    &amp;lt;tr&amp;gt;
      &amp;lt;th&amp;gt;&amp;lt;%= t(&#39;articles.changes.headers.article_id&#39;) %&amp;gt;&amp;lt;/th&amp;gt;
      &amp;lt;th&amp;gt;&amp;lt;%= t(&#39;articles.changes.headers.action&#39;) %&amp;gt;&amp;lt;/th&amp;gt;
      &amp;lt;th&amp;gt;&amp;lt;%= t(&#39;articles.changes.headers.changes&#39;) %&amp;gt;&amp;lt;/th&amp;gt;
      &amp;lt;th&amp;gt;&amp;lt;%= t(&#39;articles.changes.headers.user&#39;) %&amp;gt;&amp;lt;/th&amp;gt;
      &amp;lt;th&amp;gt;&amp;lt;%= t(&#39;articles.changes.headers.changed_at&#39;) %&amp;gt;&amp;lt;/th&amp;gt;
    &amp;lt;/tr&amp;gt;
  &amp;lt;/thead&amp;gt;
  &amp;lt;tbody&amp;gt;
    &amp;lt;% @article.changes.each do |change| %&amp;gt;
      &amp;lt;tr&amp;gt;
        &amp;lt;td&amp;gt;&amp;lt;%= change.trackable_id %&amp;gt;&amp;lt;/td&amp;gt;
        &amp;lt;td&amp;gt;&amp;lt;%= t(&amp;quot;articles.changes.actions.#{change.action}&amp;quot;) %&amp;gt;&amp;lt;/td&amp;gt;
        &amp;lt;td&amp;gt;
          &amp;lt;ul&amp;gt;
            &amp;lt;% change.changes.each do |attribute, value| %&amp;gt;
              &amp;lt;li&amp;gt;&amp;lt;%= t(&amp;quot;articles.attributes.#{attribute}&amp;quot;) %&amp;gt;: &amp;lt;%= value %&amp;gt;&amp;lt;/li&amp;gt;
            &amp;lt;% end %&amp;gt;
          &amp;lt;/ul&amp;gt;
        &amp;lt;/td&amp;gt;
        &amp;lt;td&amp;gt;&amp;lt;%= change.user.email %&amp;gt;&amp;lt;/td&amp;gt;
        &amp;lt;td&amp;gt;&amp;lt;%= l(change.created_at, format: :long) %&amp;gt;&amp;lt;/td&amp;gt;
      &amp;lt;/tr&amp;gt;
    &amp;lt;% end %&amp;gt;
  &amp;lt;/tbody&amp;gt;
&amp;lt;/table&amp;gt;
&lt;/code&gt;&lt;/pre&gt;&lt;p&gt;Things are going smoothly, but what good is posting articles without being able to discuss them?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;# app/models/discussion.rb
class Discussion &amp;lt; ApplicationRecord
  include Trackable
end

# app/controllers/discussions_controller.rb
class DiscussionsController &amp;lt; ApplicationController
  def create
    Discussion.create!(discussion_params)
  end

  def show
    @discussion = Discussion.find(params[:id])
  end
end

# assume a near identical &amp;quot;changes&amp;quot; view...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You test this out and it works on your machine 😬. Guess it’s time to deploy 🤷🏻‍♂️.&lt;/p&gt;
&lt;p&gt;Except… once you deploy these changes, you start receiving some weird reports from users. Users will sometimes see discussions render in a totally random language. Even worse, some changes are showing up associated with unrecognized users!&lt;/p&gt;
&lt;p&gt;Co się dzieje?!&lt;/p&gt;
&lt;p&gt;何が起こっているの？！&lt;/p&gt;
&lt;p&gt;What is happening?!&lt;/p&gt;
&lt;p&gt;You analyze your database and logs, and notice something strange:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The changes are only ever wrong for the &lt;code&gt;Discussion&lt;/code&gt; model&lt;/li&gt;
&lt;li&gt;The internationalizations are only ever wrong in the discussions views&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;What’s the difference between &lt;code&gt;ArticlesController&lt;/code&gt; and &lt;code&gt;DiscussionsController&lt;/code&gt;?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ArticlesController &amp;lt; ApplicationController
  before_action :set_user

  def create
    Article.create!(article_params)
  end
end

class DiscussionsController &amp;lt; ApplicationController
  def create
    Discussion.create!(discussion_params)
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;Oh&lt;/em&gt; simple, there it is, you’re not setting &lt;code&gt;AppContext.user&lt;/code&gt; in the discussions controller. But if it’s not getting set then why is it a different user and locale? Why is it set to anything at all?&lt;/p&gt;
&lt;p&gt;You check the docs for thread-locals only to find…&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;They exist for the lifetime of the thread&lt;/li&gt;
&lt;li&gt;You know that everything in Ruby runs in a thread&lt;/li&gt;
&lt;li&gt;On multi-threaded servers like Puma, threads are reused between requests so they can leak data you don’t clear out&lt;/li&gt;
&lt;li&gt;Even on single-threaded servers like Unicorn, you still run within the Ruby “main” thread (&lt;code&gt;Thread.main&lt;/code&gt;). So you can even leak data you don’t clear out in single-threaded servers 🫠&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/c3ff12657f.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Well &lt;em&gt;that&lt;/em&gt; was pretty bad. Whenever a user went to the &lt;code&gt;DiscussionsController&lt;/code&gt;, it would use information from &lt;em&gt;whichever user used that thread last&lt;/em&gt;. It might be the same user, but it could just as easily be someone else.&lt;/p&gt;
&lt;p&gt;You make sure the context gets set in every controller now, but also cleared out after every request too.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AppContext
  # ...
  def self.clear
    [:app_user, :app_locale].each do |key|
      Thread.current[key] = nil
    end
  end
end

class ApplicationController
  after_action :clear_app_context

  def clear_app_context
    AppContext.clear
  end
end

class DiscussionsController &amp;lt; ApplicationController
  before_action :set_user
  # ...
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now if someone forgets to add it to their controller, it won’t be set to any user at all - at the end of every request the thread-local is cleared out so it won’t be present&lt;sup id=&#34;fnref:15&#34;&gt;&lt;a href=&#34;#fn:15&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;15&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;You deploy the change and &lt;em&gt;almost&lt;/em&gt; everything is good.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Locales are good! No more Polish for English users, or Japanese for Polish users.&lt;/li&gt;
&lt;li&gt;When someone posts to a discussion, it consistently shows the right user in the change history 😮‍💨&lt;/li&gt;
&lt;li&gt;&lt;em&gt;But&lt;/em&gt;, there’s still an intermittent change showing up as the wrong user. What other code is there?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The controllers are good. How are the background jobs?&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class FirstTimePostingJob &amp;lt; ApplicationJob
  def perform(user)
    AppContext.user = user
    Congrats.new(user).call
  end
end

class ImageUploadJob &amp;lt; ApplicationJob
  def perform(article, file_url)
    # Uploads to a service 
    article.upload_from_url(file_url)
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You’re setting the user in one job, and picking it up in another 🤦🏻‍♂️. When the article is updated to add an image using the &lt;code&gt;file_url&lt;/code&gt;, sometimes the change’s user is nil, and sometimes it’s a totally random user. Hardly ever is it the actual user that added the image 😩.&lt;/p&gt;
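&lt;p&gt;Part of the fix is structural: don’t rely on ambient state crossing job boundaries at all. If a job needs the user, pass it in as an argument and set the context explicitly at the top - a sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ImageUploadJob &amp;lt; ApplicationJob
  def perform(article, file_url, user)
    # Set explicitly from the argument, never inherited from a previous job
    AppContext.user = user
    article.upload_from_url(file_url)
  end
end
&lt;/code&gt;&lt;/pre&gt;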
&lt;p&gt;Even so, what you did for controllers you can’t forget to do in jobs. You need to clean up the context the same way (after each job finishes), but the logic goes in a new place.&lt;/p&gt;
&lt;h4 id=&#34;using-sidekiq&#34;&gt;If you&#39;re using Sidekiq&lt;/h4&gt;
&lt;p&gt;If you’re using a mixture of Sidekiq and ActiveJob, or Sidekiq alone, a &lt;a href=&#34;https://github.com/sidekiq/sidekiq/wiki/Middleware&#34;&gt;Sidekiq Middleware&lt;/a&gt; is your best option. It will work for both job types because it is called internally by the Sidekiq server when performing a job.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ContextClearMiddleware
  include Sidekiq::ServerMiddleware

  def call(_worker, _job, _queue)
    yield
  ensure
    AppContext.clear
  end
end

Sidekiq.configure_server do |config|
  config.server_middleware do |chain|
    chain.add ContextClearMiddleware
  end
end
&lt;/code&gt;&lt;/pre&gt;
&lt;h4 id=&#34;using-activejob&#34;&gt;If you&#39;re using ActiveJob&lt;/h4&gt;
&lt;p&gt;If you exclusively use ActiveJob, then an &lt;code&gt;after_perform&lt;/code&gt; in your base job class should work just as well, and will work even if you’re using SolidQueue, GoodJob, or any other job adapter that hooks into ActiveJob.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ApplicationJob &amp;lt; ActiveJob::Base
  after_perform :clear_app_context

  def clear_app_context
    AppContext.clear
  end
end
&lt;/code&gt;&lt;/pre&gt;
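&lt;p&gt;One caveat with &lt;code&gt;after_perform&lt;/code&gt;: if the job raises, ActiveJob skips the after callbacks, so a failing job can still leak its context into the next job on that thread. If that matters for your app, an &lt;code&gt;around_perform&lt;/code&gt; with an &lt;code&gt;ensure&lt;/code&gt; mirrors the Sidekiq middleware above - a sketch:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class ApplicationJob &amp;lt; ActiveJob::Base
  around_perform do |_job, block|
    block.call
  ensure
    # Runs whether the job succeeded or raised
    AppContext.clear
  end
end
&lt;/code&gt;&lt;/pre&gt;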
&lt;h3 id=&#34;next-up&#34;&gt;Next Up&lt;/h3&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/1d541d2e-2d8c-4380-b166-a2c720da072a.jpeg&#34; alt=&#34;nested ruby concurrency&#34; style=&#34;height: 50%; width: 50%&#34;/&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;source&lt;/strong&gt;: jpcamara.com&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This is the first step into this Ruby concurrency series. In Part 2 we’ll dig into a few more topics: true thread locals + fibers, the layered Ruby concurrency model, reusing objects, some more complex threading coordination, and ideas for keeping your code and your dependencies thread safe. More soon 👋🏼!&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It doesn’t have to use threads by default, but there’s a good chance you are configured for &lt;em&gt;some&lt;/em&gt;&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;None of these servers force you to use multiple threads, but will usually use them by default. Puma’s OOTB configuration in Rails uses 3 threads, for instance. &lt;a href=&#34;https://github.com/rails/rails/blob/main/railties/lib/rails/generators/rails/app/templates/config/puma.rb.tt&#34;&gt;https://github.com/rails/rails/blob/main/railties/lib/rails/generators/rails/app/templates/config/puma.rb.tt&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;They use some threads internally as well even if you don’t serve requests using more than 1 thread, possibly with the exception of Falcon, since its core is the FiberScheduler.&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;You also don’t &lt;em&gt;have&lt;/em&gt; to use these job queues with threads either. But they all use them by default, or at least have internal threaded elements.&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;You shouldn’t though.&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html&#34;&gt;http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/&#34;&gt;https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;But if you’re using net-http, you’re using the timeout gem and have no choice.&lt;/p&gt;
&lt;p&gt;There was an attempt to remove timeout from net/http, but it was reverted: &lt;a href=&#34;https://github.com/ruby/ruby/commit/f88bff770578583a708093f4a0d8b1483a1d2039&#34;&gt;https://github.com/ruby/ruby/commit/f88bff770578583a708093f4a0d8b1483a1d2039&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;It IS safe to use when used inside a fiber scheduler like the async gem, because it swaps out the thread for a fiber timeout&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It’s ok to do once, up front, or controlled by mutexes. We’ll discuss that more in a bit.&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Sometimes you bake them into a cake for your friends 🎂. You just really love tempfiles 🥹.&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;By “macros” I mean things like column encryption in Rails using &lt;code&gt;encrypts&lt;/code&gt;. Internally it uses a class level instance variable called &lt;code&gt;encrypted_attributes&lt;/code&gt; to track columns you are encrypting on that model.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;But&lt;/em&gt; it is only set once when the class is loaded. If the class is reloaded, it must be locked by a mutex to safely change that value - Rails has a “reloader” that handles this, for instance.&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:8&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Naive because this is a terribly inefficient way to calculate Fibonacci sequences, which is why it’s a great example for demonstrating how CPU intensive work causes CRuby context switching&amp;#160;&lt;a href=&#34;#fnref:8&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:9&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;And most other languages&amp;#160;&lt;a href=&#34;#fnref:9&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:10&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Ruby 3.3 introduced the M:N thread scheduler, so there was a ton of refactoring of pthread internals. It may have been more impactful to threading than other versions - but every minor version is a year long large-scale effort that can easily have an impact on these private, internal behaviors&amp;#160;&lt;a href=&#34;#fnref:10&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:11&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This assumes you’re only loading classes once or safely handle reloads across threads (like Rails does)&amp;#160;&lt;a href=&#34;#fnref:11&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:12&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The reproduction steps are even a modified form of one of the issues &lt;a href=&#34;https://github.com/attr-encrypted/attr_encrypted/issues/372&#34;&gt;https://github.com/attr-encrypted/attr_encrypted/issues/372&lt;/a&gt; and here is the code that fixed the issue &lt;a href=&#34;https://github.com/attr-encrypted/attr_encrypted/pull/320/files&#34;&gt;https://github.com/attr-encrypted/attr_encrypted/pull/320/files&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:12&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:13&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I’ll leave it to others to discuss the merits or pitfalls of using any globals at all, thread-local or otherwise 🤷🏻‍♂️&amp;#160;&lt;a href=&#34;#fnref:13&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:14&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;They’re totally susceptible to the copying state issue we just discussed but at least you’re aware of it now&amp;#160;&lt;a href=&#34;#fnref:14&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:15&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This could be achieved in a rack middleware as well, as opposed to an &lt;code&gt;after_action&lt;/code&gt;&amp;#160;&lt;a href=&#34;#fnref:15&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2024/image-5-13-24-7-52pm.png)

&gt; 👋🏼 This is a series on concurrency, parallelism and asynchronous programming in Ruby. It’s a deep dive, so it’s divided into several parts:
&gt; 
&gt; - Your Ruby programs are always multi-threaded: Part 1
&gt; - [Your Ruby programs are always multi-threaded: Part 2](https://jpcamara.com/2024/06/23/your-ruby-programs.html)
&gt;   - [Consistent, request-local state](https://jpcamara.com/2024/06/27/consistent-requestlocal-state.html)
&gt; - [Ruby methods are colorless](https://jpcamara.com/2024/07/15/ruby-methods-are.html)
&gt; - [The Thread API](https://jpcamara.com/2024/08/26/the-thread-api.html)
&gt; - [Bitmasks, Ruby Threads and Interrupts, oh my!](https://jpcamara.com/2025/10/22/bitmasks-threads-and-interrupts-concurrent.html)
&gt; - [When good threads go bad](https://jpcamara.com/2025/12/30/when-good-threads-go-bad.html)
&gt; - Thread and its MaNy friends
&gt; - Fibers
&gt; - Processes, Ractors and alternative runtimes
&gt; - Scaling concurrency with streaming
&gt; - Abstracted, concurrent Ruby
&gt; - Closing thoughts, kicking the tires and tangents
&gt; - How I dive into CRuby concurrency
&gt; 
&gt; You’re reading “Your Ruby programs are always multi-threaded: Part 1”. I’ll update the links as each part is released, and include these links in each post.

- _**Part 1**_
- [It’s all threaded](#its-all-threaded)
- [Ok but how threaded is it _really_](#how-threaded-is-it)
- [Threading mistakes](#threading-mistakes)
	- 1. [Sharing module/class instance variables](#sharing-ivars)
		- [Heisenbugs](#heisenbugs)
		- [Ruby internals](#ruby-internals)
		- [Back in reality…](#back-in-reality)
	- 2. [Copying state to instances](#copying-state)
		- [Back in reality… part two](#back-in-reality-two)
	- 3a. [Cleaning up thread state](#cleaning-up)
		- [If you’re using Sidekiq](#using-sidekiq)
		- [If you’re using ActiveJob](#using-activejob)
- _**Part 2**_
	- 3b. [Sharing state with fibers](https://jpcamara.com/2024/06/23/your-ruby-programs.html#sharing-state-with-fibers)
		- [A wild Fiber appeared!](https://jpcamara.com/2024/06/23/your-ruby-programs.html#wild-fiber) 🌾 
		- [Ruby concurrency has layers](https://jpcamara.com/2024/06/23/your-ruby-programs.html#ruby-concurrency-layers)
		- [Actual thread-locals](https://jpcamara.com/2024/06/23/your-ruby-programs.html#actual-thread-locals)
		- [CurrentAttributes](https://jpcamara.com/2024/06/23/your-ruby-programs.html#current-attributes)
		- [RequestStore](https://jpcamara.com/2024/06/23/your-ruby-programs.html#request-store)
		- [Back in reality… part three](https://jpcamara.com/2024/06/23/your-ruby-programs.html#back-in-reality-three)
	- 4. [Reusing objects](https://jpcamara.com/2024/06/23/your-ruby-programs.html#reusing-objects)
	- 5. [Race conditions](https://jpcamara.com/2024/06/23/your-ruby-programs.html#race-conditions)
		- [Threading concepts](https://jpcamara.com/2024/06/23/your-ruby-programs.html#threading-concepts)
		- [Memoization / check-then-act](https://jpcamara.com/2024/06/23/your-ruby-programs.html#check-then-act)
		- [read-modify-write](https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write)
		- [Coordinating jobs](https://jpcamara.com/2024/06/23/your-ruby-programs.html#coordinating-jobs)
		- [read-modify-write shows up in many ways](https://jpcamara.com/2024/06/23/your-ruby-programs.html#read-modify-write-for-days)
- [Parting thoughts](https://jpcamara.com/2024/06/23/your-ruby-programs.html#parting-thoughts)
- [Tips for auditing gems](https://jpcamara.com/2024/06/23/your-ruby-programs.html#tips-for-gems) 💎 
- [I use Falcon - Fibers are safe from this right?](https://jpcamara.com/2024/06/23/your-ruby-programs.html#using-falcon)
- [I use JRuby/TruffleRuby…](https://jpcamara.com/2024/06/23/your-ruby-programs.html#using-jruby-truffle)
- [Preforking / Reforking servers](https://jpcamara.com/2024/06/23/your-ruby-programs.html#reforking)
- [What about using mutexes for globals?](https://jpcamara.com/2024/06/23/your-ruby-programs.html#mutexes-for-globals)
- [It’s not doom and gloom, it’s detect and correct](https://jpcamara.com/2024/06/23/your-ruby-programs.html#detect-and-correct)
- [Takeaways](https://jpcamara.com/2024/06/23/your-ruby-programs.html#takeaways) 🥡

&lt;h2 id=&#34;its-all-threaded&#34;&gt;It&#39;s all threaded&lt;/h2&gt;

If you run a web server using [Puma](https://github.com/puma/puma), [Falcon](https://github.com/socketry/falcon)[^1], [Webrick](https://github.com/ruby/webrick), or [Agoo](https://github.com/ohler55/agoo)[^2], you use threads.

If you run background jobs using [Sidekiq](https://github.com/sidekiq/sidekiq), [GoodJob](https://github.com/bensheldon/good_job), [SolidQueue](https://github.com/rails/solid_queue)[^3], or [Que](https://github.com/que-rb/que), you use threads.

If you use [ActiveRecord](https://github.com/rails/rails), or [Rails](https://github.com/rails/rails), you use threads.

If you use the [NewRelic](https://github.com/newrelic/newrelic-ruby-agent), [DataDog](https://github.com/DataDog/dd-trace-rb), [Sentry](https://github.com/getsentry/sentry-ruby), [LaunchDarkly](https://github.com/launchdarkly/ruby-server-sdk), or [Honeybadger](https://github.com/honeybadger-io/honeybadger-ruby) gems, you use threads.

If you use [net/http](https://github.com/ruby/net-http) (or any gem internally using it like [HTTParty](https://github.com/jnunemaker/httparty) or [RestClient](https://github.com/rest-client/rest-client)), [httpx](https://gitlab.com/os85/httpx), or the [Timeout](https://github.com/ruby/timeout)[^4] gem, you use threads 🧵.

And because of the above, even if you use a forked, single-threaded server like [Pitchfork](https://github.com/Shopify/pitchfork) or [Unicorn](https://github.com/defunkt/unicorn), or run [Puma](https://github.com/puma/puma) in worker mode with no additional threads, or run jobs with [Resque](https://github.com/resque/resque), or just use a basic Ruby rake task, you are using threads!

The point isn’t to be exhaustive. There are plenty more gems which use threads internally - you’re probably using some I haven’t listed. I’m also intentionally avoiding nuance - there are varying levels of threaded-ness.

The point _is_ to demonstrate that threads are **everywhere** in Ruby, even if you are not working with them explicitly. Always write your code to be as thread safe as possible. Act as if you’re in highly threaded code and you’ll live a much happier life.

&lt;h2 id=&#34;how-threaded-is-it&#34;&gt;Ok but how threaded is it &lt;em&gt;really&lt;/em&gt;&lt;/h2&gt;

The degree of thread safety concern you need to have _does_ vary depending on how threaded your Ruby environment is. 

- Sidekiq, Puma, SolidQueue, or GoodJob: very threaded. If there are threading bugs in code you use they will come up, eventually.
- Falcon: Can be configured to be threaded. If it is, threading bugs will come up eventually. By default it only uses Fibers, which can still be susceptible to some concurrency issues.
- Pitchfork, Unicorn or Resque: not threaded, but you’re probably using threaded gems or writing threaded code. I’ll discuss later how these gems can still hit threading issues on process based servers, and other concurrency adjacent issues you might encounter.
- A one-off Ruby script with no gems: you still run inside a thread, but if you’re just processing files sequentially, it’s practically not threaded. You’re probably fine, but YMMV 😬 

It’s not just about writing explicitly threaded code that is safe to run. If you’re using `Thread.new`, you know you’re working with threaded code, and you know you need to proceed carefully. If you don’t know that, be careful! Threading has many sharp edges - it’s easy to get wrong. 

Threads introduce non-determinism into your code - what works on one execution may not work on the next, even with identical inputs. It can lull you into a false sense of security, because that failure state may not appear for a long time. Writing safe threaded code takes careful analysis and planning. 

![](https://cdn.uploads.micro.blog/98548/2024/24c9850ebd.png)

&gt; **source**: Eli and JP Camara, [https://x.com/logicalcomic](https://x.com/logicalcomic)

Writing safe threaded code can be difficult, but writing safe non-threaded Ruby code in a threaded environment doesn’t have to be. There are some top mistakes I’ve seen (or personally made) to keep an eye out for in your own code, and code in gems you bring into your projects. Let’s take a look at them, and we’ll learn some core Ruby/CRuby principles along the way as well.

&lt;h2 id=&#34;threading-mistakes&#34;&gt;Threading mistakes&lt;/h2&gt;

&lt;h3 id=&#34;sharing-ivars&#34;&gt;Sharing module/class instance variables&lt;/h3&gt;

&gt; 📝 For brevity, I’ll be referring to instance variables as “ivars”

If there’s a classic example of what _not_ to do for thread safety, it&#39;s maintaining and modifying class-level ivars[^5].

Not knowing that, you embark on implementing some file processing code. Finding yourself in need of code that splits a file into chunks by newline, you write the following:

	class FileSplitter
	  def self.split(file_path, max_lines)
	    lines = []
	    files = []
	    File.open(file_path) do |file|
	      file.each_line do |line|
	        lines &lt;&lt; line
	        if lines.size &gt; max_lines
	          output = Tempfile.new
	          lines.each do |line|
	            output.write(line)
	          end
	          lines = []
	          files &lt;&lt; output
	        end
	      end
	
	      # handle any remaining lines
	      if lines.size &gt; 0
	        output = Tempfile.new
	        lines.each do |line|
	          output.write(line)
	        end
	        files &lt;&lt; output
	      end
	
	      files
	    end
	  end
	end

The `split` method takes a file path, a number of max lines per file, and returns an array of Tempfiles. Each Tempfile contains a chunk of the original file. In some cases you upload each chunk to a file service like S3, in other cases you hand them off to methods for further processing[^6].
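
Usage looks something like this - the file name and the upload step here are stand-ins:

	chunks = FileSplitter.split(&#34;big_export.csv&#34;, 10_000)
	chunks.each do |tempfile|
	  tempfile.rewind # move back to the start so the chunk can be read
	  # e.g. upload the chunk to S3, or hand it off for further processing
	end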

The method is kind of hefty, and there’s some clear duplication so you decide to refactor.

![](https://cdn.uploads.micro.blog/98548/2024/6d3eb93e-bff9-485f-b91f-1c929ca1ba77.jpeg)

&gt; **source**: [https://theycantalk.com/post/710363232083361793/plans](https://theycantalk.com/post/710363232083361793/plans)

You decide the logical split in the code is around writing to the temp files, a new method you’ll call `write_lines`. There are so many variables that it just seems easier to use ivars instead of passing everything around.

	class FileSplitter
	  def self.split(file_path, max_lines)
	    @lines = []
	    @files = []
	    File.open(file_path) do |file|
	      file.each_line do |line|
	        @lines &lt;&lt; line
	        write_lines(max_lines)
	      end
	      # force remaining to write
	      write_lines
	      @files
	    end
	  end
	
	  def self.write_lines(max = nil)
	    return if @lines.empty?
	
	    if max.nil? || @lines.size &gt; max
	      @output = Tempfile.new
	      @lines.each do |line|
	        @output.write(line)
	      end
	      @lines = []
	      @files &lt;&lt; @output
	    end
	  end
	end

Since the method is at the class level, the ivars are class level as well. For any code calling these methods, they only exist once. Every thread shares the same class object.
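
To make “every thread shares the same class object” concrete, here’s a throwaway sketch (names made up):

	class Shared
	  class &lt;&lt; self
	    attr_accessor :value
	  end
	end
	
	Shared.value = :from_main
	Thread.new { Shared.value = :from_thread }.join
	Shared.value # =&gt; :from_thread - there is only one @value, and it lives on the Shared class object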

&gt; 📝 If this doesn’t _look_ like class-level ivars, you may be thinking of the double at-sign syntax, like `@@lines`. Those are a different construct - class *variables* - which are shared down the entire inheritance hierarchy. A single at-sign ivar inside a class method belongs to the class object itself: inside a class method, `self` is the class, so the class is the instance holding the ivar. See the [ruby lang docs](https://www.ruby-lang.org/en/documentation/faq/8/) for more on the differences.
&gt; 
&gt; Another format of this would be a class level `attr_accessor` defined on the singleton class (`class &lt;&lt; self`). So if you see a class method with `self.lines = []`, or code like `FileSplitter.lines = []`, these can _also_ be backed by class-level ivars!

You are running this code in a background job using Sidekiq, running with multiple threads on CRuby 3.3. Most of the time the job runs fine, but you’ve received a few reports from people missing data, and some people even seeing data that wasn’t in their files! Uh oh, what is happening?!

The problem is that every thread is sharing the same data, and modifying it concurrently. If multiple jobs are running and calling `FileSplitter.split`, they are all trying to write to the same piece of memory. We can follow along with the code line by line to see where things break down:

	### Thread 1
	# FileSplitter.split(path, max) -&gt;
	@lines = []
	@files = []
	# ...
	@output = Tempfile.new
	@output.write(line) # `write` causes the thread to sleep so Thread 2 starts
	
	### Thread 2
	# Calling `split` resets the instance variables
	# Now each thread is using the same variables!
	# FileSplitter.split(path, max) -&gt;
	@lines = []
	@files = []
	# ...
	# Next time it wakes up, Thread 1 will be appending to the same array as Thread 2 💀 
	@lines &lt;&lt; line
	# ...
	# Thread 1&#39;s Tempfile is overwritten and lost, along with anything written to it 👋🏼
	@output = Tempfile.new
	@output.write(line) # Now `write` causes Thread 2 to sleep
	
	### Thread 1 wakes up and continues processing
	# The next line from Thread 1 gets written to the output file from Thread 2
	# We are now 100% off the rails...
	# Pour one out for our mixed-and-matched
	# user data ☠️☠️☠️
	@lines.each do |line|
	  @output.write(line)
	end

Or the same flow, visualized:

![](https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-page-1.drawio-3.png)

&gt; **source**: jpcamara.com

The simplest advice about the above code?

Don’t use shared instance variables in classes, ever 🙅🏻‍♂️. Assume that a class level ivar is a [code smell](https://en.m.wikipedia.org/wiki/Code_smell), until proven otherwise.

&gt; 📝 There is a caveat to this, and it’s configuration and initialization class level ivars. It is common to set global configuration values and macros[^7] using class level ivars. The difference is that these are written once on load of your application, and then only ever _read_ past that point. If they are not properly frozen you _could_ still corrupt them by modifying them during program execution. Which is why things like Rails middleware `freeze` the middleware array to try and avoid this issue.
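
That safe shape looks something like this sketch (hypothetical names) - assigned once at boot, frozen, and only read afterwards:

	class PleaseConfig
	  class &lt;&lt; self
	    attr_reader :settings
	
	    def configure(settings)
	      # Written once during boot, before any threads are serving work
	      # (note: freeze is shallow, the same caveat as dup applies)
	      @settings = settings.freeze
	    end
	  end
	end
	
	# e.g. in an initializer, while the app is still single-threaded
	PleaseConfig.configure(algorithm: &#34;AES-256-GCM&#34;)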

How can we make this code thread safe? Use regular Ruby instances. We don’t have to change much.

	class FileSplitter
	  def initialize(file_path, max_lines)
	    @file_path = file_path
	    @max_lines = max_lines
	    @lines = []
	  end
	
	  def split
	    @files = []
	
	    File.open(@file_path) do |file|
	      file.each_line do |line|
	        @lines &lt;&lt; line
	        write_lines(@max_lines)
	      end
	      # force remaining to write
	      write_lines
	      @files
	    end
	  end
	
	  private
	
	  def write_lines(max = nil)
	    return if @lines.empty?
	
	    if max.nil? || @lines.size &gt; max
	      # Also remove @output as an ivar,
	      # we only use it right here so it
	      # being an ivar wasn&#39;t necessary 
	      output = Tempfile.new
	      @lines.each do |line|
	        output.write(line)
	      end
	      @lines = []
	      @files &lt;&lt; output
	    end
	  end
	end

Now when you use this code, each thread creates its own `FileSplitter`:

	### Thread 1
	splitter = FileSplitter.new(path, max)
	splitter.split
	
	### Thread 2
	splitter = FileSplitter.new(path, max)
	splitter.split

Each thread now safely uses its own independent, isolated instance 😌.

&lt;h4 id=&#34;heisenbugs&#34;&gt;Heisenbugs&lt;/h4&gt;

![](https://pbmo.files.wordpress.com/2012/11/walter-white1.png)

&gt; **source**: https://thedailyomnivore.net/2012/11/30/heisenbug/

&gt; A heisenbug is a software bug that seems to disappear or alter its behavior when one attempts to study it.
&gt; 
&gt; The term is a pun on the name of Werner Heisenberg, the physicist who first asserted the observer effect of quantum mechanics, which states that the act of observing a system inevitably alters its state.

The effects of sharing class-level ivars can be deceptive. For instance, try running the following:

	class Result
	  attr_accessor :value
	end
	
	class Fibonacci
	  class &lt;&lt; self
	    attr_accessor :result
	
	    def calculate(n)
	      self.result = Result.new
	      self.result.value = fib(n)
	    end
	
	    def fib(n)
	      return n if n &lt;= 1
	      fib(n - 1) + fib(n - 2)
	    end
	  end
	end
	
	answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]
	
	answers.size.times.map do |n|
	  Thread.new do
	    Fibonacci.calculate(n)
	    answer = answers[n]
	    result = Fibonacci.result.value
	
	    if result != answer
	      raise &#34;[#{result}] != [#{answer}]&#34;
	    end
	  end
	end.map(&amp;:join)

Here we have a naive[^8], basic Fibonacci generator, and we verify each result against an array of precomputed results. We spawn as many threads as there are precomputed results to calculate against. We’re storing the result in a class-level ivar that is being shared across threads - that shouldn’t end well! If any results don’t match, we raise an error.

Run this example and… it doesn’t fail in CRuby 🤔. This might lull you into thinking it isn’t an issue.

Is CRuby somehow thread safe? **No**, we’ve already established with the `FileSplitter` example that it is not inherently thread safe.

Maybe that Global VM Lock (GVL) you’ve heard about is protecting you from this issue? **Unintentionally**, yes.

In CRuby, only one thread running Ruby code can run at a time. That means in our example, the class data can’t get overwritten out from under us without what’s called a “context switch”: the Ruby runtime swapping between our threads, potentially even mid-operation, like during a method call or variable assignment.

There are two common reasons context gets switched between threads in CRuby, which can result in operations only partially completing (e.g., setting the proper result, but being interrupted before checking it):

1. ~100ms of Ruby processing have elapsed
2. A blocking operation has been invoked

Neither of these cases is met by our example. You’re benefiting from low overhead and a lack of blocking operations - there isn’t enough going on to cause a context switch. Without (1) or (2), each thread runs and completes in its entirety before the next thread runs.

&gt; 📝 If you don’t know what the GVL is - or only know it from a high level - you’ll learn all about it in “Concurrent, colorless Ruby” later on

I titled this section “Heisenbugs” because you can get the bug to happen by altering things in what would appear to be innocuous ways. We’ll try two things:

For (1), we’ll try with more calculations:

	answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610, 987, 1597, 2584, 4181, 6765, 10946, 17711, 28657, 46368, 75025, 121393, 196418, 317811, 514229, 832040, 1346269, 2178309, 3524578, 5702887, 9227465, 14930352, 24157817]
	
	answers.size.times.map do |n|
	  Thread.new do
	    Fibonacci.calculate(n)
	    answer = answers[n]
	    result = Fibonacci.result.value
	
	    if result != answer
	      raise &#34;[#{result}] != [#{answer}]&#34;
	    end
	  end
	end.map(&amp;:join)

💥! We get an error raised! So what happened?

```text
[] != [832040] (RuntimeError)
```
In the recursive Fibonacci solution, the code gets exponentially slower the more numbers we try to generate. And we now know when pure Ruby code is being run, the CRuby thread scheduler only allows each thread to run in 100ms time slices. This is so each thread can share processing time.

&gt; 📝 CPU intensive work doesn’t run more efficiently with threads in CRuby, but you will still encounter it on threaded servers.

We can see this time slicing in action with the `gvl-tracing` gem, which creates a timeline of thread context switching using the [CRuby GVL Instrumentation API](https://bugs.ruby-lang.org/issues/18339):

	require &#34;gvl-tracing&#34;
	
	GvlTracing.start(&#34;timeline.json&#34;) do
	  answers.size.times.map do |n|
	    Thread.new do
	      Fibonacci.calculate(n)
	      answer = answers[n]
	      result = Fibonacci.result.value
	
	      if result != answer
	        raise &#34;[#{result}] != [#{answer}]&#34;
	      end
	    end
	  end.map(&amp;:join)
	end

![](https://cdn.uploads.micro.blog/98548/2024/1d4007c2a5.jpeg)

&gt; **source**: jpcamara.com

Our threads’ `running` slices form a sort of waterfall because of the GVL. They spend most of their time in the `wants_gvl` state, which means they’re ready for work but are waiting to acquire the GVL. When they run, they can only hold the GVL for around 100ms of Ruby processing before the CRuby scheduler passes control to another thread. You see each thread in a `running` state for small slices of time - the highlighted thread ran 97ms before having control pass back. As the Fibonacci numbers get larger, the threads need more 100ms slices to finish, which is why each thread has so many `running` blocks.

It’s also why we start to hit issues with our class level ivars. The scheduler can stop a thread mid operation, or in between variable assignments. It takes more iterations than shown, but we can visualize the issue like this:

![](https://cdn.uploads.micro.blog/98548/2024/ruby-is-all-threads-fibonacci.drawio-1.png)

&gt; **source**: jpcamara.com

Can we try a bit harder and get our original example to break as well? Even though it has low throughput?

Let’s create a helper to simulate some load.

	require &#34;concurrent-ruby&#34;
	
	def run_forever(pool_size: 10)
	  pool = Concurrent::FixedThreadPool.new(
	    pool_size
	  )
	  i = Concurrent::AtomicFixnum.new
	
	  loop do
	    pool.post do
	      yield i.increment
	    end
	  end
	end
	

In the `run_forever` method we utilize a gem called `concurrent-ruby` to endlessly run our code across multiple threads. It uses a pool of 10 threads, plus a thread-safe counter that is incremented and yielded as the current iteration each time a block runs. We can now use it to run our original Fibonacci example:

	answers = [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]
	
	run_forever do |iteration|
	  n = iteration % answers.size
	  Fibonacci.calculate(n)
	  answer = answers[n]
	  result = Fibonacci.result.value
	
	  if result != answer
	    raise &#34;[#{result}] != [#{answer}]&#34;
	  end
	rescue =&gt; e
	  puts &#34;Iteration[#{iteration}] #{e.message}&#34;
	end

💥. This time, we start seeing errors!

```text
Iteration[36981] [13] != [34]
Iteration[154569] [89] != [144]
Iteration[173571] [89] != [21]
Iteration[197573] [] != [144]
Iteration[199483] [] != [89]
Iteration[201395] [] != [144]
Iteration[203330] [] != [55]
Iteration[207180] [5] != [144]
Iteration[209117] [5] != [144]
Iteration[211039] [5] != [55]
Iteration[214849] [5] != [89]
Iteration[234986] [8] != [89]
Iteration[221961] [1] != [144]
Iteration[223872] [1] != [144]
Iteration[225767] [1] != [34]
Iteration[244243] [0] != [144]
Iteration[218269] [1] != [144]
Iteration[236807] [8] != [144]
Iteration[240446] [55] != [89]
Iteration[231318] [1] != [34]
Iteration[246175] [0] != [13]
```
For (2), let’s try it again but call `puts`, a blocking operation:

	answers.size.times.map do |n|
	  Thread.new do
	    Fibonacci.calculate(n)
	    puts &#34;Get ready to check for #{n}!&#34;
	    answer = answers[n]
	    result = Fibonacci.result.value
	
	    if result != answer
	      raise &#34;[#{result}] != [#{answer}]&#34;
	    end
	  end
	end.map(&amp;:join)

`puts` gives us a pretty reliable failure:

```text
[144] != [0] (RuntimeError)
Get ready to check for 1!
```
The same basic idea happens - we perform the calculation, but before checking the result we print the message `Get ready to check for #{n}!`. This is a blocking operation in CRuby[^9] - it causes our thread to yield and another thread to get scheduled. When the original thread regains control there’s a good chance the value has been swapped out and the comparison will fail. This is the main reason our original `FileSplitter` code failed so reliably - the `Tempfile` `write` method is a blocking operation and caused a context switch.
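
You don’t even need real IO to provoke the switch - `Thread.pass` asks the scheduler to hand control to another runnable thread (it’s a hint, not a guarantee), which makes it handy for flushing out this kind of bug. A sketch using the same `Fibonacci` class:

	answers.size.times.map do |n|
	  Thread.new do
	    Fibonacci.calculate(n)
	    Thread.pass # offer up control instead of printing
	    result = Fibonacci.result.value
	
	    if result != answers[n]
	      raise &#34;[#{result}] != [#{answers[n]}]&#34;
	    end
	  end
	end.map(&amp;:join)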

&lt;h4 id=&#34;ruby-internals&#34;&gt;Ruby internals&lt;/h4&gt;

Interesting side note: this was my original code for this section, which I initially tried on CRuby 3.2.

	class Repeater
	  class &lt;&lt; self
	    attr_accessor :result
	
	    def repeat_by(content, n)
	      self.result = [content] * n
	    end
	  end
	end
	
	100.times do
	  100.times.map do |n|
	    Thread.new do
	      Repeater.repeat_by(&#34;hello&#34;, n)
	      array_size = Repeater.result.size
	      if array_size != n
	        raise &#34;[#{array_size}] should be [#{n}]?&#34;
	      end
	    end
	  end.map(&amp;:join)
	end

```text
[88] should be [98]? (RuntimeError)
```
That code failed reliably on Mac and Linux on Ruby 3.2. But on Ruby 3.3, I couldn’t get it to fail no matter how many times I ran it. Even between minor Ruby versions the thread internals can change enough to invalidate an evaluation of unsafe code. In this case the code benefited from the change, but there are no guarantees the next version won’t fail even more frequently than 3.2[^10].

Using our `run_forever` helper, however, we can eventually get this to fail on CRuby 3.3:

	run_forever do |iteration|
	  n = rand(100)
	  Repeater.repeat_by(&#34;hello&#34;, n)
	  array_size = Repeater.result.size
	  if array_size != n
	    raise &#34;[#{array_size}] should be [#{n}]?&#34;
	  end
	rescue StandardError =&gt; e
	  puts &#34;Iteration[#{iteration}] #{e.message}&#34;
	end

```text
Iteration[323650] [52] should be [35]?
Iteration[408269] [59] should be [18]?
Iteration[563087] [51] should be [16]?
Iteration[623992] [8] should be [91]?
```
It takes hundreds of thousands of attempts but it does ultimately fail. But why did running it so many times cause it to fail? There’s no blocking operation. It shouldn’t take 100ms to run. In “Concurrent, colorless Ruby” we’ll dig even further into other reasons your thread context switches.

&lt;h4 id=&#34;back-in-reality&#34;&gt;Back in reality...&lt;/h4&gt;

The community has a _much_ better understanding of this type of issue now vs 10+ years ago. But it still happens and I’ve seen it recently in gems, even ones that are well maintained. 

Watch out for this in your own code. Watch out for this in code reviews. Watch out for this in gems. If you see it and it _seems_ to run ok, use `run_forever` and you’ll likely see it fail.

Assume shared class instance variables are a bad idea. Knowing that, you try something new…

&lt;h3 id=&#34;copying-state&#34;&gt;Copying state to instances&lt;/h3&gt;

You’re maintaining the `please_encrypt` gem (a very polite encryption layer), and you have a class level macro that defines which attributes to encrypt:

	class Member
	  include PleaseEncrypt
	  please_encrypt :home_address, :phone
	end

Learning from your `FileSplitter` issue, you’ve removed all of your shared class level ivars in the gem. But you’ve just added a new class level ivar that you’re _sure_ is safe.

You know not to _share_ class-level instance variables, but you are now using some for default values. The macro defines class level ivars containing metadata about the encryption. It’s only defined once on class load, so you won’t be contending with threads overwriting each other[^11].

When an instance uses the `encryptables` metadata, you `dup` the original value so nothing gets shared by accident:

	# member.rb
	class Member
	  include PleaseEncrypt
	  please_encrypt :home_address, :phone
	end
	
	# please_encrypt.rb
	module PleaseEncrypt
	  # ...
	  def pleasant_options
	    # `dup` to avoid sharing across threads
	    @pleasant_options ||= self.class.encryptables.dup
	  end
	
	  def pleasant_attributes
	    @pleasant_attributes ||= # ...
	  end
	
	  module ClassMethods
	    attr_accessor :encryptables
	
	    def please_encrypt(*fields)
	      self.encryptables = {}
	      fields.each do |field|
	        self.encryptables[field] = {
	          algorithm: &#34;AES-256-GCM&#34;,
	          key_method: :generate_key
	        }
	        
	        define_method(&#34;#{field}=&#34;) do |value|
	          result = encrypt(field, value)
	          pleasant_attributes[field][:encrypted] = result
	        end
	
	        define_method(field) do
	          decrypt(
	            field,
	            pleasant_attributes[field][:encrypted]
	          )
	        end
	      end
	    end
	  end
	
	  def encrypt(field, value)
	    options = pleasant_options[field]
	    cipher = OpenSSL::Cipher.new(
	      options[:algorithm]
	    )
	    
	    options[:key] = send(options[:key_method])
	    cipher.encrypt
	    # ...
	  end
	
	  def decrypt(field, value)
	    options = pleasant_options[field]
	    # ...
	  end
	  # ...
	end

We’ll look at the full code later, but this highlights the major points. When `please_encrypt` is called it turns each symbol into a reader and a writer. When setting the field the value is encrypted. When getting the field the value is decrypted. Pleasant!
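
In use it reads like a plain accessor - a tiny sketch:

	member = Member.new
	member.phone = &#34;123-456-7890&#34; # runs encrypt, stores the ciphertext
	member.phone                  # runs decrypt =&gt; &#34;123-456-7890&#34;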

You release a new version of your gem and a few weeks after release you start seeing issues open up! 

😥 

![](https://cdn.uploads.micro.blog/98548/2024/2e63bce4-3922-4351-8644-16918633ce0c.jpeg)

😰

![](https://cdn.uploads.micro.blog/98548/2024/ddf874fe-c21a-463d-a12b-7808a78d7c3a.jpeg)

😱 

![](https://cdn.uploads.micro.blog/98548/2024/b32e5d6b-c913-4c0e-b3fd-7079239004b9.jpeg)

You haven’t seen anything like this in your testing and can’t manage to reproduce it. What is going on?

A user has reported the only way they can reliably reproduce it. It requires running code using `please_encrypt` inside of Sidekiq jobs for 3-5 minutes straight.

	class EncryptionFailureJob
	  include Sidekiq::Job
	
	  def perform
	    m = Member.new
	    m.phone = &#34;123-456-7890&#34;
	    m.phone
	  end
	end
	
	# sidekiq -c 10
	loop do
	  EncryptionFailureJob.perform_async
	end

The user can “fix” the error by forcing a retry in the code:

	def perform
	  m = Member.new
	  m.phone = &#34;123-456-7890&#34;
	  m.phone
	rescue OpenSSL::Cipher::CipherError
	  puts &#34;retrying...&#34;
	  retry
	end

You can use this code to reproduce the issue, and it always succeeds within one `retry` 👀.

Running this way, after a few minutes you usually see an `OpenSSL::Cipher::CipherError`. Yay…

But why does it take so long? Maybe the overhead of Sidekiq interacting with Redis and managing the job server is reducing load. You know how you can get some load - `run_forever`!

	run_forever do |iteration|
	  m = Member.new
	  m.phone = &#34;123-456-7890&#34;
	  m.phone
	rescue =&gt; e
	  puts &#34;Iteration[#{iteration}] #{e.class}&#34; 
	end

Doing that, you’re able to raise an `OpenSSL::Cipher::CipherError` pretty quickly:

```text
Iteration[3591] OpenSSL::Cipher::CipherError
Iteration[8093] OpenSSL::Cipher::CipherError
Iteration[12229] OpenSSL::Cipher::CipherError
Iteration[13368] OpenSSL::Cipher::CipherError
Iteration[18023] OpenSSL::Cipher::CipherError
Iteration[38276] OpenSSL::Cipher::CipherError
Iteration[63144] OpenSSL::Cipher::CipherError
```
You look through recent changes. Luckily, in your case the most recent change you made was your attribute `dup`, so you look deeper into that code. Is something wrong with the `||=` maybe?

Here’s the full source code. Can you find the issue?

&gt; ⚠️ This is working code so you can try it to replicate the issue. It will properly encrypt and decrypt. But it is not meant for real use! Use real, vetted libraries for anything encryption/security related 🔐 

	module PleaseEncrypt
	  def self.included(base)
	    base.extend ClassMethods
	  end
	  
	  module ClassMethods
	    attr_accessor :encryptables
	
	    def please_encrypt(*fields)
	      self.encryptables = {}
	      fields.each do |field|
	        self.encryptables[field] = {
	          algorithm: &#34;AES-256-GCM&#34;,
	          key_method: :generate_key
	        }
	        
	        define_method(&#34;#{field}=&#34;) do |value|
	          result = encrypt(field, value)
	          pleasant_attributes[field][:encrypted] = result
	        end
	
	        define_method(field) do
	          decrypt(
	            field,
	            pleasant_attributes[field][:encrypted]
	          )
	        end
	      end
	    end
	  end
	
	  def encrypt(field, value)
	    options = pleasant_options[field]
	    cipher = OpenSSL::Cipher.new(
	      options[:algorithm]
	    )
	    
	    options[:key] = send(options[:key_method])
	    cipher.encrypt
	    options[:iv] = cipher.random_iv
	    cipher.key = options[:key]
	    encrypted = cipher.update(value) + cipher.final
	    options[:auth_tag] = cipher.auth_tag
	    Base64.encode64(encrypted)
	  end
	
	  def decrypt(field, value)
	    options = pleasant_options[field]
	    cipher = OpenSSL::Cipher.new(
	      options[:algorithm]
	    )
	    
	    data = Base64.decode64(value)
	    cipher.decrypt
	    cipher.iv = options[:iv]
	    cipher.auth_tag = options[:auth_tag]
	    cipher.key = options[:key]
	    cipher.update(data) + cipher.final
	  end
	
	  def pleasant_options
	    @pleasant_options ||= self.class.encryptables.dup
	  end
	
	  def pleasant_attributes
	    @pleasant_attributes ||= hash_defaulter
	  end
	
	  def as_json
	    pleasant_attributes.inject(hash_defaulter) do |hash, (k, v)|
	      hash[k][:encrypted] = v[:encrypted]
	      hash[k][:iv] = pleasant_options[k][:iv]
	      hash[k][:auth_tag] = pleasant_options[k][:auth_tag]
	      hash
	    end
	  end
	
	  private
	
	  def generate_key
	    OpenSSL::Random.random_bytes(32)
	  end
	
	  def hash_defaulter
	    Hash.new do |h, k|
	      h[k] = {}
	    end
	  end
	end
	
	class Member
	  include PleaseEncrypt
	  please_encrypt :home_address, :phone
	end

You print the options and attributes, which is when you see the following:

	m = Member.new
	m.phone = &#34;123\n456\nok!&#34;
	m.home_address = &#34;...&#34;
	puts m.pleasant_attributes
	puts m.pleasant_options
	
	{
	  home_address: { ... },
	  phone: { ... }
	}

Oh no… you’re `dup`’ing the class-level data (good!), but because `dup` only performs a shallow copy, you are sharing nested objects (bad!). 

	members = [
	  Member.new,
	  Member.new,
	  Member.new
	]
	puts &#39;options id&#39;.ljust(12) +
	     &#39; | home_address id&#39;.ljust(18) +
	     &#39; | phone id&#39;.ljust(12)
	puts &#39;-----------------------------------------&#39;
	
	members.each do |member|
	  opts = member.pleasant_options
	  puts opts.object_id.to_s.ljust(15) +
	       opts[:home_address].object_id.to_s.ljust(18) +
	       opts[:phone].object_id.to_s.ljust(12)
	end

This means that per class, per attribute, every thread is modifying the same hash 😩, even across what _seemed_ like independent instances. When we print the `object_id` on each of the hashes, we see that the _top-level_ hash is unique, but the nested hashes are all the same object.

```text
options id   | home_address id | phone id 
-----------------------------------------
2000           1920              1960        
2020           1920              1960        
2040           1920              1960 
```
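Stripped of all the encryption details, this is just `Hash#dup`’s documented shallow-copy behavior - a standalone sketch:

	original = { phone: { algorithm: &#34;AES-256-GCM&#34; } }
	copy = original.dup
	
	copy.equal?(original)                 # =&gt; false - the outer hash is a new object
	copy[:phone].equal?(original[:phone]) # =&gt; true - the nested hash is shared!
	
	copy[:phone][:key] = &#34;oops&#34;
	original[:phone][:key]                # =&gt; &#34;oops&#34;
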
But why does it take high load to break? You have your suspicions…

![](https://cdn.uploads.micro.blog/98548/2024/8p7giv.gif)

Threading issues in CRuby can be ticking time-bombs. The GVL blocks parallel execution of Ruby code, so it takes more effort to force the right timing for unsafe context switches. If you see code that seems like it’s not thread safe but “works” - assume it will eventually break.

The fix is frustratingly simple:

	def pleasant_options
	  @pleasant_options ||= begin
	    duplicate = {}
	    self.class.encryptables.each do |k, v|
	      duplicate[k] = v.dup
	    end
	    duplicate
	  end
	end

Now you are duplicating the nested hashes as well. With this change, you no longer share any data between the class and the instances. Rerunning the `run_forever` example, there are no more errors 🙌🏼.

This does leave you susceptible to deeply nested hashes being shared. If you’re using ActiveSupport or Rails, I’d recommend using `deep_dup`, which handles nested hashes. If not, you could write a recursive method to handle it, or check for a gem (there seem to be many).
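
If you’re outside Rails and don’t want a dependency, a small recursive helper along these lines covers hashes and arrays (a sketch - it doesn’t try to handle every object type):

	def deep_dup(object)
	  case object
	  when Hash
	    object.each_with_object({}) { |(k, v), h| h[k] = deep_dup(v) }
	  when Array
	    object.map { |item| deep_dup(item) }
	  else
	    object.dup
	  end
	end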

&lt;h4 id=&#34;back-in-reality-two&#34;&gt;Back in reality... part two&lt;/h4&gt;

If this whole scenario seemed oddly specific, it’s because it is based on a real gem issue. This exact problem remained an open issue for years in the [`attr_encrypted`](https://github.com/attr-encrypted/attr_encrypted)[^12] gem without anyone identifying the issue. All of the GitHub issue images are from real issues - running in Sidekiq and usually under load they saw intermittent encryption errors.

How could this have been avoided? The simplest answer will sound familiar: don’t share class-level state. Even though in this case the sharing was unintentional, the macro metadata could instead have been read from the class directly rather than mixed into the instance state:

	def encrypt(field, value)
	  options = pleasant_options[field]
	  cipher = OpenSSL::Cipher.new(
	    self.class.encryptables[field][:algorithm]
	  )
	
	  options[:key] = send(self.class.encryptables[field][:key_method])
	  # ...
	end
	
	def decrypt(field, value)
	  options = pleasant_options[field]
	  cipher = OpenSSL::Cipher.new(
	    self.class.encryptables[field][:algorithm]
	  )
	
	  # ...
	end
	
	def pleasant_options
	  # Purely instance-level state now - the auto-vivifying
	  # hash_defaulter gives each field its own fresh hash,
	  # while the read-only metadata stays on the class
	  @pleasant_options ||= hash_defaulter
	end

This is easy to see in hindsight but is very easy to overlook in a larger code base.

`attr_encrypted` is not Rails specific, technically. But if you’re in the Rails world using ActiveRecord you should [use the built-in encryption](https://guides.rubyonrails.org/active_record_encryption.html) instead.

&lt;h3 id=&#34;cleaning-up&#34;&gt;Cleaning up thread state&lt;/h3&gt;

😮‍💨 

Now you’re thinking “is there a way I can avoid all of these headaches?”. You’re so tired of these class level ivars and their surprising behaviors. You’ve heard about something called “thread-locals” that sound really promising. They stay silo’d to your thread and can even act “global”[^13] without any chance of sharing[^14] 😍.

As a test, you use them on the Fibonacci example from earlier and can’t reproduce any issues 😌. Within the thread it’s global, but each thread has its own local state, isolated from the others.

	class Fibonacci
	  class &lt;&lt; self
	    def result
	      Thread.current[:fib_result]
	    end
	
	    def result=(value)
	      Thread.current[:fib_result] = value
	    end
	
	    def calculate(n)
	      self.result = Result.new
	      self.result.value = fib(n)
	    end
	
	    def fib(n)
	      return n if n &lt;= 1
	      fib(n - 1) + fib(n - 2)
	    end
	  end
	end

![](https://cdn.uploads.micro.blog/98548/2024/9bebfe3546.png)

&gt; **source**: jpcamara.com

No class level ivars ✅, no `dup`ing ✅. Feeling confident that it’s a good approach when you need something global, you put this solution in your mental back pocket.
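
A quick sanity check of that isolation - a throwaway sketch:

	Thread.current[:name] = &#34;main&#34;
	
	t = Thread.new do
	  Thread.current[:name]            # =&gt; nil - each thread starts with empty storage
	  Thread.current[:name] = &#34;worker&#34;
	end
	t.join
	
	Thread.current[:name] # =&gt; &#34;main&#34; - untouched by the other thread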

Now you’re writing a new application - it’s a Rails app for sharing articles (because we don’t have enough of those), with a `User` and `Article` model:

	class User &lt; ApplicationRecord
	  has_many :articles
	end
	
	class Article &lt; ApplicationRecord
	end

It’s a typical CRUD web application, and a couple requirements come up:

1. Track the origin of a change. If an article is added, updated or deleted, a record is created to keep track of who did it and when.
2. Add some I18N. You’ve got Japanese and Polish users going wild for this highly original site.

There’s so many places you could need to access this information, you decide to store it in a thread-safe, global way using thread-locals 🙌🏼.

	class AppContext
	  class &lt;&lt; self
	    def user
	      Thread.current[:app_user]
	    end
	
	    def user=(user)
	      Thread.current[:app_user] = user
	      self.locale = user.locale
	    end
	
	    def locale
	      Thread.current[:app_locale]
	    end
	
	    def locale=(locale)
	      Thread.current[:app_locale] = locale
	    end
	  end
	end

&gt; 📝 Not to spoil the ending, but ultimately I will recommend you avoid directly using `Thread.current[]` in favor of other solutions, particularly since we’ll find later that it being “thread-local” is not really accurate.
&gt; 
&gt; Just in case you get so excited at the idea of `Thread.current[]` you start feverishly writing code using it and never finish reading part 2 🏃‍♂️

You use the thread-local data to pass the user into the Change we save.

	# app/controllers/application_controller.rb
	class ApplicationController
	  def set_user
	    AppContext.user = User.find(cookies.encrypted[&#34;user_id&#34;])
	  end
	end
	
	# app/controllers/articles_controller.rb
	class ArticlesController &lt; ApplicationController
	  before_action :set_user
	
	  def create
	    Article.create!(article_params)
	  end
	end
	
	# app/models/change.rb
	class Change &lt; ApplicationRecord
	  belongs_to :trackable, polymorphic: true
	  belongs_to :user
	end
	
	# app/models/article.rb
	class Article &lt; ApplicationRecord
	  include Trackable
	end
	
	# app/models/concerns/trackable.rb
	module Trackable
	  extend ActiveSupport::Concern
	
	  included do
	    before_save :track_changes
	    before_destroy :track_destroy
	    has_many :changes, as: :trackable
	  end
	
	  private
	
	  def track_changes
	    action = if new_record?
	      &#34;create&#34;
	    else
	      &#34;update&#34;
	    end
	
	    track_change(action)
	  end
	
	  def track_destroy
	    track_change(&#34;delete&#34;)
	  end
	
	  def track_change(action)
	    Change.create(
	      trackable: self,
	      action:,
	      changes: attributes,
	      user: AppContext.user
	    )
	  end
	end

This seems to work really well. Well enough that you decide to expose a change history to your users so they can see edits that have been made to an article, and by whom.

	class ArticlesController &lt; ApplicationController
	  # ...
	
	  def show
	    @article = Article.find(params[:id])
	  end
	end

```erb
&lt;!-- app/views/articles/show.html.erb --&gt;

&lt;h1&gt;&lt;%= t(&#39;articles.changes.title&#39;) %&gt;&lt;/h1&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;&lt;%= t(&#39;articles.changes.headers.article_id&#39;) %&gt;&lt;/th&gt;
      &lt;th&gt;&lt;%= t(&#39;articles.changes.headers.action&#39;) %&gt;&lt;/th&gt;
      &lt;th&gt;&lt;%= t(&#39;articles.changes.headers.changes&#39;) %&gt;&lt;/th&gt;
      &lt;th&gt;&lt;%= t(&#39;articles.changes.headers.user&#39;) %&gt;&lt;/th&gt;
      &lt;th&gt;&lt;%= t(&#39;articles.changes.headers.changed_at&#39;) %&gt;&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;% @article.changes.each do |change| %&gt;
      &lt;tr&gt;
        &lt;td&gt;&lt;%= change.trackable_id %&gt;&lt;/td&gt;
        &lt;td&gt;&lt;%= t(&#34;articles.changes.actions.#{change.action}&#34;) %&gt;&lt;/td&gt;
        &lt;td&gt;
          &lt;ul&gt;
            &lt;% change.changes.each do |attribute, value| %&gt;
              &lt;li&gt;&lt;%= t(&#34;articles.attributes.#{attribute}&#34;) %&gt;: &lt;%= value %&gt;&lt;/li&gt;
            &lt;% end %&gt;
          &lt;/ul&gt;
        &lt;/td&gt;
        &lt;td&gt;&lt;%= change.user.email %&gt;&lt;/td&gt;
        &lt;td&gt;&lt;%= l(change.created_at, format: :long) %&gt;&lt;/td&gt;
      &lt;/tr&gt;
    &lt;% end %&gt;
  &lt;/tbody&gt;
&lt;/table&gt;
```

Things are going smoothly, but what good is posting articles without being able to discuss them?

	# app/models/discussion.rb
	class Discussion &lt; ApplicationRecord
	  include Trackable
	end
	
	# app/controllers/discussions_controller.rb
	class DiscussionsController &lt; ApplicationController
	  def create
	    Discussion.create!(discussion_params)
	  end
	
	  def show
	    @discussion = Discussion.find(params[:id])
	  end
	end
	
	# assume a near identical &#34;changes&#34; view...

You test this out and it works on your machine 😬. Guess it’s time to deploy 🤷🏻‍♂️.

Except… once you deploy these changes, you start receiving some weird reports from users. Users will sometimes see discussions render in a totally random language. Even worse, some changes are showing up associated with unrecognized users!

Co się dzieje?!

何が起こっているの？！

What is happening?!

You analyze your database and logs, and notice something strange:

1. The changes are only ever wrong for the `Discussion` model
2. The internationalizations are only ever wrong in the discussions views

What’s the difference between `ArticlesController` and `DiscussionsController`?

	class ArticlesController &lt; ApplicationController
	  before_action :set_user
	
	  def create
	    Article.create!(article_params)
	  end
	end
	
	class DiscussionsController &lt; ApplicationController
	  def create
	    Discussion.create!(discussion_params)
	  end
	end

_Oh_, simple, there it is: you’re not setting `AppContext.user` in the discussions controller. But if it’s not getting set, why is it a different user and locale? Why is it set to anything at all?

You check the docs for thread-locals only to find…

- They exist for the lifetime of the thread
- Everything in Ruby runs in a thread
- On multi-threaded servers like Puma, threads are reused between requests, so they can leak any data you don’t clear out
- Even on single-threaded servers like Unicorn, you still run within the Ruby “main” thread (`Thread.main`), so you can leak data you don’t clear out there too 🫠

![](https://cdn.uploads.micro.blog/98548/2024/c3ff12657f.png)

&gt; **source**: jpcamara.com

Well _that_ was pretty bad. Whenever a user went to the `DiscussionsController`, it would use information from _whichever user used that thread last_. It might be the same user, but it could just as easily be someone else.
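
Here’s a contrived sketch of that reuse, with a single worker thread standing in for a Puma thread serving two “requests”:

	requests = Queue.new
	requests &lt;&lt; { user: &#34;alice&#34; }
	requests &lt;&lt; {} # forgot to set the user, like DiscussionsController
	
	Thread.new do
	  2.times do
	    request = requests.pop
	    Thread.current[:app_user] = request[:user] if request[:user]
	    puts Thread.current[:app_user].inspect # &#34;alice&#34; both times 😱
	  end
	end.join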

You make sure the values get set in every controller now, and also get cleared out at the end of every request.

	class AppContext
	  class &lt;&lt; self
	    # ...
	    def clear
	      [:app_user, :app_locale].each do |key|
	        Thread.current[key] = nil
	      end
	    end
	  end
	end
	
	class ApplicationController &lt; ActionController::Base
	  after_action :clear_app_context
	
	  def clear_app_context
	    AppContext.clear
	  end
	end
	
	class DiscussionsController &lt; ApplicationController
	  before_action :set_user
	  # ...
	end

Now if someone forgets to add it to their controller, it won’t be set to any user at all - at the end of every request the thread-local is cleared out so it won’t be present[^15].
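
As that footnote mentions, the same cleanup could live in a Rack middleware instead of an `after_action`. A minimal sketch (the class name is mine):

	class AppContextMiddleware
	  def initialize(app)
	    @app = app
	  end
	
	  def call(env)
	    @app.call(env)
	  ensure
	    AppContext.clear
	  end
	end
	
	# config/application.rb
	config.middleware.use AppContextMiddleware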

You deploy the change and _almost_ everything is good.

- Locales are good! No more Polish for English users, or Japanese for Polish users.
- When someone posts to a discussion, it consistently shows the right user in the change history 😮‍💨 
- _But_, there’s still an intermittent change showing up as the wrong user. What other code is there?

The controllers are good. How are the background jobs?

	class FirstTimePostingJob &lt; ApplicationJob
	  def perform(user)
	    AppContext.user = user
	    Congrats.new(user).call
	  end
	end
	
	class ImageUploadJob &lt; ApplicationJob
	  def perform(article, file_url)
	    # Uploads to a service 
	    article.upload_from_url(file_url)
	  end
	end

You’re setting the user in one job, and picking it up in another 🤦🏻‍♂️. When the article is updated to add an image using the `file_url`, sometimes the change has no user at all, and sometimes it’s a totally random user. Hardly ever is it the actual user that added the image 😩.

What you did for controllers, you can’t forget to do for jobs. You need to clear the values the same way (after a job finishes), but the logic goes in a new place.

&lt;h4 id=&#34;using-sidekiq&#34;&gt;If you&#39;re using Sidekiq&lt;/h4&gt;

If you’re using a mixture of Sidekiq and ActiveJob, or Sidekiq alone, a [Sidekiq Middleware](https://github.com/sidekiq/sidekiq/wiki/Middleware) is your best option. It will work for both job types because it is called internally by the Sidekiq server when performing a job.

	class ContextClearMiddleware
	  include Sidekiq::ServerMiddleware
	
	  def call(_worker, _job, _queue)
	    yield
	  ensure
	    AppContext.clear
	  end
	end
	
	Sidekiq.configure_server do |config|
	  config.server_middleware do |chain|
	    chain.add ContextClearMiddleware
	  end
	end

&lt;h4 id=&#34;using-activejob&#34;&gt;If you&#39;re using ActiveJob&lt;/h4&gt;

If you exclusively use ActiveJob, then an `after_perform` in your base job class should work just as well, and will work even if you’re using SolidQueue, GoodJob, or any other job adapter that hooks into ActiveJob.

	class ApplicationJob &lt; ActiveJob::Base
	  after_perform :clear_app_context
	
	  def clear_app_context
	    AppContext.clear
	  end
	end

### Next Up
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2024/1d541d2e-2d8c-4380-b166-a2c720da072a.jpeg&#34; alt=&#34;nested ruby concurrency&#34; style=&#34;height: 50%; width: 50%&#34;/&gt;

&gt; **source**: jpcamara.com

This is the first step into this Ruby concurrency series. In Part 2 we’ll dig into a few more topics: true thread locals + fibers, the layered Ruby concurrency model, reusing objects, some more complex threading coordination, and ideas for keeping your code and your dependencies thread safe. More soon 👋🏼!

[^1]:	It doesn’t have to use threads by default, but there’s a good chance you are configured for _some_

[^2]:	None of these servers force you to use multiple threads, but they will usually use them by default. Puma’s OOTB configuration in Rails uses 3 threads, for instance. [https://github.com/rails/rails/blob/main/railties/lib/rails/generators/rails/app/templates/config/puma.rb.tt](https://github.com/rails/rails/blob/main/railties/lib/rails/generators/rails/app/templates/config/puma.rb.tt)

    They use some threads internally as well, even if you don’t serve requests using more than 1 thread - possibly with the exception of Falcon, since its core is the FiberScheduler.

[^3]:	You also don’t _have_ to use these job queues with threads either. But they all use them by default, or at least have internal threaded elements.

[^4]:	You shouldn’t though.

    [http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html](http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html)

    [https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/](https://www.mikeperham.com/2015/05/08/timeout-rubys-most-dangerous-api/)

    But if you’re using net-http, you’re using the timeout gem and have no choice.

    There was an attempt to remove timeout from net-http that was reverted: [https://github.com/ruby/ruby/commit/f88bff770578583a708093f4a0d8b1483a1d2039](https://github.com/ruby/ruby/commit/f88bff770578583a708093f4a0d8b1483a1d2039)

    It IS safe to use inside a fiber scheduler like the async gem, because the scheduler swaps the timeout thread out for a fiber-based timeout.

[^5]:	It’s ok to do once, up front, or controlled by mutexes. We’ll discuss that more in a bit.

[^6]:	Sometimes you bake them into a cake for your friends 🎂. You just really love tempfiles 🥹.

[^7]:	By “macros” I mean things like column encryption in Rails using `encrypts`. Internally it uses a class level instance variable called `encrypted_attributes` to track columns you are encrypting on that model. 

    _But_ it is only set once when the class is loaded. If the class is reloaded, it must be locked by a mutex to safely change that value - Rails has a “reloader” that handles this, for instance.

[^8]:	Naive because this is a terribly inefficient way to calculate Fibonacci sequences, which is why it’s a great example for demonstrating how CPU-intensive work causes CRuby context switching.

[^9]:	And most other languages

[^10]:	Ruby 3.3 introduced the M:N thread scheduler, so there was a ton of refactoring of pthread internals. It may have been more impactful to threading than other versions - but every minor version is a year-long, large-scale effort that can easily have an impact on these private, internal behaviors.

[^11]:	This assumes you’re only loading classes once or safely handle reloads across threads (like Rails does)

[^12]:	The reproduction steps are even a modified form of one of the issues [https://github.com/attr-encrypted/attr\_encrypted/issues/372](https://github.com/attr-encrypted/attr_encrypted/issues/372) and here is the code that fixed the issue [https://github.com/attr-encrypted/attr\_encrypted/pull/320/files](https://github.com/attr-encrypted/attr_encrypted/pull/320/files)

[^13]:	I’ll leave it to others to discuss the merits or pitfalls of using any globals at all, thread-local or otherwise 🤷🏻‍♂️ 

[^14]:	They’re totally susceptible to the copying state issue we just discussed, but at least you’re aware of it now.

[^15]:	This could be achieved in a rack middleware as well, as opposed to an `after_action`
</source:markdown>
    </item>
    
    <item>
      <title>A comprehensive guide to PgBouncer/Postgres compatibility</title>
      <link>https://jpcamara.com/2023/05/25/a-comprehensive-guide.html</link>
      <pubDate>Thu, 25 May 2023 21:23:46 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2023/05/25/a-comprehensive-guide.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/efb3a4281b.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;My previous post, &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html&#34; target=&#34;_blank&#34;&gt;PgBouncer is useful, important, and fraught with peril&lt;/a&gt;, was a &lt;em&gt;deep&lt;/em&gt; dive into Postgres feature compatibility with different modes of PgBouncer.&lt;/p&gt;
&lt;p&gt;I’m happy with how it came out and it was well received. I &lt;em&gt;think&lt;/em&gt; it is the most comprehensive guide to Postgres/PgBouncer compatibility that exists. But it might be a daunting read for some (9k words 😅), and the name doesn’t clearly convey what it offers - I want it to be easily found when searching for PgBouncer/Postgres pooling questions, and be quickly digestible on the fly.&lt;/p&gt;
&lt;p&gt;Below is a list of topics you may be looking for more information about. While PgBouncer is the most popular option, most of it applies to any of the Postgres pooling options available (&lt;a href=&#34;https://github.com/supabase/supavisor&#34;&gt;Supavisor&lt;/a&gt;, &lt;a href=&#34;https://github.com/postgresml/pgcat&#34;&gt;PgCat&lt;/a&gt;, &lt;a href=&#34;https://github.com/yandex/odyssey&#34;&gt;Odyssey&lt;/a&gt;, etc).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Looking to understand how different &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#turn-it-on&#34; target=&#34;_blank&#34;&gt;PgBouncer modes&lt;/a&gt; work? See &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#session-mode&#34; target=&#34;_blank&#34;&gt;session mode&lt;/a&gt;, &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#statement-mode&#34; target=&#34;_blank&#34;&gt;statement mode&lt;/a&gt;, and &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#transaction-mode&#34; target=&#34;_blank&#34;&gt;transaction mode&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Want to understand connection pooling a little better? &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#connection-pooling&#34; target=&#34;_blank&#34;&gt;See this section on pooling approaches&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Looking at the &lt;a href=&#34;https://www.pgbouncer.org/features.html&#34; target=&#34;_blank&#34;&gt;PgBouncer compatibility table&lt;/a&gt; and wanting more detail on the implications of each feature? See the next section.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;sql-feature-map-for-pooling-modes&#34;&gt;SQL feature map for pooling modes&lt;/h3&gt;
&lt;p&gt;Below is each compatibility table feature linked to the original post. If there isn’t a link, it means the feature works the same way with or without PgBouncer.&lt;/p&gt;
&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;
				Feature
			&lt;/th&gt;
			&lt;th&gt;
				Session pooling
			&lt;/th&gt;
			&lt;th&gt;
				Transaction pooling 
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Startup parameters*
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#statement-timeouts&#34; target=&#34;_blank&#34;&gt;SET/RESET statement_timeout&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#lock-timeouts&#34; target=&#34;_blank&#34;&gt;SET/RESET lock_timeout&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#listen-notify&#34; target=&#34;_blank&#34;&gt;LISTEN/NOTIFY&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				WITHOUT HOLD CURSOR
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;WITH HOLD CURSOR&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements&#34; target=&#34;_blank&#34;&gt;Protocol-level prepared plans&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes, with some gotchas**
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements&#34; target=&#34;_blank&#34;&gt;PREPARE / DEALLOCATE&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				ON COMMIT DROP temp tables
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;PRESERVE/DELETE ROWS temp tables&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Cached plan reset
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;LOAD statement&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#session-level-locks&#34; target=&#34;_blank&#34;&gt;Session-level advisory locks&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;
&lt;blockquote&gt;
&lt;p&gt;* Startup parameters are: &lt;code&gt;client_encoding&lt;/code&gt;, &lt;code&gt;datestyle&lt;/code&gt;, &lt;code&gt;timezone&lt;/code&gt;, and &lt;code&gt;standard_conforming_strings&lt;/code&gt;. PgBouncer detects their changes and so it can guarantee they remain consistent for the client.&lt;/p&gt;
&lt;p&gt;** PgBouncer 1.21 introduced protocol-level prepared plan support, though there are some current gotchas which may be resolved in Postgres 17. You can find out more in the &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements&#34;&gt;original post&lt;/a&gt;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I’m biased, but I think it’s a pretty good read in its entirety as well. There are more sections on other gotchas, community pooling suggestions, and a look at work improving Postgres internals to lighten the need for poolers.&lt;/p&gt;
&lt;p&gt;Thanks for reading, and I hope it answers your questions!&lt;/p&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2023/efb3a4281b.png)

My previous post, &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html&#34; target=&#34;_blank&#34;&gt;PgBouncer is useful, important, and fraught with peril&lt;/a&gt;, was a _deep_ dive into Postgres feature compatibility with different modes of PgBouncer.

I’m happy with how it came out and it was well received. I _think_ it is the most comprehensive guide to Postgres/PgBouncer compatibility that exists. But it might be a daunting read for some (9k words 😅), and the name doesn’t clearly convey what it offers - I want it to be easily found when searching for PgBouncer/Postgres pooling questions, and be quickly digestible on the fly.

Below is a list of topics you may be looking for more information about. While PgBouncer is the most popular option, most of it applies to any of the Postgres pooling options available ([Supavisor](https://github.com/supabase/supavisor), [PgCat](https://github.com/postgresml/pgcat), [Odyssey](https://github.com/yandex/odyssey), etc).

- Looking to understand how different &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#turn-it-on&#34; target=&#34;_blank&#34;&gt;PgBouncer modes&lt;/a&gt; work? See &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#session-mode&#34; target=&#34;_blank&#34;&gt;session mode&lt;/a&gt;, &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#statement-mode&#34; target=&#34;_blank&#34;&gt;statement mode&lt;/a&gt;, and &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#transaction-mode&#34; target=&#34;_blank&#34;&gt;transaction mode&lt;/a&gt;
- Want to understand connection pooling a little better? &lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#connection-pooling&#34; target=&#34;_blank&#34;&gt;See this section on pooling approaches&lt;/a&gt;
- Looking at the &lt;a href=&#34;https://www.pgbouncer.org/features.html&#34; target=&#34;_blank&#34;&gt;PgBouncer compatibility table&lt;/a&gt; and wanting more detail on the implications of each feature? See the next section.

### SQL feature map for pooling modes
Below is each compatibility table feature linked to the original post. If there isn’t a link, it means the feature works the same way with or without PgBouncer.

&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;
				Feature
			&lt;/th&gt;
			&lt;th&gt;
				Session pooling
			&lt;/th&gt;
			&lt;th&gt;
				Transaction pooling 
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Startup parameters*
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#statement-timeouts&#34; target=&#34;_blank&#34;&gt;SET/RESET statement_timeout&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#lock-timeouts&#34; target=&#34;_blank&#34;&gt;SET/RESET lock_timeout&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#listen-notify&#34; target=&#34;_blank&#34;&gt;LISTEN/NOTIFY&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				WITHOUT HOLD CURSOR
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;WITH HOLD CURSOR&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements&#34; target=&#34;_blank&#34;&gt;Protocol-level prepared plans&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes, with some gotchas**
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements&#34; target=&#34;_blank&#34;&gt;PREPARE / DEALLOCATE&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				ON COMMIT DROP temp tables
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;PRESERVE/DELETE ROWS temp tables&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Cached plan reset
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#unavailable&#34; target=&#34;_blank&#34;&gt;LOAD statement&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;a href=&#34;https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#session-level-locks&#34; target=&#34;_blank&#34;&gt;Session-level advisory locks&lt;/a&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&gt; \* Startup parameters are: `client_encoding`, `datestyle`, `timezone`, and `standard_conforming_strings`. PgBouncer detects their changes and so it can guarantee they remain consistent for the client.
&gt; 
&gt; \*\* PgBouncer 1.21 introduced protocol-level prepared plan support, though there are some current gotchas which may be resolved in Postgres 17. You can find out more in the [original post](https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html#prepared-statements).

I’m biased, but I think it’s a pretty good read in its entirety as well. There are more sections on other gotchas, community pooling suggestions, and a look at work improving Postgres internals to lighten the need for poolers.

Thanks for reading, and I hope it answers your questions!

</source:markdown>
    </item>
    
    <item>
      <title>PgBouncer is useful, important, and fraught with peril</title>
      <link>https://jpcamara.com/2023/04/12/pgbouncer-is-useful.html</link>
      <pubDate>Wed, 12 Apr 2023 21:40:33 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2023/04/12/pgbouncer-is-useful.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/e0723a5982.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Updated &lt;strong&gt;2024-09-17&lt;/strong&gt; to reflect updated PgBouncer support for protocol-level prepared statements 🐘&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;To start, I want to say that I’m appreciative that PgBouncer exists and the work its open source maintainers put into it. I also love working with PostgreSQL, and I’m thankful for the incredible amount of work and improvements that go into it as well.&lt;/p&gt;
&lt;p&gt;I also think community and industry enthusiasm around Postgres is at an all time high. There are more managed hosting options than ever (&lt;a href=&#34;https://www.crunchydata.com&#34;&gt;Crunchy Data&lt;/a&gt;, &lt;a href=&#34;https://render.com/docs/databases&#34;&gt;Render&lt;/a&gt;, &lt;a href=&#34;https://fly.io/docs/postgres/&#34;&gt;Fly.io&lt;/a&gt;, and on and on), deep extensions like &lt;a href=&#34;https://postgresml.org&#34;&gt;PostgresML&lt;/a&gt;, &lt;a href=&#34;https://www.citusdata.com&#34;&gt;Citus&lt;/a&gt; and &lt;a href=&#34;https://www.timescale.com&#34;&gt;Timescale&lt;/a&gt;, serverless options like &lt;a href=&#34;https://neon.tech&#34;&gt;Neon&lt;/a&gt;, and real-time services like &lt;a href=&#34;https://supabase.com&#34;&gt;Supabase&lt;/a&gt; with Postgres at their center. Postgres is a robust, advanced and &lt;em&gt;fast&lt;/em&gt; RDBMS capable of handling the needs of most every application.&lt;/p&gt;
&lt;p&gt;I just find the current state of recommendations and guidance around scaling Postgres to be confounding. And it feels surprising for new Postgres users to discover that one of the most common scaling options relies on a solution like PgBouncer.&lt;/p&gt;
&lt;p&gt;Over the years I’ve read dozens of articles around scaling and maintaining Postgres databases, and they always understate the impact of PgBouncer on your application. They casually mention unusable features without any exploration, or the numerous ways you can silently break expected query behavior. The advice is just to turn it on. &lt;strong&gt;I want it to be clear that as your application scales, PgBouncer is often necessary but isn’t free&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;The following sections provide an overview of what connection pooling is in general, how connection pooling modes work in PgBouncer and similar tools, and then I dig into every Postgres feature that does not work in PgBouncer transaction mode and what the implications are. This is the PgBouncer article I wish existed the first time I used it - let’s get going 🐘!&lt;/p&gt;
&lt;h3 id=&#34;contents&#34;&gt;Contents&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#connection-pooling&#34;&gt;What is connection pooling?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#separate-tool&#34;&gt;Why do I need a separate tool from Postgres?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#framework-pool&#34;&gt;Framework pooling&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#client-pool&#34;&gt;Client proxy pooling&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#server-pool&#34;&gt;Server proxy pooling&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#turn-it-on&#34;&gt;Can I just turn on PgBouncer and get scaling for free?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#session-mode&#34;&gt;Session mode&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#statement-mode&#34;&gt;Statement mode&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#transaction-mode&#34;&gt;Transaction mode&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#perils&#34;&gt;Perils&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;#invalid-statements&#34;&gt;Detecting invalid statements 😑&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#lock-timeouts&#34;&gt;Lock timeouts (SET/RESET) 🔓&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#statement-timeouts&#34;&gt;Statement timeouts (SET/RESET) ⏳&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#transparency&#34;&gt;Transparency 👻&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#prepared-statements&#34;&gt;Prepared Statements (PREPARE/DEALLOCATE, Protocol-level prepared plans) ✔️&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#throughput&#34;&gt;Pool throughput / Long running queries 🏃‍♂️&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#session-level-locks&#34;&gt;Session Level Advisory Locks 🔐 &lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#listen-notify&#34;&gt;Listen/Notify 📣&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#single-threaded&#34;&gt;The single thread 🪡 &lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#pg_dump&#34;&gt;pg_dump 🚮&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#unavailable&#34;&gt;Other unavailable features 🫥&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#linting&#34;&gt;Linting 🧶&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#future-improvements&#34;&gt;Can we improve connections without a pooler?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#alternatives&#34;&gt;PgBouncer alternatives&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;connection-pooling&#34;&gt;What is connection pooling?&lt;/h2&gt;
&lt;p&gt;PgBouncer is a lightweight connection pooler for PostgreSQL. What does that mean exactly? What is connection pooling and why is it needed?&lt;/p&gt;
&lt;p&gt;Opening a connection is expensive: a new Postgres client connection involves TCP setup, process creation and backend initialization – all of which are costly in terms of time and system resources. A connection pool keeps a set of connections available for reuse so we can avoid that overhead past initial connection.&lt;/p&gt;
&lt;p&gt;There are three main levels of connection pooling&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;framework-pool&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/0c4ffd30c0.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework connection pooling&lt;/strong&gt;. This is a common feature of many frameworks/libraries. Within a given process, you maintain a pool of active connections that are shared between code, generally running across threads. Whenever you handle some processing in a server request, a background process, a job, etc., you open a connection and keep that connection open. When that piece of work finishes and a new piece of work starts, you can reuse the connection without the expense of opening a new connection to the database every single time. These connections are usually local to a particular operating system process, so you gain no benefit outside of that process (and if you’re scaling Postgres, you probably have lots of processes).&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;client-pool&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/d281d0294d.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;One level higher, you can have &lt;strong&gt;client level connection pooling&lt;/strong&gt; outside of your code. PgBouncer can handle this: instead of independent, unsharable, process-isolated connections, you proxy all of your connections through PgBouncer. But it runs on your server, so you still cannot share connections between servers (and again, by the time you need this you probably have lots of servers).&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;server-pool&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/cd987b4e04.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Server level connection pooling&lt;/strong&gt;. Here we host PgBouncer independent of our servers and connect to a single central PgBouncer instance&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;. This is the most robust form of connection pooling because independent of anything else in your code or server, you are guaranteed that any client connection is coming from the pool.&lt;/p&gt;
&lt;h2 id=&#34;separate-tool&#34;&gt;Why do I need a separate tool from Postgres?&lt;/h2&gt;
&lt;p&gt;That’s all great but&amp;hellip; why do we need it?&lt;/p&gt;
&lt;p&gt;There are two primary layers to this:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Maintaining connections is beneficial as a base feature. Less memory and I/O churn, less latency before running queries, and less pressure on the database from constantly opening and closing connections.&lt;/li&gt;
&lt;li&gt;Postgres connections get expensive very quickly. &lt;em&gt;Surprisingly&lt;/em&gt; quickly.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Here are some general community guidelines around allowable Postgres connection counts based on a mixture of community experience and specific benchmarking:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In terms of what some managed services even offer: &lt;a href=&#34;https://supabase.com/blog/supabase-pgbouncer&#34;&gt;Supabase&lt;/a&gt; offers a max of &lt;em&gt;50&lt;/em&gt; connections, &lt;a href=&#34;https://neon.tech&#34;&gt;Neon&lt;/a&gt; offers a max of &lt;em&gt;100&lt;/em&gt; connections, and &lt;a href=&#34;https://render.com/docs/databases#connecting-to-your-database&#34;&gt;Render&lt;/a&gt; offers a max of &lt;em&gt;397&lt;/em&gt; connections.&lt;/li&gt;
&lt;li&gt;The general upper bound recommendation is a &lt;em&gt;max&lt;/em&gt; of 500 active connections. Services like &lt;a href=&#34;https://elements.heroku.com/addons/heroku-postgresql&#34;&gt;Heroku Postgres&lt;/a&gt; even &lt;em&gt;enforce&lt;/em&gt; a hard limit of 500 connections&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;li&gt;Even at 500 connections, your server is going to be strained. &lt;a href=&#34;https://www.enterprisedb.com/postgres-tutorials/why-you-should-use-connection-pooling-when-setting-maxconnections-postgres&#34;&gt;This more recent (as of 2023) enterprisedb article&lt;/a&gt; analyzed connection performance and found that 300-400 active connections seems optimal. This &lt;a href=&#34;https://brandur.org/postgres-connections&#34;&gt;article from Brandur&lt;/a&gt; is older (2018) but seems to reinforce this idea as well&lt;/li&gt;
&lt;li&gt;There have been &lt;a href=&#34;https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/improving-postgres-connection-scalability-snapshots/ba-p/1806462&#34;&gt;some more recent connection improvements in Postgres&lt;/a&gt; (as of version 14) handling idle connections more efficiently&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;, but active connections are still expensive&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt; and idle connections have not reached the scale of a dedicated pooler&lt;/li&gt;
&lt;li&gt;The reality of 500 connections is it sounds extremely low but those connections can handle &lt;em&gt;a ton of throughput&lt;/em&gt;. The &lt;em&gt;problem&lt;/em&gt; is, as a metric of pure concurrency, real connections have a hard upper limit. So if you try to have five thousand clients connect simultaneously, you’re going to start getting loads of connection errors&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;To improve the cost of connection overhead, general connection pooling is helpful and a PgBouncer instance in its default session based mode can handle it. But to improve concurrency things have to get a bit&amp;hellip; &lt;em&gt;quirky&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;There are two modes in PgBouncer which give clients access to more connections than Postgres &lt;em&gt;actually&lt;/em&gt; has available. They rely on the idea that at any given time many of your connections are idle, so you can free up usage of idle connections to improve concurrency.&lt;/p&gt;
&lt;h2 id=&#34;turn-it-on&#34;&gt;Can I just turn on PgBouncer and get scaling for free?&lt;/h2&gt;
&lt;p&gt;Kind of? But not really? It’s complicated.&lt;/p&gt;
&lt;p&gt;Internally, PgBouncer will manage a pool of connections for you. The default pooling mode it starts with, session pooling, is conservative, and in most cases will not provide improved concurrency&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;I’m going to hand wave a bit past two of the modes and focus on the typical recommendation.&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;session-mode&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/7391257b1c.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Session mode&lt;/strong&gt; is the default and most conservative mode. This is a 1:1 - your local connection truly holds onto a full connection until you close it. This does little to help you scale connection concurrency, but it helps with latency and connection churn overhead.&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;statement-mode&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/6e486ad1b1.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Statement mode&lt;/strong&gt; is the most aggressive mode and means your connection goes back into the pool after &lt;em&gt;every statement&lt;/em&gt;. You lose the ability to use transactions 😰 - that seems wild, and usable for only the most specialized of use cases.&lt;/p&gt;
&lt;p&gt;&lt;span id=&#34;transaction-mode&#34;&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/9e0451a965.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The mode that results in a more sane balance of improved concurrency and retained critical database features is &lt;strong&gt;transaction mode&lt;/strong&gt;. Transaction mode means your connection stays consistent as long as you’re in a transaction. Once the transaction finishes, your code &lt;em&gt;thinks&lt;/em&gt; it still has a real connection, but PgBouncer actually releases the connection back into the pool internally. This is &lt;em&gt;session sharing&lt;/em&gt;: your session is going to be shared with other connections without being reset or closed.&lt;/p&gt;
&lt;p&gt;Transaction mode is a powerful concept. Your code in general has lots of database downtime. Most code does not solely operate on the database - it takes CPU cycles, interacts with files, makes network calls, and calls other data stores. During that time, your connection sits idle and unused for what in computing and database terms is an eternity. By releasing it back into the pool outside of transactions, you free up your idle connection for use by a client who actually needs it. This way your 500 available connections can service thousands of clients, instead of clients being 1:1 with available connections.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- connection is actually pulled from the pool inside PgBouncer
BEGIN;
INSERT INTO...;
UPDATE;
COMMIT;
-- connection goes back to the pool inside PgBouncer
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The problem with transaction mode is that this tiny configuration change quietly changes not only your ability to scale, but also the way your connections &lt;em&gt;behave&lt;/em&gt;. It breaks the expected command semantics between client and database server. And understanding whether you’ve gotten things right in transaction mode is &lt;em&gt;very difficult&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Let’s say you’ve been operating with PgBouncer in session mode (or operating without a proxy at all), and you make the switch to transaction mode. Your perspective on how you can use Postgres needs to change - so now we’re onto the &lt;em&gt;peril&lt;/em&gt;.&lt;/p&gt;
&lt;h2 id=&#34;perils&#34;&gt;Perils&lt;/h2&gt;
&lt;p&gt;Many of the following items are documented shortcomings of PgBouncer in &lt;a href=&#34;#transaction-mode&#34;&gt;transaction mode&lt;/a&gt;. But:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;They’re treated lightly&lt;/li&gt;
&lt;li&gt;Their repercussions and downsides are not discussed&lt;/li&gt;
&lt;li&gt;PgBouncer is often recommended without mentioning them&lt;/li&gt;
&lt;li&gt;PgBouncer is often recommended at the same time as recommending incompatible transaction mode features like session level advisory locks and session level statement timeouts&lt;/li&gt;
&lt;li&gt;The non-determinism introduced by using incompatible statements is not discussed (ie, I execute a statement in Process A and suddenly Process B errors out due to it)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Assume anytime I mention PgBouncer after this point I am referring to &lt;a href=&#34;#transaction-mode&#34;&gt;transaction mode&lt;/a&gt;. Here we go!&lt;/p&gt;
&lt;h3 id=&#34;invalid-statements&#34;&gt;Detecting invalid statements 😑&lt;/h3&gt;
&lt;p&gt;PgBouncer happily accepts statements that are not supported in transaction mode. The problem is pushed onto the developer, which means they &lt;em&gt;can&lt;/em&gt; and &lt;em&gt;will&lt;/em&gt; get it wrong&lt;sup id=&#34;fnref:8&#34;&gt;&lt;a href=&#34;#fn:8&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;This is by design. The sense I get is that PgBouncer was specifically architected to not analyze any statements and so it would be a big change for them to handle this&lt;sup id=&#34;fnref:9&#34;&gt;&lt;a href=&#34;#fn:9&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Amazon has a similar tool to PgBouncer called RDS Proxy, and it has a feature called “&lt;a href=&#34;https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/rds-proxy-managing.html#rds-proxy-pinning&#34;&gt;connection pinning&lt;/a&gt;”. If it detects a statement that is incompatible with transaction mode, it will automatically hold that connection for that client for the duration of their session.&lt;/p&gt;
&lt;p&gt;This is both highly useful and simultaneously problematic. It means query behavior is consistent with your expectations (🙌🏼) but also that you can silently kill all concurrency benefits (😑). If enough queries are run that trigger connection pinning, all of a sudden you may throttle your &lt;a href=&#34;#throughput&#34;&gt;throughput&lt;/a&gt;. But it does give you an escape hatch for safely running statements which are not transaction compatible without having to jump through any hoops.&lt;/p&gt;
&lt;p&gt;I’d be fine with some logging I could monitor. As far as I can tell there is nothing like this in PgBouncer, and so all the burden lands on you to get it right. As one engineer, or a few engineers, all aware of potential issues, you can probably maintain that. But what about dozens of engineers? Or hundreds? Thousands? All with varying levels of experience with databases and poolers? There’s going to be mistakes.&lt;/p&gt;
&lt;h3 id=&#34;lock-timeouts&#34;&gt;Lock timeouts (SET/RESET) 🔓&lt;/h3&gt;
&lt;p&gt;Unless you &lt;em&gt;like&lt;/em&gt; app downtime, you should be using a &lt;code&gt;lock_timeout&lt;/code&gt; when running &lt;a href=&#34;https://www.postgresql.org/docs/current/ddl.html&#34;&gt;DDL&lt;/a&gt;. It’s a critical aspect of &lt;a href=&#34;https://medium.com/paypal-tech/postgresql-at-scale-database-schema-changes-without-downtime-20d3749ed680&#34;&gt;zero downtime migrations&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The idea is to set it to a limit that would be an acceptable slowdown for queries in your application - while your DDL waits to acquire a lock, related queries can queue up behind it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- slow select
SELECT * FROM my_table;

-- DDL starts in separate process, blocked on acquiring the lock by the 
--    slow query
ALTER TABLE my_table...

-- Subsequent queries start queuing up...
SELECT * FROM my_table WHERE id = 123;
SELECT * FROM my_table WHERE id = 234;
--- ...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In that scenario, the slow query is running the show. Until it finishes, &lt;em&gt;all the other queries to that table are stuck&lt;/em&gt;. That goes on long enough and users can’t use the system. A bit longer and your app starts timing out. A bit longer and you’re running out of &lt;em&gt;connections&lt;/em&gt;. Now you’re staring at a total app outage, about ready to kill all of your connections in a desperate attempt to salvage things, contemplating a career change to landscaping where you can at most impact one person at a time, right? That sounds nice, doesn’t it?&lt;/p&gt;
&lt;p&gt;I’ve of course never experienced that. I’m just &lt;em&gt;very&lt;/em&gt; creative 💀. But if &lt;em&gt;you&lt;/em&gt; have experienced that, or you’d like to &lt;em&gt;avoid&lt;/em&gt; experiencing that, use a &lt;code&gt;lock_timeout&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;SET lock_timeout TO &#39;2s&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;Now&lt;/em&gt; if your DDL cannot acquire a lock it will throw an error after 2 seconds. That should be an ok delay in running queries, and you can retry the operation later.&lt;/p&gt;
&lt;p&gt;But wait! Are you connected to PgBouncer?! You may want to bring up that landscaping help-wanted listing again&amp;hellip; 🌳&lt;/p&gt;
&lt;p&gt;&lt;code&gt;SET&lt;/code&gt; operations apply at the &lt;a href=&#34;#session-mode&#34;&gt;session level&lt;/a&gt;. This means that on a PgBouncer connection, there is no guarantee our &lt;code&gt;lock_timeout&lt;/code&gt; will still be applied when we run our DDL:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Process 1
-- PgBouncer pulls connection 1
SET lock_timeout TO &#39;2s&#39;;
-- connection 1 goes back to the pool

-- Meanwhile, in Process 2:
-- PgBouncer pulls connection 3
SELECT id FROM my_table, pg_sleep(30);

-- Back in Process 1:
-- PgBouncer pulls connection 2
-- This connection has no lock_timeout set, so it will hang 
--    until our pg_sleep query finishes 30 seconds later, and all
--    queries to my_table after it are stuck for those 30 seconds as well
ALTER TABLE my_table...
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It’d be easy to argue “don’t have slow queries”. And that should be the goal! But we don’t call it “happy path uptime 🌼”, we call it “&lt;em&gt;zero&lt;/em&gt; downtime”. It means even if things go wrong, you don’t go down. There can also be other operations that hold a lock on your table, so you simply can’t rely on successfully acquiring that lock&lt;sup id=&#34;fnref:10&#34;&gt;&lt;a href=&#34;#fn:10&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;10&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;So what can we do? There are two options:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Bypass PgBouncer and go straight to the database&lt;/li&gt;
&lt;li&gt;Use a transaction level &lt;code&gt;lock_timeout&lt;/code&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;h4 id=&#34;bypassing-pgbouncer&#34;&gt;Bypassing PgBouncer&lt;/h4&gt;
&lt;p&gt;Your safest bet is to go with option (1). You should have some ability to directly connect to your database, so take advantage of it and don’t jump through hoops to run DDL safely.&lt;/p&gt;
&lt;p&gt;The biggest obstacle you hit with (1) is &lt;a href=&#34;#transparency&#34;&gt;transparency&lt;/a&gt;: PgBouncer &lt;em&gt;really&lt;/em&gt; doesn’t want you to know whether you are connected to the real database or not. There’s no &lt;em&gt;easy&lt;/em&gt; answer there, but if you validate a setup where you consistently run your DDL process directly against Postgres, then you’re set.&lt;/p&gt;
&lt;h4 id=&#34;use-transaction-level-statements&#34;&gt;Use transaction level statements&lt;/h4&gt;
&lt;p&gt;There is a transaction local equivalent to the &lt;code&gt;SET&lt;/code&gt; statement: &lt;code&gt;SET LOCAL&lt;/code&gt;. Using our example from earlier:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Process 1
-- PgBouncer pulls connection 1
BEGIN;
SET LOCAL lock_timeout TO &#39;2s&#39;;
-- connection 1 stays checked out

-- Meanwhile, in Process 2:
-- PgBouncer pulls connection 3
SELECT id FROM my_table, pg_sleep(30);

-- Back in Process 1:
-- Connection 1 is still checked out
ALTER TABLE my_table...
-- lock_timeout raises an error after 2 seconds waiting, and 
--    we avoid our downtime!
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;DDL in Postgres is transactional, so it’s valid to start our transaction, set our &lt;code&gt;lock_timeout&lt;/code&gt; using &lt;code&gt;SET LOCAL&lt;/code&gt;, then start our DDL operation. Our transaction local setting will stick with us until the transaction commits or rolls back, so we safely keep our timeout and rollback our DDL.&lt;/p&gt;
&lt;p&gt;It’s not a &lt;em&gt;terrible&lt;/em&gt; solution (1 is still better), except for two things:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Concurrent indexes&lt;/li&gt;
&lt;li&gt;Tooling&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Another zero downtime star is the concurrent index. When you create a new index on a table you run the chance of locking it up long enough to cause downtime. Here’s the answer to that problem:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;CREATE INDEX CONCURRENTLY index_name ON my_table;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Concurrent indexes are created without an exclusive lock, so your normal operations keep going while it builds the index in the background. The &lt;em&gt;problem&lt;/em&gt; is they can’t be run in a transaction, so &lt;code&gt;SET LOCAL&lt;/code&gt; is not an option.&lt;/p&gt;
&lt;p&gt;Because they don’t require an exclusive lock, setting a &lt;code&gt;lock_timeout&lt;/code&gt; is less important. But if there is contention and you just can’t get that index to acquire its shared lock, do you really want it to run forever?&lt;/p&gt;
&lt;p&gt;As for (2), popular tooling usually does not handle &lt;code&gt;SET LOCAL&lt;/code&gt; for you. In the Rails/ActiveRecord world there are several libraries that will automatically apply zero downtime policies for you, but they all assume you have an exclusive connection and operate at the &lt;code&gt;SET&lt;/code&gt; session level.&lt;/p&gt;
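&lt;p&gt;If you do wire up &lt;code&gt;SET LOCAL&lt;/code&gt; by hand anyways, a minimal sketch from the ActiveRecord world might look like this (the migration and column names are mine, and it assumes plain, transactional DDL):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;class AddStatusToMyTable &amp;lt; ActiveRecord::Migration[7.1]
  def up
    # Migrations run inside a transaction by default on Postgres, so
    # SET LOCAL scopes the timeout to this migration and resets on COMMIT/ROLLBACK
    execute &#34;SET LOCAL lock_timeout TO &#39;2s&#39;&#34;
    add_column :my_table, :status, :text
  end
end
&lt;/code&gt;&lt;/pre&gt;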
&lt;p&gt;&lt;a href=&#34;https://en.m.wikipedia.org/wiki/The_road_to_hell_is_paved_with_good_intentions&#34;&gt;In PgBouncer, the road to downtime is paved with session level statements&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Just go with (1), keep your sanity, throw away the diary entries about living out your days breathing in the smell of fresh cut grass, and connect directly to Postgres to run DDL with &lt;code&gt;SET lock_timeout&lt;/code&gt; calls.&lt;/p&gt;
&lt;h3 id=&#34;statement-timeouts&#34;&gt;Statement timeouts (SET/RESET) ⏳&lt;/h3&gt;
&lt;p&gt;Determined not to repeat your experiences from &lt;code&gt;lock_timeout&lt;/code&gt;, you read about this thing called &lt;code&gt;statement_timeout&lt;/code&gt;. This little magic wand makes it so you control how long a statement is allowed to run 🪄.&lt;/p&gt;
&lt;p&gt;So here it is:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;SET statement_timeout TO &#39;2s&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Those greedy queries now don’t stand a chance. You can tame your long running queries and avoid blocking your DDL! You ignore my advice to always use &lt;code&gt;lock_timeout&lt;/code&gt;, say “bye losers” to long running queries, and fire off that DDL again&amp;hellip; oh god. Why are things slowing down. Now they’re timing out. &lt;em&gt;The connections are filling up.&lt;/em&gt; What is &lt;em&gt;happening?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://media.tenor.com/MYZgsN2TDJAAAAAC/this-is.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Oh riiiight. You forgot. You’re using PgBouncer. &lt;code&gt;SET&lt;/code&gt; is off the table. Should have set that &lt;code&gt;lock_timeout&lt;/code&gt; 🔐&amp;hellip;&lt;/p&gt;
&lt;p&gt;If I had a nickel for every time someone mentioned &lt;code&gt;SET statement_timeout&lt;/code&gt; and PgBouncer in the same article&amp;hellip;&lt;sup id=&#34;fnref:11&#34;&gt;&lt;a href=&#34;#fn:11&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;11&lt;/a&gt;&lt;/sup&gt; I know no one sharing this content is doing it maliciously, but be aware that these are misleading and incompatible features.&lt;/p&gt;
&lt;h4 id=&#34;with-lock_timeout-why-does-statement_timeout-even-matter&#34;&gt;With lock_timeout, why does statement_timeout even matter?&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Statement timeouts are helpful for long running queries so they cancel earlier. If a client disconnects, Postgres will periodically check for the connection and try to cancel the query when it goes away. But a query &lt;a href=&#34;https://dba.stackexchange.com/a/81424/256107&#34;&gt;with runaway cpu&lt;/a&gt; usage will just keep running even if the client dies or disconnects. That means you lose that connection until the query finishes, which can take minutes (or hours)&lt;/li&gt;
&lt;li&gt;The database default is 0, which means there is no limit. In some contexts this is not a problem, but particularly for web requests this is excessive&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The first time I used &lt;code&gt;statement_timeout&lt;/code&gt; was from a blog recommendation to limit statements for requests in web applications. In a web request, you usually have an upper limit on how long you allow them to run before they time out - this conserves resources, protects against runaway buggy code and helps with bad actors. It made sense that I’d set it to something conservative on all my web connections to deal with long running queries.&lt;/p&gt;
&lt;p&gt;I deployed the code and for a little while things seemed to work well. Then I saw something odd. This started popping up:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;canceling statement due to statement timeout
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;But in my&amp;hellip; &lt;em&gt;background jobs&lt;/em&gt;? My web requests were tuned to be fast, but the constraints around my background processes were a bit&amp;hellip; looser. Can you guess what I had recently enabled? PgBouncer in transaction mode. My session level statement timeout was being swapped out from my web request, picked up by my job, and caused my job to timeout instead - web request safety was off the rails and longer running jobs were intermittently failing.&lt;/p&gt;
&lt;p&gt;So is there any way we can use it? There are a couple of ways I know of, but nothing great when pooling.&lt;/p&gt;
&lt;h4 id=&#34;our-old-friend-transaction&#34;&gt;Our old friend transaction&lt;/h4&gt;
&lt;pre&gt;&lt;code&gt;BEGIN;
SET LOCAL statement_timeout &#39;5s&#39;;
SELECT ...
COMMIT;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Something about wrapping a SELECT in a transaction feels kind of strange, but it works. If you have targeted concerns, you can wrap particular queries in a transaction and use &lt;code&gt;SET LOCAL&lt;/code&gt; to localize the &lt;code&gt;statement_timeout&lt;/code&gt;.&lt;/p&gt;
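&lt;p&gt;In ActiveRecord terms, that targeted pattern is a minimal wrapper (the query is a stand-in):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ActiveRecord::Base.transaction do
  # SET LOCAL scopes the timeout to this transaction only
  ActiveRecord::Base.connection.execute(&#34;SET LOCAL statement_timeout TO &#39;5s&#39;&#34;)
  Article.where(published: true).count # canceled if it runs past 5 seconds
end
&lt;/code&gt;&lt;/pre&gt;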
&lt;p&gt;This is absolutely not a viable solution for a whole request lifecycle. If I wanted to attempt my web request level timeouts again, no way am I wrapping every web request in one giant transaction. Postgres doesn’t have a concept of nested transactions so any code I have that may be operating transactionally is gonna be in for some confusing surprises&lt;sup id=&#34;fnref:12&#34;&gt;&lt;a href=&#34;#fn:12&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;12&lt;/a&gt;&lt;/sup&gt;. And most importantly, wrapping my whole request in a transaction means I’ve completely negated the benefit of proxy pooling - now my request lifecycles are basically 1:1 with my connection sessions.&lt;/p&gt;
&lt;h4 id=&#34;apply-statement-timeouts-per-user&#34;&gt;Apply statement timeouts per user&lt;/h4&gt;
&lt;p&gt;I’ve never tried it, but I’ve seen it recommended to set statement timeouts per user when using PgBouncer. That approach has a couple of problems I can think of:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;It’s not dynamically configurable.&lt;/li&gt;
&lt;li&gt;It dilutes the pool of available connections per context&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;(1) is definitely inconvenient. If you have different contexts where you’d like to apply different timeout constraints, this would be way too cumbersome to maintain.&lt;/p&gt;
&lt;p&gt;But (2) &lt;em&gt;feels&lt;/em&gt; like a deal breaker. If I want to constrain my web requests to a conservative timeout, but give my background processes more wiggle room, my pool size of real connections is now split instead of sharing a pool of total available database connections. I also have to manage making sure each context uses the appropriate user, or things will go badly.&lt;/p&gt;
&lt;p&gt;It’s technically an option, but seems trickier to maintain and monitor.&lt;/p&gt;
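&lt;p&gt;For reference, the per-user setup looks something like this - the role names are made up for illustration:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;ALTER ROLE web_user SET statement_timeout = &#39;5s&#39;;
ALTER ROLE job_user SET statement_timeout = &#39;5min&#39;;
-- Each context must then connect as its own user,
--   each with its own slice of the real connections
&lt;/code&gt;&lt;/pre&gt;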
&lt;h3 id=&#34;transparency&#34;&gt;Transparency 👻&lt;/h3&gt;
&lt;p&gt;&lt;img src=&#34;https://media2.giphy.com/media/xT5LMN0UgalbScp6uI/giphy.gif?cid=6c09b952f7248c90c21812529981462733f1d648a5076839&amp;amp;rid=giphy.gif&amp;amp;ct=g&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I don’t understand why my session features aren’t working. I always make sure to use plenty of Postgr&amp;hellip;PgBouncer?!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;It is very difficult to tell when you are or aren’t using PgBouncer, &lt;a href=&#34;https://github.com/pgbouncer/pgbouncer/issues/249&#34;&gt;which is unfortunately by design&lt;/a&gt;. It considers itself a transparent proxy. In &lt;a href=&#34;#session-mode&#34;&gt;session mode&lt;/a&gt;, that’s pretty much true. But in &lt;a href=&#34;#transaction-mode&#34;&gt;transaction&lt;/a&gt; and &lt;a href=&#34;#statement-mode&#34;&gt;statement&lt;/a&gt; mode you are working with bizarro Postgres. It all works the same except when it doesn’t.&lt;/p&gt;
&lt;p&gt;So if you want a regular connection because you need a feature not available in transaction mode, being sure you did it right is extremely difficult.&lt;/p&gt;
&lt;p&gt;I have had a hell of a time verifying that some servers are or aren’t running with PgBouncer. Server A is using pub/sub, so I don’t want it. Server B needs the throughput, so I want it. How can I make sure someone never makes a mistake and attaches the server to the wrong place? Basically, I can’t.&lt;/p&gt;
&lt;p&gt;When it comes to production code I like to be paranoid. On a large enough codebase, and team, and user base, unusual things are bound to happen, sometimes regularly. I try to write code and configure environments so the right way is easy and the wrong way is hard. PgBouncer does not make that easy.&lt;/p&gt;
&lt;p&gt;On this particular point I’d love to say I have some kind of advice to act on, but it mostly takes testing and validating your setup. If someone out there has better ideas or tips, I am all ears.&lt;/p&gt;
&lt;h3 id=&#34;prepared-statements&#34;&gt;Prepared Statements (PREPARE/DEALLOCATE, Protocol-level prepared plans) ✔️&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;📝 Update as of PgBouncer version 1.21 - protocol-level prepared statements are now supported! See &lt;a href=&#34;https://www.crunchydata.com/blog/prepared-statements-in-transaction-mode-for-pgbouncer&#34;&gt;this crunchy data article&lt;/a&gt; for more specifics. &lt;code&gt;PREPARE&lt;/code&gt; style statements will still never be supported, and there are still some gotchas with protocol-level support you can learn about in that article.&lt;/p&gt;
&lt;p&gt;Everything mentioned here is still relevant for versions &amp;lt; 1.21. As well, it still gives a thorough explanation of why &amp;ldquo;turning off&amp;rdquo; prepared statements while still utilizing &lt;code&gt;libpq&lt;/code&gt; (as most libraries do) is a-ok ✌️&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;PgBouncer has a public relations problem when it comes to prepared statements. This is all the &lt;a href=&#34;https://www.pgbouncer.org/features.html&#34;&gt;PgBouncer docs say&lt;/a&gt; about them:&lt;/p&gt;
&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;
				Feature
			&lt;/th&gt;
			&lt;th&gt;
				Session pooling
			&lt;/th&gt;
			&lt;th&gt;
				Transaction pooling
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				&lt;code&gt;PREPARE&lt;/code&gt; / &lt;code&gt;DEALLOCATE&lt;/code&gt;
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Protocol-level prepared plans
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				No*
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;
&lt;blockquote&gt;
&lt;p&gt;* It is possible to add support for that into PgBouncer&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Kind of feels&amp;hellip; alarming. No prepared statements in transaction mode?! Aren’t those&amp;hellip; important? It goes further: when you go to use PgBouncer with Hibernate or ActiveRecord (and I’m sure others), you’ll see the recommendation to configure them to &lt;em&gt;turn off&lt;/em&gt; prepared statements 😱. Does it surprise you a bit to hear that? Make you feel a little queasy maybe?&lt;/p&gt;
&lt;p&gt;I had it drilled into me early in my career that prepared statements are a critical part of protecting against SQL injection. In the &lt;a href=&#34;https://cheatsheetseries.owasp.org/cheatsheets/SQL_Injection_Prevention_Cheat_Sheet.html&#34;&gt;OWASP SQL Injection Prevention Cheatsheet&lt;/a&gt; the very first recommendation is:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Use of Prepared Statements (with Parameterized Queries)&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So PgBouncer tells me I need to &lt;em&gt;turn them off?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://thumbs.gfycat.com/AstonishingEarlyHornet-size_restricted.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The first time I used PgBouncer in an application I spent &lt;em&gt;a lot&lt;/em&gt; of time figuring out how turning off prepared statements was safe to do. It turns out that prepared statements in Postgres mean a few things, but come down to two main options:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Named prepared statements&lt;/li&gt;
&lt;li&gt;Unnamed prepared statements&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;em&gt;Named&lt;/em&gt; prepared statements are reusable, and are tied to the connection session.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Unnamed&lt;/em&gt; prepared statements are single use, and have no association to the connection session.&lt;/p&gt;
&lt;p&gt;There are two ways to create a &lt;em&gt;named&lt;/em&gt; prepared statement and one way to create an &lt;em&gt;unnamed&lt;/em&gt; prepared statement:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;PREPARE&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Protocol-level Parse/Bind/Execute with a name specified&lt;/li&gt;
&lt;li&gt;Protocol-level Parse/Bind/Execute with no name specified&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;PgBouncer says it doesn’t support prepared statements in either &lt;code&gt;PREPARE&lt;/code&gt; or protocol-level format. What it &lt;em&gt;actually&lt;/em&gt; doesn’t support are &lt;em&gt;named&lt;/em&gt; prepared statements in any form. That’s because named prepared statements live in the session and in &lt;a href=&#34;#transaction-mode&#34;&gt;transaction mode&lt;/a&gt; you can switch sessions.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- PgBouncer pulls connection 1
PREPARE bouncer_since (int, timestamp) AS
SELECT * 
FROM bouncers b
INNER JOIN guests g ON g.bouncer_id = b.id
WHERE b.id = $1 AND b.created &amp;gt; $2;
-- connection 1 goes back to the pool

-- PgBouncer pulls connection 2
EXECUTE bouncer_since(1, now() - INTERVAL &#39;2 days&#39;);
-- 💣 ERROR: prepared statement &amp;quot;bouncer_since&amp;quot; does not exist 💣
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;But &lt;em&gt;unnamed prepared statements are totally fine&lt;/em&gt;. In fact, I’d be shocked if the current client library you’re using to connect to Postgres does not already switch to them if “prepared statements” (again, so damn misleading) are “turned off”.&lt;/p&gt;
&lt;p&gt;But wait. What the heck is an unnamed statement? &lt;code&gt;PREPARE&lt;/code&gt; &lt;em&gt;requires&lt;/em&gt; a name&amp;hellip; how can I make a prepared statement without a name?&lt;/p&gt;
&lt;h4 id=&#34;protocol-level-prepared-plans&#34;&gt;Protocol-level prepared plans&lt;/h4&gt;
&lt;p&gt;&lt;img src=&#34;https://media0.giphy.com/media/P5wPrhzZDdeJW/giphy.gif?cid=6c09b95256738d3ee35e24f988a790f60659836b97f75ee8&amp;amp;rid=giphy.gif&amp;amp;ct=g&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The alternative to the &lt;code&gt;PREPARE&lt;/code&gt; statement is to directly communicate with Postgres at the protocol level.&lt;/p&gt;
&lt;p&gt;I had to dig a bit to get a handle on this - I started from a common Ruby ORM called ActiveRecord, dug into the Ruby “pg” gem &lt;em&gt;it&lt;/em&gt; uses, then went one layer deeper into &lt;code&gt;libpq&lt;/code&gt;, which is part of Postgres itself.&lt;/p&gt;
&lt;p&gt;If we use ActiveRecord as an example, &lt;a href=&#34;https://guides.rubyonrails.org/configuring.html#configuring-a-postgresql-database&#34;&gt;when prepared statements are “disabled”&lt;/a&gt;, the Postgres adapter internally calls &lt;code&gt;exec_no_cache&lt;/code&gt; in &lt;code&gt;activerecord/lib/active_record/connection_adapters/postgresql_adapter.rb&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def exec_no_cache(sql, name, binds...)
  #...
  conn.exec_params(sql, type_casted_binds)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;That’s powered by the Ruby “pg” gem, which, when calling &lt;code&gt;exec_params&lt;/code&gt; from Ruby, ultimately calls into the &lt;code&gt;libpq&lt;/code&gt; function &lt;code&gt;PQsendQueryParams&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;// Ruby &amp;quot;pg&amp;quot; gem
// ext/pg_connection.c
static VALUE
pgconn_async_exec_params(int argc, VALUE *argv, VALUE self) {}

// internally calls...
static VALUE
pgconn_send_query_params(int argc, VALUE *argv, VALUE self) {}

// internally calls this from the libpq c postgres internals:
// src/interfaces/libpq/fe-exec.c
int PQsendQueryParams(PGconn *conn,
  const char *command,
  int nParams,
  const Oid *paramTypes,
  const char *const *paramValues,
  const int *paramLengths,
  const int *paramFormats,
  int resultFormat) {}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;What does &lt;code&gt;PQsendQueryParams&lt;/code&gt; do? It calls an internal method named &lt;code&gt;PQsendQueryGuts&lt;/code&gt;. Notice the empty string and &lt;code&gt;use unnamed statement&lt;/code&gt; comment 🤔.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-c&#34; data-lang=&#34;c&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;PQsendQueryGuts&lt;/span&gt;(conn,
    command,
    &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#34;&amp;#34;&lt;/span&gt;, &lt;span style=&#34;color:#75715e&#34;&gt;/* use unnamed statement */&lt;/span&gt;
    nParams,
    paramTypes,
    paramValues,
    paramLengths,
    paramFormats,
    resultFormat);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;What does &lt;em&gt;that&lt;/em&gt; function do (aside from making me laugh every time I read the name &lt;code&gt;PQsendQueryGuts&lt;/code&gt; 😆)? Internally &lt;code&gt;PQsendQueryGuts&lt;/code&gt; communicates with Postgres at the protocol level:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;/* construct the Parse message */
if (pqPutMsgStart(&#39;P&#39;, conn) &amp;lt; 0 ||
  pqPuts(stmtName, conn) &amp;lt; 0 ||
  pqPuts(command, conn) &amp;lt; 0) {}

/* Construct the Bind message */
if (pqPutMsgStart(&#39;B&#39;, conn) &amp;lt; 0 ||
  pqPuts(&amp;quot;&amp;quot;, conn) &amp;lt; 0 ||
  pqPuts(stmtName, conn) &amp;lt; 0) {}

/* construct the Execute message */
if (pqPutMsgStart(&#39;E&#39;, conn) &amp;lt; 0 ||
  pqPuts(&amp;quot;&amp;quot;, conn) &amp;lt; 0 ||
  pqPutInt(0, 4, conn) &amp;lt; 0 ||
  pqPutMsgEnd(conn) &amp;lt; 0) {}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is the Parse/Bind/Execute process I mentioned earlier.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The code sends a &lt;strong&gt;P&lt;/strong&gt;arse message with the query and an optional name. In our case the name is empty&lt;/li&gt;
&lt;li&gt;The code then &lt;strong&gt;B&lt;/strong&gt;inds params to that query (if the query is parameterized)&lt;/li&gt;
&lt;li&gt;It then &lt;strong&gt;E&lt;/strong&gt;xecutes using the combination of the parsed query and the bound params&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This is perfectly safe to do in transaction mode, and from a SQL safety perspective should behave identically to a named prepared statement.&lt;/p&gt;
&lt;h4 id=&#34;named-protocol-level-statements&#34;&gt;Named protocol-level statements&lt;/h4&gt;
&lt;p&gt;For comparison, when ActiveRecord has prepared statements turned on, things &lt;em&gt;look&lt;/em&gt; a bit different, but by the end we’re in the same place:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;def exec_cache(sql, name, binds...)
  #...pseudo coded a bit but importantly
  #   it calls `prepare`
  if !cached
    stmt_key = conn.prepare(sql)
  # then it calls exec_prepared
  conn.exec_prepared(stmt_key, type_casted_binds)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;It first has to call &lt;code&gt;prepare&lt;/code&gt; with whatever SQL we’re going to run. The caller is in charge of keeping track of whether that SQL has been prepared before - otherwise Postgres will keep overwriting the previous statement, and it might as well just execute an unnamed one. Then it calls &lt;code&gt;exec_prepared&lt;/code&gt; with only the &lt;code&gt;stmt_key&lt;/code&gt;, which should match the name of a previously prepared query.&lt;/p&gt;
&lt;p&gt;If we skip ahead to what gets called in &lt;code&gt;libpq&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;// conn.prepare(sql)
int
PQsendPrepare(PGconn *conn,
    const char *stmtName, 
    const char *query,
    int nParams, 
    const Oid *paramTypes) {
  //...
  if (pqPutMsgStart(&#39;P&#39;, conn) &amp;lt; 0 ||
      pqPuts(stmtName, conn) &amp;lt; 0 ||
      pqPuts(query, conn) &amp;lt; 0) {}
  //...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We see something similar to our earlier Parse/Bind/Execute, but now we’re &lt;em&gt;only&lt;/em&gt; calling the &lt;strong&gt;P&lt;/strong&gt;arse portion and this time we have a &lt;code&gt;stmtName&lt;/code&gt;. We then trigger the prepared statement calling &lt;code&gt;exec_prepared&lt;/code&gt;, which ultimately calls &lt;code&gt;PQsendQueryPrepared&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;// conn.exec_prepared(stmt_key, type_casted_binds)
int
PQsendQueryPrepared(PGconn *conn,
    const char *stmtName,
    int nParams,
    const char *const *paramValues,
    const int *paramLengths,
    const int *paramFormats,
    int resultFormat) {
  //...
  return PQsendQueryGuts(conn,
      NULL,     // no sql
      stmtName, // named
      nParams,
      NULL,
      paramValues,
      paramLengths,
      paramFormats,
      resultFormat);
  //...
}
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Anything look familiar? That’s the same &lt;code&gt;PQsendQueryGuts&lt;/code&gt; function we called for the unnamed statement! This time it doesn’t hand a &lt;code&gt;command&lt;/code&gt; in because we already parsed our SQL in the earlier &lt;code&gt;prepare&lt;/code&gt; call. We also have a &lt;code&gt;stmtName&lt;/code&gt; defined, instead of handing in an empty string. This version goes on to skip the &lt;strong&gt;P&lt;/strong&gt;arse, call the &lt;strong&gt;B&lt;/strong&gt;ind with the &lt;code&gt;stmtName&lt;/code&gt;, then call &lt;strong&gt;E&lt;/strong&gt;xecute - same flow as our unnamed version.&lt;/p&gt;
&lt;p&gt;For SQL injection safety, both named and unnamed versions are equivalent: they separate query structure (Parse) from data values (Bind). Adding query bindings when not in a prepared statement simply makes an unnamed statement.&lt;/p&gt;
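&lt;p&gt;A tiny illustration of what that separation buys you - the table and value here are hypothetical:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Safe: the structure is Parsed first, the value is Bound separately,
--   so the value can never change the shape of the query
SELECT * FROM users WHERE email = $1;

-- Unsafe: splicing raw input into the SQL text means a value like
--   &amp;quot;&#39; OR 1=1 --&amp;quot; becomes part of the parsed query structure
&lt;/code&gt;&lt;/pre&gt;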
&lt;p&gt;Nothing about these calls is specific to the &lt;code&gt;libpq&lt;/code&gt; library, it’s just a rock solid implementation of them&lt;sup id=&#34;fnref:13&#34;&gt;&lt;a href=&#34;#fn:13&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;13&lt;/a&gt;&lt;/sup&gt; - any language could make the same protocol calls. If a library is utilizing this protocol, they are doing the same things when binding to an unnamed prepared statement as they are when binding to a named prepared statement&lt;sup id=&#34;fnref:14&#34;&gt;&lt;a href=&#34;#fn:14&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;As long as your code uses parameterized queries, “turning off” prepared statements for PgBouncer is safe, even if it seems a bit unnerving. There is a &lt;a href=&#34;https://github.com/pgbouncer/pgbouncer/pull/757&#34;&gt;PR to allow PgBouncer to track prepared statements&lt;/a&gt;, so maybe this won’t cause people like me as much heartburn in the future 🥲.&lt;/p&gt;
&lt;h3 id=&#34;throughput&#34;&gt;Pool throughput / Long running queries 🏃‍♂️&lt;/h3&gt;
&lt;p&gt;We’ve got two types of connections to Postgres: active and idle. Idle connections are the backbone of poolers - having idle connections means we’ve got capacity to swap around transactions for connected clients. What about active connections?&lt;/p&gt;
&lt;p&gt;An active connection means that connection is actively tied up by the database. For that timespan, the connection cannot be swapped out to do anything else until its operation completes. We know that active connections get expensive quickly, and we also know that most managed services range somewhere from 50 to 500 allowed total, non-pooled connections.&lt;/p&gt;
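&lt;p&gt;If you’re curious what that split looks like on your own server, &lt;code&gt;pg_stat_activity&lt;/code&gt; will show it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Count real connections by state (active, idle, idle in transaction...)
SELECT state, count(*)
FROM pg_stat_activity
GROUP BY state;
&lt;/code&gt;&lt;/pre&gt;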
&lt;p&gt;Using a max PgBouncer connection pool of 10k and Render’s managed Postgres service with a max of 397 total connections means we’d have:&lt;/p&gt;
&lt;p&gt;10000 / 397 = ~25 client connections per real connection&lt;/p&gt;
&lt;p&gt;Using Supabase’s 50 connections, the spread is even higher:&lt;/p&gt;
&lt;p&gt;10000 / 50 = ~&lt;em&gt;200&lt;/em&gt; client connections per real connection&lt;/p&gt;
&lt;p&gt;That means that for every long-running operation, you are potentially down 200 connections worth of pooling.&lt;/p&gt;
&lt;p&gt;These numbers are very back of the napkin and of course do not represent the true scaling capability and connection handling of a real pooler. But the point is this:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Active connections are very valuable to a pooler&lt;/li&gt;
&lt;li&gt;Long running queries disproportionally impact concurrency&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;As an example, you’re using Render Postgres fronted by PgBouncer and you’ve got 10k available connections backed by the max of 397 Postgres connections. Let’s say a new feature is introduced for populating some graph data on your app’s landing page. It’s powered by a new query that looks great, has indexes, and seems well optimized. It’s even been run through load testing against representatively sized data as a QA check. It gets deployed to production and &lt;em&gt;OOF, it’s taking 15 seconds per query&lt;/em&gt; 🐌. Users are logging in or navigating to the landing page all the time, so within moments you’ve had thousands of hits to this query. Obviously this is going to get quickly rolled back, but what does it mean for your pool in the meantime?&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://media1.giphy.com/media/137TKgM3d2XQjK/giphy.gif?cid=6c09b952ebbb45e59739e8c9dd3ca08d23031a7fe573cd54&amp;amp;rid=giphy.gif&amp;amp;ct=g&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;It means you’re maxed out. Your pooler being there means at least you’re less likely to start erroring out right away, but transaction mode can’t fix a stuck query. For each of those 15 second chunks of time your concurrency basically went from 10k back down to 397.&lt;/p&gt;
&lt;p&gt;This is not the general behavior you’ll see when using PgBouncer unless you’ve really got some intermittent trouble with runaway queries. But it does emphasize an important point to remember: these are not real Postgres connections. Your upper bound on long running, active queries is always constrained by your actual pool of real Postgres connections.&lt;/p&gt;
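&lt;p&gt;If you’re in that situation, one way to see what’s tying up your real connections is to ask Postgres for its longest-running active queries:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Active queries, longest running first
SELECT pid, now() - query_start AS runtime, query
FROM pg_stat_activity
WHERE state = &#39;active&#39;
ORDER BY runtime DESC;
&lt;/code&gt;&lt;/pre&gt;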
&lt;h4 id=&#34;guarding-against-slow-queries&#34;&gt;Guarding against slow queries&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://www.crunchydata.com/blog/logging-tips-for-postgres-featuring-your-slow-queries&#34;&gt;Log your slow queries&lt;/a&gt; using &lt;code&gt;log_min_duration_statement&lt;/code&gt;. This option lets you set a threshold, and if a query takes longer than that threshold Postgres will log the offending query (see the sketch after this list). This won’t help the sudden mass slow query situation mentioned above, but it helps to keep an eye on overall app query health&lt;/li&gt;
&lt;li&gt;Use &lt;a href=&#34;https://www.postgresql.org/docs/current/libpq-single-row-mode.html&#34;&gt;streaming queries&lt;/a&gt; sparingly. In most client libraries you can set your query to run in “single row mode”. This means you retrieve your rows one at a time instead of getting one big result set at once. This is helpful for efficiency with very large result sets but is slower than a full result set query, and probably means you are running queries large enough to be slower in the first place&lt;/li&gt;
&lt;li&gt;Use &lt;a href=&#34;#statement-timeouts&#34;&gt;statement timeouts&lt;/a&gt;. This is tricky, especially when pooling, but see that section for ideas on how to approach it&lt;/li&gt;
&lt;li&gt;Spread out reads across read replicas&lt;/li&gt;
&lt;/ul&gt;
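&lt;p&gt;To sketch the logging option from the first bullet - assuming you have access to change server settings (managed services usually expose this through their own configuration instead):&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Log any statement that runs longer than 1 second
ALTER SYSTEM SET log_min_duration_statement = &#39;1s&#39;;
SELECT pg_reload_conf();
&lt;/code&gt;&lt;/pre&gt;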
&lt;h3 id=&#34;session-level-locks&#34;&gt;Session Level Advisory Locks 🔐&lt;/h3&gt;
&lt;p&gt;Session level advisory locks work fine in PgBouncer.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://i.gifer.com/7DWJ.gif&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Sorry 🙈.&lt;/p&gt;
&lt;p&gt;If you’ve read the previous sections you’ve already picked up on the pattern: “session” anything means it probably doesn’t work in &lt;a href=&#34;#transaction-mode&#34;&gt;transaction mode&lt;/a&gt;. But what does that matter to you?&lt;/p&gt;
&lt;p&gt;Advisory locks are a great option for creating simple, cross-process, cross-server application mutexes based on a provided integer key. Unlike the traditional locks you encounter elsewhere in Postgres, which are tied to tables or rows, advisory locks can be created independent of tables to control application-level concerns. There are plenty of other tools you could use for this job outside of Postgres, but since Postgres is already part of your tech stack it’s a convenient and simple option.&lt;/p&gt;
&lt;p&gt;Across languages a common use case for session level advisory locks is to hold a lock while database migrations (ie, DDL) are being run. For example:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- 1234 is arbitrary, it can be any integer
SELECT pg_advisory_lock(1234);
SET lock_timeout TO &#39;1s&#39;;
ALTER TABLE my_table...;
INSERT INTO migrations VALUES (1234567);
-- If we don&#39;t explicitly unlock here, the lock will be held until this 
--    connection is closed
SELECT pg_advisory_unlock(1234);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;If another connection went to acquire the same lock, it would be blocked:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- This will block indefinitely until the other connection is closed, 
--    or calls pg_advisory_unlock(1234)
SELECT pg_advisory_lock(1234);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This is largely an attempt to improve the consistency of migration tracking, and to help coordinate multi-process deploys:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Continuous deployment with the potential to trigger multiple deployments in succession&lt;/li&gt;
&lt;li&gt;Propagating code changes to multiple servers with deploy scripts automatically triggering migrations in each context&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;By waiting to acquire a lock at the Postgres level, each process waits for the first lock owner to finish before continuing, coordinating each process based on a shared lock key.&lt;/p&gt;
&lt;h3 id=&#34;once-more-with-strikefeelingstrike-pgbouncer&#34;&gt;Once more, with &lt;strike&gt;feeling&lt;/strike&gt; PgBouncer&lt;/h3&gt;
&lt;p&gt;Now for the obligatory example of trying the same thing when connected to PgBouncer 🫠:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Grab the lock on connection 1
SELECT pg_advisory_lock(1234);
-- Connection 1 goes back into pool
-- ...
-- Try to unlock on connection 2, which does not own the 1234 lock
SELECT pg_advisory_unlock(1234);
-- WARNING: you don&#39;t own a lock of type ExclusiveLock
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;We try to unlock, but because we’re on a different connection we can’t. The lock stays locked for as long as connection 1 stays alive, which means now no one else can acquire that lock unless that connection naturally closes at some point or is explicitly &lt;code&gt;pg_terminate_backend&lt;/code&gt;ed 😓.&lt;/p&gt;
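&lt;p&gt;If you suspect a stranded lock, &lt;code&gt;pg_locks&lt;/code&gt; can at least show you who is holding it:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Advisory locks currently held, and the sessions holding them
SELECT l.pid, l.classid, l.objid, a.state
FROM pg_locks l
INNER JOIN pg_stat_activity a ON a.pid = l.pid
WHERE l.locktype = &#39;advisory&#39;;
&lt;/code&gt;&lt;/pre&gt;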
&lt;h3 id=&#34;more-session-advisory-lock-use-cases&#34;&gt;More session advisory lock use cases&lt;/h3&gt;
&lt;p&gt;Outside of migrations, advisory locks can serve other use cases:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Application mutexes on sensitive operations like &lt;a href=&#34;https://rclayton.silvrback.com/distributed-locking-with-postgres-advisory-locks&#34;&gt;ledger updates&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://jeremydmiller.com/2020/05/05/using-postgresql-advisory-locks-for-leader-election/&#34;&gt;Leader election&lt;/a&gt; for maintaining a single but constant daemon operation across servers&lt;/li&gt;
&lt;li&gt;Exactly-once job execution controls for Postgres-based job systems like &lt;a href=&#34;https://github.com/bensheldon/good_job&#34;&gt;GoodJob&lt;/a&gt; and &lt;a href=&#34;https://github.com/que-rb/que&#34;&gt;Que&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If these things sound interesting or useful, they are! But only if you connect directly to Postgres.&lt;/p&gt;
&lt;h4 id=&#34;transaction-level-locks&#34;&gt;Transaction level locks&lt;/h4&gt;
&lt;p&gt;Advisory locks do have a transaction based companion:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- Process 1
BEGIN;
SELECT pg_advisory_xact_lock(1234);

-- Process 2 
-- Blocks while process 1 is in the transaction
SELECT pg_advisory_lock(1234);

-- Back in Process 1
SET LOCAL lock_timeout TO &#39;1s&#39;;
ALTER TABLE my_table...;
INSERT INTO migrations VALUES (1234567);
COMMIT; -- automatically unlocks on commit or rollback
-- Process 2 now can acquire the lock

-- If you need to manually unlock while still in the transaction 
-- SELECT pg_advisory_xact_unlock(1234);
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;You could use it as a replacement for certain scenarios, like the above migration operating transactionally. For custom purposes, it’s a good alternative!&lt;/p&gt;
&lt;p&gt;Unfortunately most migration tooling, things like leader election, and request or job lifetime locks, all use or require a longer lived lock than a single transaction could reasonably provide.&lt;/p&gt;
&lt;h4 id=&#34;turn-off-advisory-migration-locks&#34;&gt;Turn off advisory migration locks&lt;/h4&gt;
&lt;p&gt;If you need to run migrations against PgBouncer, in Rails you can turn them off with an &lt;code&gt;advisory_locks&lt;/code&gt; flag in &lt;code&gt;database.yml&lt;/code&gt;. Other migration tools likely have something similar. Do it at your &lt;em&gt;own&lt;/em&gt; peril 🤷🏻‍♂️&lt;/p&gt;
&lt;h4 id=&#34;maintaining-a-separate-direct-connection-to-postgres&#34;&gt;Maintaining a separate direct connection to Postgres&lt;/h4&gt;
&lt;p&gt;If the lock is critical, but the operations past the lock fan out and acquire multiple connections, you could potentially have two pieces:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A direct connection to Postgres where you acquire a session level advisory lock&lt;/li&gt;
&lt;li&gt;Your normal &lt;a href=&#34;#framework-pool&#34;&gt;code level connection pooling&lt;/a&gt; using your PgBouncer connections so it can capitalize on the scaling opportunities provided there&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;There’s an obvious downside - you’re consuming an extra direct connection and potentially impacting &lt;a href=&#34;#throughput&#34;&gt;throughput&lt;/a&gt; - but it’s an alternative available if needed.&lt;/p&gt;
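&lt;p&gt;In rough strokes - the lock key is arbitrary, just like before:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- On the dedicated, direct-to-Postgres connection:
SELECT pg_advisory_lock(1234);

-- ...fan out the actual work across pooled PgBouncer connections...

-- Back on the dedicated connection once everything finishes:
SELECT pg_advisory_unlock(1234);
&lt;/code&gt;&lt;/pre&gt;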
&lt;h3 id=&#34;listen-notify&#34;&gt;Listen / Notify 📣&lt;/h3&gt;
&lt;p&gt;Postgres comes out of the box with a handy pub/sub feature called &lt;a href=&#34;https://www.postgresql.org/docs/current/sql-listen.html&#34;&gt;LISTEN&lt;/a&gt;/&lt;a href=&#34;https://www.postgresql.org/docs/current/sql-notify.html&#34;&gt;NOTIFY&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;You simply call:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;LISTEN channel_name;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;And that connection will receive &lt;code&gt;NOTIFY&lt;/code&gt; events:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;NOTIFY channel_name, &#39;hi there!&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Like session level advisory locks, there are more robust pub/sub solutions out there. But the Postgres implementation works well, and you already have it available in your stack.&lt;/p&gt;
&lt;p&gt;Looking at the example, you’ll notice that the &lt;code&gt;LISTEN&lt;/code&gt; call is just a single statement, and it activates the listener for the current session. What have we said so many times already? Sessions bad. Transactions good&amp;hellip; kind of.&lt;/p&gt;
&lt;h4 id=&#34;kind-of&#34;&gt;kind of?&lt;/h4&gt;
&lt;p&gt;Similar to prepared statements, the docs are misleading when it comes to &lt;code&gt;LISTEN&lt;/code&gt;/&lt;code&gt;NOTIFY&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;PgBouncer officially lists &lt;code&gt;LISTEN&lt;/code&gt;/&lt;code&gt;NOTIFY&lt;/code&gt; as an unsupported feature in transaction mode, which is not precisely true. &lt;code&gt;LISTEN&lt;/code&gt; does not work in transaction mode, but &lt;code&gt;NOTIFY&lt;/code&gt; does.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;NOTIFY&lt;/code&gt; is a single statement, and doesn’t rely on any session semantics. It’s also transactional&lt;sup id=&#34;fnref:15&#34;&gt;&lt;a href=&#34;#fn:15&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;15&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;BEGIN;
NOTIFY channel_name, &#39;hi!&#39;;
ROLLBACK; -- no notification is sent
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Both &lt;code&gt;NOTIFY&lt;/code&gt; formats (inside and outside a transaction) work fine with transaction mode pooling. If you want to use pub/sub, you just need to make sure your &lt;code&gt;LISTEN&lt;/code&gt;er is connected directly to Postgres. Since &lt;a href=&#34;#transparency&#34;&gt;it can be hard to tell if you’re connected to Postgres or PgBouncer&lt;/a&gt; this is somewhat tricky, unfortunately.&lt;/p&gt;
&lt;p&gt;I’ve built implementations that &lt;code&gt;LISTEN&lt;/code&gt; on a direct, non-PgBouncer connection and &lt;code&gt;NOTIFY&lt;/code&gt; through PgBouncer. There’s not much writing on this approach, but I’ve found it to work well.&lt;/p&gt;
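&lt;p&gt;The shape of it is simple - the channel name is made up, what matters is which connection runs which statement:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;-- On a long-lived connection made directly to Postgres:
LISTEN job_events;

-- Anywhere else, including through PgBouncer in transaction mode:
NOTIFY job_events, &#39;job 42 finished&#39;;
&lt;/code&gt;&lt;/pre&gt;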
&lt;h3 id=&#34;single-threaded&#34;&gt;The single thread 🪡&lt;/h3&gt;
&lt;p&gt;In contrast to the multi process monster that is Postgres, PgBouncer runs on a paltry single process with a single thread.&lt;/p&gt;
&lt;p&gt;This means that no matter how capable a server is, PgBouncer will only ever utilize a single CPU core - once &lt;a href=&#34;https://news.ycombinator.com/item?id=17187436&#34;&gt;you’ve maxed out that core&lt;/a&gt;, you can’t scale that single instance any further.&lt;/p&gt;
&lt;p&gt;A popular option is to &lt;a href=&#34;https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers&#34;&gt;load balance PgBouncer instances&lt;/a&gt;. Beyond that, almost every alternative to PgBouncer (like Odyssey, PgCat and Supavisor) utilizes multiple cores.&lt;/p&gt;
&lt;p&gt;If you’re using a managed Postgres service (like Crunchy Data, Supabase, Neon or Heroku), your default option is PgBouncer as a connection pooler - so it will be up to those services to offer a load-balanced option.&lt;/p&gt;
&lt;h3 id=&#34;pg_dump&#34;&gt;pg_dump 🚮&lt;/h3&gt;
&lt;p&gt;If you’re running &lt;code&gt;pg_dump&lt;/code&gt; against PgBouncer, it’s probably by mistake.&lt;/p&gt;
&lt;p&gt;As far as I can tell, &lt;code&gt;pg_dump&lt;/code&gt; is broken when run against PgBouncer. See &lt;a href=&#34;https://github.com/pgbouncer/pgbouncer/issues/452&#34;&gt;https://github.com/pgbouncer/pgbouncer/issues/452&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The answer here is to make sure you’re using a direct connection to Postgres for utility operations like &lt;code&gt;pg_dump&lt;/code&gt;.&lt;/p&gt;
&lt;h3 id=&#34;unavailable&#34;&gt;Other unavailable features 🫥&lt;/h3&gt;
&lt;p&gt;&lt;img src=&#34;https://media0.giphy.com/media/VCZgfe90H1tMTAW6n4/giphy.gif?cid=6c09b9526e2d6dfd051e5257c6dbce5ac862293219ad6e76&amp;amp;rid=giphy.gif&amp;amp;ct=g&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;There are some remaining features which transaction mode is incompatible with as well&lt;sup id=&#34;fnref:16&#34;&gt;&lt;a href=&#34;#fn:16&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;16&lt;/a&gt;&lt;/sup&gt;. I have little or no experience with these:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;WITH HOLD CURSOR&lt;/code&gt; - a &lt;code&gt;WITH HOLD&lt;/code&gt; cursor continues to exist outside of the transaction that created it, which seems like it could have handy use cases, but I’ve never personally used it in my day to day&lt;/li&gt;
&lt;li&gt;PRESERVE/DELETE ROWS temp tables - temporary tables are a session-level feature, so they will not work properly; PRESERVE/DELETE ROWS are modifiers on how those temporary tables behave on commit, and are unsupported&lt;/li&gt;
&lt;li&gt;LOAD statement - this is for loading shared libraries into Postgres, so it makes sense that it’s not something you should be doing through a pooler. I haven’t actually tried, so I’m not sure if PgBouncer would stop you, but it requires superuser privileges, so it’s very unlikely your PgBouncer user has them&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;PgBouncer documents a simple “&lt;a href=&#34;https://www.pgbouncer.org/features.html&#34;&gt;SQL feature map for pooling modes&lt;/a&gt;” where you can see all the features mentioned in this post.&lt;/p&gt;
&lt;h2 id=&#34;linting&#34;&gt;Linting 🧶&lt;/h2&gt;
&lt;p&gt;Aside from having identified potential issues - what can we do to avoid them in an automated way?&lt;/p&gt;
&lt;p&gt;Surprisingly, not much exists. And by not much, I mean I’ve found nothing outside of advice.&lt;/p&gt;
&lt;p&gt;It makes me feel a bit like I’m exaggerating the importance of these issues. Maybe I’m the oddball that has actually encountered many of them in real production usage and had to address them. I’ve had statement timeouts and lock timeouts misapplied. I’ve had to deal with rearranging connections because of code using a session advisory lock and &lt;code&gt;LISTEN&lt;/code&gt;/&lt;code&gt;NOTIFY&lt;/code&gt;, or drop libraries that use them. I’ve had to remember to turn off prepared statements in my ORM to avoid named prepared statement errors.&lt;/p&gt;
&lt;p&gt;The implications can feel small, but they can be surprising - and particularly around migrations, they can cause real, serious downtime.&lt;/p&gt;
&lt;p&gt;We lint everywhere. As engineers we try to automate away as many mistakes as possible with linting and specs. As development teams grow, the importance of automation becomes critical to scaling because otherwise someone somewhere is going to do the wrong thing and it won’t get caught.&lt;/p&gt;
&lt;p&gt;Some ideas that would be great to see:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;PgBouncer optional process that detects bad queries and logs them&lt;/li&gt;
&lt;li&gt;RDS connection pinning behavior&lt;/li&gt;
&lt;li&gt;Static analysis tools for app queries&lt;/li&gt;
&lt;li&gt;Runtime extension to client libraries&lt;/li&gt;
&lt;li&gt;Making sure your development flow runs PgBouncer locally to try and encounter this behavior before running on production&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the Rails world there are &lt;a href=&#34;https://github.com/braintree/pg_ha_migrations&#34;&gt;several&lt;/a&gt; &lt;a href=&#34;https://github.com/doctolib/safe-pg-migrations&#34;&gt;active&lt;/a&gt; &lt;a href=&#34;https://github.com/ankane/strong_migrations&#34;&gt;gems&lt;/a&gt; devoted to keeping a codebase safe from issues that would cause downtime while migrating tables (ie, zero-downtime migrations). But across ecosystems I could not find anything related to protecting against PgBouncer issues.&lt;/p&gt;
&lt;p&gt;As a step in this direction, I’ve published a (currently experimental) gem for use in Rails/ActiveRecord apps called &lt;a href=&#34;https://rubygems.org/gems/pg_pool_safe_query&#34;&gt;pg_pool_safe_query&lt;/a&gt;. It will log warnings if SQL is run that is incompatible with PgBouncer and raise an error if advisory locks and prepared statements are not disabled.&lt;/p&gt;
&lt;h2 id=&#34;future-improvements&#34;&gt;Can we improve connections without a pooler?&lt;/h2&gt;
&lt;p&gt;A more recent development in Postgres 14 was improvements to &lt;a href=&#34;https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/analyzing-the-limits-of-connection-scalability-in-postgres/ba-p/1757266&#34;&gt;snapshot scalability&lt;/a&gt;, which seem to have resulted in big improvements in efficiently &lt;a href=&#34;https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/improving-postgres-connection-scalability-snapshots/ba-p/1806462#first-performance-improvements&#34;&gt;maintaining more idle connections&lt;/a&gt; in Postgres.&lt;/p&gt;
&lt;p&gt;It’s exciting to see effort being applied to increasing connection efficiency in Postgres itself. The author of that snapshot scalability improvement echoes my own frustrations:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Ideally Postgres would better handle traffic spikes without requiring a pooler&lt;/li&gt;
&lt;li&gt;Poolers cut out useful database features&lt;/li&gt;
&lt;li&gt;Postgres itself would ideally move towards architecture changes across several key areas, eventually culminating in a larger move towards a lighter weight process/thread/async model which better aligns with the C10k problem out of the box&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Most of the work in the industry &lt;em&gt;seems&lt;/em&gt; to concentrate on building better poolers, rather than improving the internals of Postgres connection handling itself&lt;sup id=&#34;fnref:17&#34;&gt;&lt;a href=&#34;#fn:17&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;17&lt;/a&gt;&lt;/sup&gt;. Outside of PgBouncer you’ve got RDS Proxy, Odyssey, PgCat, Supavisor, PgPool II and I’m sure others. All have their own benefits but suffer from the same transactional scaling limitations.&lt;/p&gt;
&lt;p&gt;In fairness to the &lt;em&gt;incredible&lt;/em&gt; work that goes into Postgres - every performance improvement they make in every new version is also a connection scalability improvement. If the queries, indexes, plans, and processes are making big performance gains with each version, then fewer connections can do more.&lt;/p&gt;
&lt;h2 id=&#34;alternatives&#34;&gt;PgBouncer alternatives&lt;/h2&gt;
&lt;p&gt;There are alternatives to PgBouncer, but the same transaction limitations apply to all of them: each has a transaction mode (or operates exclusively in transaction mode) that offers the best scaling. Once in transaction mode you can’t support most session-level features anymore, and you’re relying on the fact that database connections spend more time idle than active.&lt;/p&gt;
&lt;p&gt;Each has its own unique benefits in comparison, but the same fundamental transaction limitations:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/supabase/supavisor&#34;&gt;Supavisor&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/postgresml/pgcat&#34;&gt;PgCat&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/yandex/odyssey&#34;&gt;Odyssey&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://pgpool.net/mediawiki/index.php/Main_Page&#34;&gt;Pg Pool II&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://aws.amazon.com/rds/proxy/&#34;&gt;RDS Proxy&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;am-i-finally-done-with-this-post&#34;&gt;Am I finally done with this post?&lt;/h2&gt;
&lt;p&gt;I think I’ve said enough.&lt;/p&gt;
&lt;p&gt;Postgres is great. PgBouncer is important. Know what can go wrong and account for it.&lt;/p&gt;
&lt;p&gt;🐘 ✌🏼 🐘&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This &lt;a href=&#34;https://brandur.org/postgres-connections&#34;&gt;article from Brandur&lt;/a&gt; details some additional nuances of handling connections and pools, but these three are the higher level version of it&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Technically it doesn’t have to be a single instance - it could be a round robin of multiple PgBouncers - but from a client perspective you connect to a single one&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers&#34;&gt;https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;a href=&#34;https://elements.heroku.com/addons/heroku-postgresql&#34;&gt;It’s even lower&lt;/a&gt; on their lower powered options. It goes from 20, to 120, to 400, then 500 once you’re around their $400/mo plans.&lt;/p&gt;
&lt;p&gt;Supabase has no standard plans with &lt;a href=&#34;https://supabase.com/blog/supabase-pgbouncer&#34;&gt;more than 50 connections&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://render.com/docs/databases#connecting-to-your-database&#34;&gt;Render.com’s managed Postgres offering&lt;/a&gt; is based on memory available on each plan: 6 gigs or less is 97 connections, less than 10 gigs is 197 connections and over 10 gigs is 397 connections.&lt;/p&gt;
&lt;p&gt;This isn’t totally unreasonable - managing more connections requires more cores and more memory in a process based model especially. But at their highest tiers these services don’t exceed 500 available connections.&lt;/p&gt;
&lt;p&gt;More generalized services like Azure and Amazon RDS will let you go as high as you like, but that’ll go badly.&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Which is very exciting to see concentrated work on improving this aspect in Postgres internals! 🤘🏼&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This more recent Crunchy Data article on making sure your Postgres app is production ready implies the old standard of 500 connections is no longer accurate, so I’d be curious to know more specifics, since most resources still emphasize these numbers: &lt;a href=&#34;https://www.crunchydata.com/blog/is-your-postgres-ready-for-production&#34;&gt;https://www.crunchydata.com/blog/is-your-postgres-ready-for-production&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;AFAIK this is largely true of any database, and MySQL also has connection pooling solutions, but it does &lt;em&gt;seem&lt;/em&gt; to be particularly necessary with Postgres&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;There are a couple caveats to this statement. Just having a dedicated low latency pool is an improvement so may slightly help concurrency. PgBouncer can also proxy multiple databases so you could increase read concurrency at least this way.&lt;/p&gt;
&lt;p&gt;The queueing behavior of poolers can also be a benefit since you can wait for a connection to be available for longer, vs Postgres instantly rejecting the attempt: &lt;a href=&#34;https://www.percona.com/blog/connection-queuing-in-pgbouncer-is-it-a-magical-remedy/&#34;&gt;https://www.percona.com/blog/connection-queuing-in-pgbouncer-is-it-a-magical-remedy/&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:8&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;They do mention this in &lt;a href=&#34;https://www.pgbouncer.org/features.html&#34;&gt;their docs&lt;/a&gt;:&lt;/p&gt;
&lt;p&gt;&amp;gt; “Note that “transaction” pooling breaks client expectations of the server by design and can be used only if the application cooperates by not using non-working features.”&amp;#160;&lt;a href=&#34;#fnref:8&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:9&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;a href=&#34;https://github.com/pgbouncer/pgbouncer/issues/653&#34;&gt;https://github.com/pgbouncer/pgbouncer/issues/653&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://github.com/pgbouncer/pgbouncer/issues/249&#34;&gt;https://github.com/pgbouncer/pgbouncer/issues/249&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;It also just seems to be something they’re not interested in doing&amp;#160;&lt;a href=&#34;#fnref:9&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:10&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The idea is you fail and continue to retry. See this article for some framework agnostic approaches to retries: &lt;a href=&#34;https://postgres.ai/blog/20210923-zero-downtime-postgres-schema-migrations-lock-timeout-and-retries&#34;&gt;https://postgres.ai/blog/20210923-zero-downtime-postgres-schema-migrations-lock-timeout-and-retries&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:10&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:11&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I might have five nickels, but still, it happens. Also again I am grateful for anyone taking the time to write up content and share their expertise.&amp;#160;&lt;a href=&#34;#fnref:11&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:12&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;That’s a topic for another time&amp;hellip;&amp;#160;&lt;a href=&#34;#fnref:12&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:13&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;In addition to the Ruby pg gem, it’s used by the Python &lt;a href=&#34;https://github.com/psycopg/psycopg&#34;&gt;psycopg&lt;/a&gt; lib and the Node &lt;a href=&#34;https://github.com/brianc/node-libpq&#34;&gt;node-libpq&lt;/a&gt; package (and I’m sure many others). So it seems like most client libraries handle things safely enough at the protocol level to turn off prepared statements&lt;/p&gt;
&lt;p&gt;If you are using Go with the pure Go lib/pq driver, &lt;a href=&#34;https://github.com/lib/pq/issues/889&#34;&gt;see this issue&lt;/a&gt; for how to properly handle unnamed statements. The &lt;a href=&#34;https://github.com/launchbadge/sqlx/issues/368&#34;&gt;Rust sqlx library&lt;/a&gt; seems to have a similar issue. It seems that if a library does not use libpq, it ends up in a bit of pain when trying to work with PgBouncer&amp;#160;&lt;a href=&#34;#fnref:13&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:14&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;Named prepared statements can boost performance for repetitious queries because they bypass the Parse call on subsequent runs. That’s their primary benefit in comparison to unnamed statements&amp;#160;&lt;a href=&#34;#fnref:14&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:15&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;code&gt;LISTEN&lt;/code&gt; can be &lt;a href=&#34;https://www.postgresql.org/docs/current/sql-listen.html&#34;&gt;called in a transaction as well&lt;/a&gt;, but all that means is the session level listen won’t be triggered until the transaction commits, and won’t start listening at all if a rollback is triggered&amp;#160;&lt;a href=&#34;#fnref:15&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:16&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I do find myself asking “what is the point of nice features if you can’t use them at scale because of transaction mode pooling”? Not being able to use certain features at scale should never preclude them from being built - but it’s a disappointing reality&amp;#160;&lt;a href=&#34;#fnref:16&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:17&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I’m a &lt;em&gt;little&lt;/em&gt; afraid to have made this statement and the potential for someone to come back at me angry about this being an oversimplification 😅&amp;#160;&lt;a href=&#34;#fnref:17&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2023/e0723a5982.png)

&gt; Updated **2024-09-17** to reflect updated PgBouncer support for protocol-level prepared statements 🐘

To start, I want to say that I’m appreciative that PgBouncer exists and the work its open source maintainers put into it. I also love working with PostgreSQL, and I’m thankful for the incredible amount of work and improvements that go into it as well.

I also think community and industry enthusiasm around Postgres is at an all time high. There are more managed hosting options than ever ([Crunchy Data](https://www.crunchydata.com), [Render](https://render.com/docs/databases), [Fly.io](https://fly.io/docs/postgres/), and on and on), deep extensions like [PostgresML](https://postgresml.org), [Citus](https://www.citusdata.com) and [Timescale](https://www.timescale.com), serverless options like [Neon](https://neon.tech), and real-time services like [Supabase](https://supabase.com) with Postgres at their center. Postgres is a robust, advanced and _fast_ RDBMS capable of handling the needs of most every application.

I just find the current state of recommendations and guidance around scaling Postgres to be confounding. And it feels surprising for new Postgres users to discover that one of the most common scaling options relies on a solution like PgBouncer.

Over the years I’ve read dozens of articles around scaling and maintaining Postgres databases, and they always understate the impact of PgBouncer on your application. They casually mention unusable features without any exploration, or the numerous ways you can silently break expected query behavior. The advice is just to turn it on. **I want it to be clear that as your application scales, PgBouncer is often necessary but isn’t free**.

The following sections provide an overview of what connection pooling is in general, how connection pooling modes work in PgBouncer and similar tools, and then I dig into every Postgres feature that does not work in PgBouncer transaction mode and what the implications are. This is the PgBouncer article I wish existed the first time I used it - let’s get going 🐘!

### Contents
- [What is connection pooling?](#connection-pooling)
- [Why do I need a separate tool from Postgres?](#separate-tool)
	- [Framework pooling](#framework-pool)
	- [Client proxy pooling](#client-pool)
	- [Server proxy pooling](#server-pool)
- [Can I just turn on PgBouncer and get scaling for free?](#turn-it-on)
	- [Session mode](#session-mode)
	- [Statement mode](#statement-mode)
	- [Transaction mode](#transaction-mode)
- [Perils](#perils)
	- [Detecting invalid statements 😑](#invalid-statements)
	- [Lock timeouts (SET/RESET) 🔓](#lock-timeouts)
	- [Statement timeouts (SET/RESET) ⏳](#statement-timeouts)
	- [Transparency 👻](#transparency)
	- [Prepared Statements (PREPARE/DEALLOCATE, Protocol-level prepared plans) ✔️](#prepared-statements)
	- [Pool throughput / Long running queries 🏃‍♂️](#throughput)
	- [Session Level Advisory Locks 🔐 ](#session-level-locks)
	- [Listen/Notify 📣](#listen-notify)
	- [The single thread 🪡 ](#single-threaded)
	- [pg\_dump 🚮](#pg_dump)
	- [Other unavailable features 🫥](#unavailable)
- [Linting 🧶](#linting)
- [Can we improve connections without a pooler?](#future-improvements)
- [PgBouncer alternatives](#alternatives)

&lt;h2 id=&#34;connection-pooling&#34;&gt;What is connection pooling?&lt;/h2&gt;

PgBouncer is a lightweight connection pooler for PostgreSQL. What does that mean exactly? What is connection pooling and why is it needed?

Opening a connection is expensive: a new Postgres client connection involves TCP setup, process creation and backend initialization – all of which are costly in terms of time and system resources. A connection pool keeps a set of connections available for reuse so we can avoid that overhead past initial connection.

There are three main levels of connection pooling[^1]:

&lt;span id=&#34;framework-pool&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/0c4ffd30c0.jpg)

**Framework connection pooling**. This is a common feature of many frameworks/libraries. Within a given process, you maintain a pool of active connections that are shared between code, generally running across threads. Whenever you handle some processing in a server request, a background process, a job, etc., you open a connection and keep that connection open. When that piece of work finishes and a new piece of work starts, you can reuse the connection without the expense of opening a new connection to the database every single time. These connections are usually local to a particular operating system process, so you gain no benefit outside of that process (and if you’re scaling Postgres, you probably have lots of processes).

&lt;span id=&#34;client-pool&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/d281d0294d.jpg)

One level higher, you can have **client level connection pooling** outside of your code. PgBouncer can handle this: instead of independent, unsharable, process-isolated connections, you proxy all of your connections through PgBouncer. But it runs on your server, so you still cannot share connections between servers (and again, by the time you need this you probably have lots of servers).

&lt;span id=&#34;server-pool&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/cd987b4e04.jpg)

**Server level connection pooling**. Here we host PgBouncer independent of our servers and connect to a single central PgBouncer instance[^2]. This is the most robust form of connection pooling because independent of anything else in your code or server, you are guaranteed that any client connection is coming from the pool.

&lt;h2 id=&#34;separate-tool&#34;&gt;Why do I need a separate tool from Postgres?&lt;/h2&gt;

That’s all great but... why do we need it?

There are two primary layers to this:

1. Maintaining connections is beneficial as a base feature. Less memory and IO churn, less latency before running queries. Less pressure on the database from constantly opening and closing connections.
2. Postgres connections get expensive very quickly. _Surprisingly_ quickly.

Here are some general community guidelines around allowable Postgres connection counts based on a mixture of community experience and specific benchmarking:

- In terms of what some managed services even offer: [Supabase](https://supabase.com/blog/supabase-pgbouncer) offers a max of _50_ connections, [Neon](https://neon.tech) offers a max of _100_ connections, and [Render](https://render.com/docs/databases#connecting-to-your-database) offers a max of 397 connections.
- The general upper bound recommendation is a _max_ of 500 active connections. Services like [Heroku Postgres](https://elements.heroku.com/addons/heroku-postgresql) even _enforce_ a hard limit of 500 connections[^3]
- Even at 500 connections, your server is going to be strained. [This more recent (as of 2023) EnterpriseDB article](https://www.enterprisedb.com/postgres-tutorials/why-you-should-use-connection-pooling-when-setting-maxconnections-postgres) analyzed connection performance and found that 300-400 active connections seems optimal. This [article from Brandur](https://brandur.org/postgres-connections) is older (2018) but seems to reinforce this idea as well
- There have been [some more recent connection improvements in Postgres](https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/improving-postgres-connection-scalability-snapshots/ba-p/1806462) (as of version 14) handling idle connections more efficiently[^4], but active connections are still expensive[^5] and idle connections have not reached the scale of a dedicated pooler
- The reality is that 500 connections sounds extremely low, but those connections can handle _a ton of throughput_. The _problem_ is that, as a metric of pure concurrency, real connections have a hard upper limit. So if you try to have five thousand clients connect simultaneously, you’re going to start getting loads of connection errors[^6].

To reduce connection overhead, general connection pooling is helpful, and a PgBouncer instance in its default session-based mode can handle it. But to improve concurrency, things have to get a bit... _quirky_.

There are two modes in PgBouncer which give clients access to more connections than Postgres _actually_ has available. They rely on the idea that at any given time many of your connections are idle, so you can free up usage of idle connections to improve concurrency.

&lt;h2 id=&#34;turn-it-on&#34;&gt;Can I just turn on PgBouncer and get scaling for free?&lt;/h2&gt;

Kind of? But not really? It’s complicated.

Internally, PgBouncer will manage a pool of connections for you. The default pooling mode it starts with, session pooling, is conservative, and in most cases will not provide improved concurrency[^7].

I’m going to hand wave a bit past two of the modes and focus on the typical recommendation.

&lt;span id=&#34;session-mode&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/7391257b1c.jpg)

**Session mode** is the default and most conservative mode. This is a 1:1 mapping - your local connection truly holds onto a full connection until you close it. This does little to help you scale connection concurrency, but it helps with latency and connection churn overhead.

&lt;span id=&#34;statement-mode&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/6e486ad1b1.jpg)

**Statement mode** is the most aggressive mode and means your connection goes back into the pool after _every statement_. You lose the ability to use transactions 😰 - that seems wild, and leaves it usable only for the most specialized of use cases.

&lt;span id=&#34;transaction-mode&#34;&gt;&lt;/span&gt;

![](https://cdn.uploads.micro.blog/98548/2023/9e0451a965.jpg)

The mode that results in a more sane balance of improved concurrency and retained critical database features is **transaction mode**. Transaction mode means your connection stays consistent as long as you’re in a transaction. Once the transaction finishes, your code _thinks_ it still has a real connection, but PgBouncer actually releases the connection back into the pool internally. This is _session sharing_: your session is going to be shared with other connections without being reset or closed.

Transaction mode is a powerful concept. Your code in general has lots of database downtime. Most code does not solely operate on the database - it takes CPU cycles, interacts with files, makes network calls, and calls other data stores. During that time, your connection sits idle and unused for what in computing and database terms is an eternity. By releasing it back into the pool outside of transactions, you free up your idle connection for use by a client who actually needs it. This way your 500 available connections can service thousands of clients, instead of being 1:1 with the number of available connections.

	-- connection is actually pulled from the pool inside PgBouncer
	BEGIN;
	INSERT INTO...;
	UPDATE;
	COMMIT;
	-- connection goes back to the pool inside PgBouncer

The problem with transaction mode is that this tiny configuration change quietly changes not only your ability to scale, but also the way your connections _behave_. It breaks the expected command semantics between client and database server. And understanding whether you’ve gotten things right in transaction mode is _very difficult_. 

Let’s say you’ve been operating with PgBouncer in session mode (or operating without a proxy at all), and you make the switch to transaction mode. Your perspective on how you can use Postgres needs to change - so now we’re onto the _peril_.

&lt;h2 id=&#34;perils&#34;&gt;Perils&lt;/h2&gt;

Many of the following items are documented shortcomings of PgBouncer in [transaction mode](#transaction-mode). But:

1. They’re treated lightly
2. Their repercussions and downsides are not discussed
3. PgBouncer is often recommended without mentioning them
4. PgBouncer is often recommended at the same time as recommending incompatible transaction mode features like session level advisory locks and session level statement timeouts
5. The non-determinism introduced by using incompatible statements is not discussed (ie, I execute a statement in Process A and suddenly Process B errors out due to it)

Assume anytime I mention PgBouncer after this point I am referring to [transaction mode](#transaction-mode). Here we go!

&lt;h3 id=&#34;invalid-statements&#34;&gt;Detecting invalid statements 😑&lt;/h3&gt;

PgBouncer happily accepts statements that are not supported in transaction mode. The problem is pushed onto the developer, which means they _can_ and _will_ get it wrong[^8].

This is by design. The sense I get is that PgBouncer was specifically architected to not analyze any statements and so it would be a big change for them to handle this[^9]. 

Amazon has a similar tool to PgBouncer called RDS Proxy, and it has a feature called “[connection pinning](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/rds-proxy-managing.html#rds-proxy-pinning)”. If it detects a statement that is incompatible with transaction mode, it will automatically hold that connection for that client for the duration of their session. 

This is both highly useful and simultaneously problematic. It means query behavior is consistent with your expectations (🙌🏼) but also that you can silently kill all concurrency benefits (😑). If enough queries are run that trigger connection pinning, all of a sudden you may throttle your [throughput](#throughput). But it does give you an escape hatch for safely running statements which are not transaction compatible without having to jump through any hoops.

I’d be fine with some logging I could monitor. As far as I can tell there is nothing like this in PgBouncer, and so all the burden lands on you to get it right. As one engineer, or a few engineers, all aware of potential issues, you can probably maintain that. But what about dozens of engineers? Or hundreds? Thousands? All with varying levels of experience with databases and poolers? There are going to be mistakes.

&lt;h3 id=&#34;lock-timeouts&#34;&gt;Lock Timeouts (SET/RESET) 🔓&lt;/h3&gt;

Unless you _like_ app downtime, you should be using a `lock_timeout` when running [DDL](https://www.postgresql.org/docs/current/ddl.html). It’s a critical aspect of [zero downtime migrations](https://medium.com/paypal-tech/postgresql-at-scale-database-schema-changes-without-downtime-20d3749ed680).

The idea is to set it to a limit that queries in your application could acceptably slow down by - waiting to acquire a lock can cause related queries to queue up behind your DDL operation:

	-- slow select
	SELECT * FROM my_table;
	
	-- DDL starts in separate process, blocked on acquiring the lock by the 
    --    slow query
	ALTER TABLE my_table...
	
	-- Subsequent queries start queuing up...
	SELECT * FROM my_table WHERE id = 123;
	SELECT * FROM my_table WHERE id = 234;
	--- ...

In that scenario, the slow query is running the show. Until it finishes, _all the other queries to that table are stuck_. That goes on long enough and users can’t use the system. A bit longer and your app starts timing out. A bit longer and you’re running out of _connections_. Now you’re staring at a total app outage, about ready to kill all of your connections in a desperate attempt to salvage things, contemplating a career change to landscaping where you can at most impact one person at a time, right? That sounds nice, doesn’t it?

I’ve of course never experienced that. I’m just _very_ creative 💀. But if _you_ have experienced that, or you’d like to _avoid_ experiencing that, use a `lock_timeout`:

	SET lock_timeout TO &#39;2s&#39;;

_Now_ if your DDL cannot acquire a lock it will throw an error after 2 seconds. That should be an ok delay in running queries, and you can retry the operation later. 

But wait! Are you connected to PgBouncer?! You may want to bring up that landscaping help-wanted listing again... 🌳

`SET` operations apply at the [session level](#session-mode). This means that on a PgBouncer connection, there is no guarantee our `lock_timeout` will still be applied when we run our DDL:

	-- Process 1
	-- PgBouncer pulls connection 1
	SET lock_timeout TO &#39;2s&#39;;
	-- connection 1 goes back to the pool
	
	-- Meanwhile, in Process 2:
	-- PgBouncer pulls connection 3
	SELECT id FROM my_table, pg_sleep(30);
	
	-- Back in Process 1:
	-- PgBouncer pulls connection 2
	-- This connection has no lock_timeout set, so it will hang 
    --    until our pg_sleep query finishes 30 seconds later, and all
    --    queries to my_table after it are stuck for those 30 seconds as well
	ALTER TABLE my_table...

It’d be easy to argue “don’t have slow queries”. And that should be the goal! But we don’t call it “happy path uptime 🌼”, we call it “_zero_ downtime”. It means even if things go wrong, you don’t go down. There can also be other operations that hold a lock on your table, so you simply can’t rely on successfully acquiring that lock[^10].

So what can we do? There are two options:

1. Bypass PgBouncer and go straight to the database
2. Use a transaction level `lock_timeout`

#### Bypassing PgBouncer
Your safest bet is to go with option (1). You should have some ability to directly connect to your database, so take advantage of it and don’t jump through hoops to run DDL safely. 

The biggest obstacle you hit with (1) is [transparency](#transparency): PgBouncer _really_ doesn’t want you to know whether you are connected to the real database or not. There’s no _easy_ answer there, but if you validate a setup where you consistently run your DDL process directly against Postgres, you’re set.

#### Use transaction level statements
There is a transaction local equivalent to the `SET` statement: `SET LOCAL`. Using our example from earlier:

	-- Process 1
	-- PgBouncer pulls connection 1
	BEGIN;
	SET LOCAL lock_timeout TO &#39;2s&#39;;
	-- connection 1 stays checked out
	
	-- Meanwhile, in Process 2:
	-- PgBouncer pulls connection 3
	SELECT id FROM my_table, pg_sleep(30);
	
	-- Back in Process 1:
	-- Connection 1 is still checked out
	ALTER TABLE my_table...
	-- lock_timeout raises an error after 2 seconds waiting, and 
    --    we avoid our downtime!

DDL in Postgres is transactional, so it’s valid to start our transaction, set our `lock_timeout` using `SET LOCAL`, then start our DDL operation. The transaction local setting sticks with us until the transaction commits or rolls back, so we safely keep our timeout, and if it fires, our DDL rolls back with the transaction.

It’s not a _terrible_ solution (1 is still better), except for two things:

1. Concurrent indexes
2. Tooling

Another zero downtime star is the concurrent index. When you create a new index on a table you run the chance of locking it up long enough to cause downtime. Here’s the answer to that problem:

	CREATE INDEX CONCURRENTLY index_name ON my_table;

Concurrent indexes are created without an exclusive lock, so your normal operations keep going while it builds the index in the background. The _problem_ is they can’t be run in a transaction, so `SET LOCAL` is not an option.

Because they don’t require an exclusive lock, setting a `lock_timeout` is less important. But if there is contention and you just can’t get that index to acquire its lock, do you really want it to run forever?
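
A hedged sketch of what that looks like on a _direct_ (non-pooled) connection, where session level `SET` is safe again - the index and column names here are placeholders:

	-- On a connection that bypasses PgBouncer entirely
	SET lock_timeout TO &#39;2s&#39;;
	-- Waits at most 2 seconds for its lock before erroring.
	-- Note: a failed CONCURRENTLY build leaves an INVALID index
	--    behind that you should drop before retrying
	CREATE INDEX CONCURRENTLY index_name ON my_table (some_column);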

As for (2), popular tooling usually does not handle `SET LOCAL` for you. In the Rails/ActiveRecord world there are several libraries that will automatically apply zero downtime policies for you, but they all assume you have an exclusive connection and operate at the `SET` session level.

[In PgBouncer, the road to downtime is paved with session level statements](https://en.m.wikipedia.org/wiki/The_road_to_hell_is_paved_with_good_intentions).

Just go with (1), keep your sanity, throw away the diary entries about living out your days breathing in the smell of fresh cut grass, and connect directly to Postgres to run DDL with `SET lock_timeout` calls.

&lt;h3 id=&#34;statement-timeouts&#34;&gt;Statement timeouts (SET/RESET) ⏳&lt;/h3&gt;

Determined not to repeat your experiences from `lock_timeout`, you read about this thing called `statement_timeout`. This little magic wand makes it so you control how long a statement is allowed to run 🪄.

So here it is:

	SET statement_timeout TO &#39;2s&#39;;

Those greedy queries now don’t stand a chance. You can tame your long running queries and avoid blocking your DDL! You ignore my advice to always use `lock_timeout`, say “bye losers” to long running queries, and fire off that DDL again... oh god. Why are things slowing down. Now they’re timing out. _The connections are filling up._ What is _happening?_

![](https://media.tenor.com/MYZgsN2TDJAAAAAC/this-is.gif)

Oh riiiight. You forgot. You’re using PgBouncer. `SET` is off the table. Should have set that `lock_timeout` 🔐...

If I had a nickel for every time someone mentioned `SET statement_timeout` and PgBouncer in the same article...[^11] I know no one sharing this content is doing it maliciously, but be aware that these are misleading and incompatible features.

#### With lock\_timeout, why does statement\_timeout even matter?
- Statement timeouts are helpful for long running queries so they cancel earlier. If a client disconnects, Postgres will periodically check for the connection and try to cancel the query when it goes away. But a query [with runaway cpu](https://dba.stackexchange.com/a/81424/256107) usage will just keep running even if the client dies or disconnects. That means you lose that connection until the query finishes, which can take minutes (or hours)
- The database default is 0, which means there is no limit. In some contexts this is not a problem, but particularly for web requests this is excessive

The first time I used `statement_timeout` was from a blog recommendation to limit statements for requests in web applications. In a web request, you usually have an upper limit on how long you allow them to run before they time out - this conserves resources, protects against runaway buggy code and helps with bad actors. It made sense that I’d set it to something conservative on all my web connections to deal with long running queries.

I deployed the code and for a little while things seemed to work well. Then I saw something odd. This started popping up:

	canceling statement due to statement timeout

But in my... _background jobs_? My web requests were tuned to be fast, but the constraints around my background processes were a bit... looser. Can you guess what I had recently enabled? PgBouncer in transaction mode. My session level statement timeout was being swapped out from my web request, picked up by my job, and caused my job to timeout instead - web request safety was off the rails and longer running jobs were intermittently failing.

So is there any way we can use it? There are a couple of ways I know of, but nothing great when pooling.

#### Our old friend transaction
	BEGIN;
	SET LOCAL statement_timeout TO &#39;5s&#39;;
	SELECT ...
	COMMIT;

Something about wrapping a SELECT in a transaction feels kind of strange, but it works. If you have targeted concerns, you can wrap particular queries in a transaction and use `SET LOCAL` to localize the `statement_timeout`. 

This is absolutely not a viable solution for a whole request lifecycle. If I wanted to attempt my web request level timeouts again, no way am I wrapping every web request in one giant transaction. Postgres doesn’t have a concept of nested transactions so any code I have that may be operating transactionally is gonna be in for some confusing surprises[^12]. And most importantly, wrapping my whole request in a transaction means I’ve completely negated the benefit of proxy pooling - now my request lifecycles are basically 1:1 with my connection sessions.

#### Apply statement timeouts per user
I’ve never tried it, but I’ve seen it recommended to set statement timeouts per user when using PgBouncer. That approach has a couple of problems I can think of:

1. It’s not dynamically configurable.
2. It dilutes the pool of available connections per context

(1) is definitely inconvenient. If you have different contexts where you’d like to apply different timeout constraints, this would be way too cumbersome to maintain.

But (2) _feels_ like a deal breaker. If I want to constrain my web requests to a conservative timeout, but give my background processes more wiggle room, my pool size of real connections is now split instead of sharing a pool of total available database connections. I also have to manage making sure each context uses the appropriate user, or things will go badly.

It’s technically an option, but seems trickier to maintain and monitor.
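
For reference, the per-user approach looks roughly like this - a sketch assuming hypothetical `web_user` and `worker_user` roles:

	-- Settings attached to a role apply whenever a new session starts
	--    for that role, so they survive transaction pooling
	ALTER ROLE web_user SET statement_timeout = &#39;5s&#39;;
	ALTER ROLE worker_user SET statement_timeout = &#39;5min&#39;;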

&lt;h3 id=&#34;transparency&#34;&gt;Transparency 👻&lt;/h3&gt;

![](https://media2.giphy.com/media/xT5LMN0UgalbScp6uI/giphy.gif?cid=6c09b952f7248c90c21812529981462733f1d648a5076839&amp;rid=giphy.gif&amp;ct=g)

&gt; I don’t understand why my session features aren’t working. I always make sure to use plenty of Postgr...PgBouncer?!

It is very difficult to tell when you are or aren’t using PgBouncer, [which is unfortunately by design](https://github.com/pgbouncer/pgbouncer/issues/249). It considers itself a transparent proxy. In [session mode](#session-mode), that’s pretty much true. But in [transaction](#transaction-mode) and [statement](#statement-mode) mode you are working with bizarro Postgres. It all works the same except when it doesn’t.

So if you want a regular connection because you need a feature not available in transaction mode, being sure you did it right is extremely difficult.

I have had a hell of a time verifying that some servers are or aren’t running with PgBouncer. Server A is using pub sub, I don’t want it. Server B needs the throughput, I want it. How can I make sure someone never makes a mistake and attaches the server to the wrong place? Basically, I can’t.

When it comes to production code I like to be paranoid. On a large enough codebase, and team, and user base, unusual things are bound to happen, sometimes regularly. I try to write code and configure environments so the right way is easy and the wrong way is hard. PgBouncer does not make that easy.

On this particular point I’d love to say I have some kind of advice to act on, but it mostly takes testing and validating your setup. If someone out there has better ideas or tips, I am all ears.
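
One crude probe I can offer - a heuristic sketch, not a guarantee - leans on the convention that PgBouncer listens on port 6432 while Postgres listens on 5432. `inet_server_port()` reports the port of the backend’s own socket, which through PgBouncer is the Postgres port rather than the one your client dialed:

	-- Run from your app&#39;s connection, then compare against the port
	--    in your client configuration
	SELECT inet_server_port();
	-- If your client config says 6432 but this returns 5432,
	--    something (probably PgBouncer) is sitting in the middle

If every environment happens to use the same port, this tells you nothing - which is why testing and validating the setup remains the real answer.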

&lt;h3 id=&#34;prepared-statements&#34;&gt;Prepared Statements (PREPARE/DEALLOCATE, Protocol-level prepared plans) ✔️&lt;/h3&gt;

&gt; 📝 Update as of PgBouncer version 1.21 - protocol-level prepared statements are now supported! See [this crunchy data article](https://www.crunchydata.com/blog/prepared-statements-in-transaction-mode-for-pgbouncer) for more specifics. `PREPARE` style statements will still never be supported, and there are still some gotchas with protocol-level support you can learn about in that article.
&gt; 
&gt;  Everything mentioned here is still relevant for versions &lt; 1.21. As well, it still gives a thorough explanation of why &#34;turning off&#34; prepared statements while still utilizing `libpq` (as most libraries do) is a-ok ✌️

PgBouncer has a public relations problem when it comes to prepared statements. This is all the [PgBouncer docs say](https://www.pgbouncer.org/features.html) about them:

&lt;table&gt;
	&lt;thead&gt;
		&lt;tr&gt;
			&lt;th&gt;
				Feature
			&lt;/th&gt;
			&lt;th&gt;
				Session pooling
			&lt;/th&gt;
			&lt;th&gt;
				Transaction pooling
			&lt;/th&gt;
		&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;
				`PREPARE` / `DEALLOCATE`
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				Never
			&lt;/td&gt;
		&lt;/tr&gt;
		&lt;tr&gt;
			&lt;td&gt;
				Protocol-level prepared plans
			&lt;/td&gt;
			&lt;td&gt;
				Yes
			&lt;/td&gt;
			&lt;td&gt;
				No*
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&gt; \* It is possible to add support for that into PgBouncer


Kind of feels... alarming. No prepared statements in transaction mode?! Aren’t those... important? Even further, when you go to use PgBouncer with Hibernate or ActiveRecord (and I’m sure others), you’ll see the recommendation to configure them to _turn off_ prepared statements 😱. Does it surprise you a bit to hear that? Make you feel a little queasy maybe?

I had it drilled into me early in my career that prepared statements are a critical part of protecting against SQL injection. In the [OWASP SQL Injection Prevention Cheatsheet](https://cheatsheetseries.owasp.org/cheatsheets/SQL_Injection_Prevention_Cheat_Sheet.html) the very first recommendation is:

- **Use of Prepared Statements (with Parameterized Queries)**

So PgBouncer tells me I need to _turn them off?_

![](https://thumbs.gfycat.com/AstonishingEarlyHornet-size_restricted.gif)

The first time I used PgBouncer in an application I spent _a lot_ of time figuring out how turning off prepared statements was safe to do. It turns out that prepared statements in Postgres mean a few things, but come down to two main options:

1. Named prepared statements
2. Unnamed prepared statements

_Named_ prepared statements are reusable, and are tied to the connection session.

_Unnamed_ prepared statements are single use, and have no association to the connection session.

There are two ways to create a _named_ prepared statement and one way to create an _unnamed_ prepared statement:

1. `PREPARE`
2. Protocol-level Parse/Bind/Execute with a name specified
3. Protocol-level Parse/Bind/Execute with no name specified

PgBouncer says it doesn’t support prepared statements in either `PREPARE` or protocol-level format. What it _actually_ doesn’t support are _named_ prepared statements in any form. That’s because named prepared statements live in the session and in [transaction mode](#transaction-mode) you can switch sessions.

	-- PgBouncer pulls connection 1
	PREPARE bouncer_since (int, timestamp) AS
	SELECT * 
	FROM bouncers b
	INNER JOIN guests g ON g.bouncer_id = b.id
	WHERE b.id = $1 AND b.created &gt; $2;
	-- connection 1 goes back to the pool
	
	-- PgBouncer pulls connection 2
	EXECUTE bouncer_since(1, now() - INTERVAL &#39;2 days&#39;);
	-- 💣 ERROR: prepared statement &#34;bouncer_since&#34; does not exist 💣

But _unnamed prepared statements are totally fine_. In fact, I’d be shocked if the current client library you’re using to connect to Postgres does not already switch to them if “prepared statements” (again, so damn misleading) are “turned off”.

But wait. What the heck is an unnamed statement? `PREPARE` _requires_ a name... how can I make a prepared statement without a name?

#### Protocol-level prepared plans
![](https://media0.giphy.com/media/P5wPrhzZDdeJW/giphy.gif?cid=6c09b95256738d3ee35e24f988a790f60659836b97f75ee8&amp;rid=giphy.gif&amp;ct=g)

The alternative to the `PREPARE` statement is to directly communicate with Postgres at the protocol level.

I had to dig a bit to get a handle on this - I started from a common Ruby ORM called ActiveRecord, dug into the Ruby “pg” gem _it_ uses, then went one layer deeper into `libpq`, which is part of Postgres itself.

If we use active record as an example, [when prepared statements are “disabled”](https://guides.rubyonrails.org/configuring.html#configuring-a-postgresql-database), the postgres adapter internally calls `exec_no_cache` in `activerecord/lib/active_record/connection_adapters/postgresql_adapter.rb`:

    def exec_no_cache(sql, name, binds...)
      #...
      conn.exec_params(sql, type_casted_binds)

That&#39;s powered by the ruby “pg” gem, which when calling `exec_params` from ruby ultimately calls into the `libpq` function `PQsendQueryParams`:

    // Ruby &#34;pg&#34; gem
    // ext/pg_connection.c
    static VALUE
    pgconn_async_exec_params(int argc, VALUE *argv, VALUE self) {}
	
    // internally calls...
    static VALUE
    pgconn_send_query_params(int argc, VALUE *argv, VALUE self) {}
	
    // internally calls this from the libpq c postgres internals:
    // src/interfaces/libpq/fe-exec.c
    int PQsendQueryParams(PGconn *conn,
      const char *command,
      int nParams,
      const Oid *paramTypes,
      const char *const *paramValues,
      const int *paramLengths,
      const int *paramFormats,
      int resultFormat) {}

What does `PQsendQueryParams` do? It calls an internal method named `PQsendQueryGuts`. Notice the empty string and `use unnamed statement` comment 🤔.

```c
return PQsendQueryGuts(conn,
    command,
    &#34;&#34;, /* use unnamed statement */
    nParams,
    paramTypes,
    paramValues,
    paramLengths,
    paramFormats,
    resultFormat);
```
What does _that_ function do (aside from making me laugh every time I read the name `PQsendQueryGuts` 😆)? Internally `PQsendQueryGuts` communicates with Postgres at the protocol level:

	/* construct the Parse message */
	if (pqPutMsgStart(&#39;P&#39;, conn) &lt; 0 ||
	  pqPuts(stmtName, conn) &lt; 0 ||
	  pqPuts(command, conn) &lt; 0) {}
	
	/* Construct the Bind message */
	if (pqPutMsgStart(&#39;B&#39;, conn) &lt; 0 ||
	  pqPuts(&#34;&#34;, conn) &lt; 0 ||
	  pqPuts(stmtName, conn) &lt; 0) {}
	
	/* construct the Execute message */
	if (pqPutMsgStart(&#39;E&#39;, conn) &lt; 0 ||
	  pqPuts(&#34;&#34;, conn) &lt; 0 ||
	  pqPutInt(0, 4, conn) &lt; 0 ||
	  pqPutMsgEnd(conn) &lt; 0) {}

This is the Parse/Bind/Execute process I mentioned earlier.

- The code sends a **P**arse message with the query and an optional name. In our case the name is empty
- The code then **B**inds params to that query (if the query is parameterized)
- It then **E**xecutes using the combination of the parsed query and the bound params

This is perfectly safe to do in transaction mode, and from a SQL safety perspective should behave identically to a named prepared statement.

#### Named protocol-level statements 
For comparison, when ActiveRecord has prepared statements turned on, things _look_ a bit different, but by the end we’re in the same place:

    def exec_cache(sql, name, binds...)
      #...pseudo coded a bit but importantly
      #   it calls `prepare`
      if !cached
        stmt_key = conn.prepare(sql)
      # then it calls exec_prepared
      conn.exec_prepared(stmt_key, type_casted_binds)

It first has to call `prepare` with whatever sql we’re going to run. The caller is in charge of keeping track of whether the sql has been prepared before - Postgres won’t let the same name be prepared twice on one session, and preparing the same sql under a fresh name every time might as well just be executing an unnamed statement. Then it calls `exec_prepared` with only the `stmt_key`, which should match the name of a previously prepared query.
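
You can see why at the SQL level - this uses `PREPARE`, but a protocol-level Parse with a repeated name fails the same way (the `users` table here is hypothetical):

	PREPARE fetch_user (int) AS SELECT * FROM users WHERE id = $1;
	EXECUTE fetch_user(1);
	-- Re-preparing the same name on the same session is an error
	PREPARE fetch_user (int) AS SELECT * FROM users WHERE id = $1;
	-- ERROR: prepared statement &#34;fetch_user&#34; already exists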

If we skip ahead to what gets called in `libpq`:

    // conn.prepare(sql)
    int
    PQsendPrepare(PGconn *conn,
        const char *stmtName, 
        const char *query,
        int nParams, 
        const Oid *paramTypes) {
      //...
      if (pqPutMsgStart(&#39;P&#39;, conn) &lt; 0 ||
          pqPuts(stmtName, conn) &lt; 0 ||
          pqPuts(query, conn) &lt; 0) {}
      //...
    }

We see something similar to our earlier Parse/Bind/Execute, but now we’re _only_ calling the **P**arse portion and this time we have a `stmtName`. We then trigger the prepared statement calling `exec_prepared`, which ultimately calls `PQsendQueryPrepared`:

    // conn.exec_prepared(stmt_key, type_casted_binds)
    int
    PQsendQueryPrepared(PGconn *conn,
        const char *stmtName,
        int nParams,
        const char *const *paramValues,
        const int *paramLengths,
        const int *paramFormats,
        int resultFormat) {
      //...
      return PQsendQueryGuts(conn,
          NULL,     // no sql
          stmtName, // named
          nParams,
          NULL,
          paramValues,
          paramLengths,
          paramFormats,
          resultFormat);
      //...
    }

Anything look familiar? That’s the same `PQsendQueryGuts` function we called for the unnamed statement! This time it doesn’t hand a `command` in because we already parsed our SQL in the earlier `prepare` call. We also have a `stmtName` defined, instead of handing in an empty string. This version goes on to skip the **P**arse, call the **B**ind with the `stmtName`, then call **E**xecute - same flow as our unnamed version.

For SQL injection safety, both named and unnamed versions are equivalent: they separate query structure (Parse) from data values (Bind). Adding query bindings when not in a prepared statement simply makes an unnamed statement.

Nothing about these calls is specific to the `libpq` library, it’s just a rock solid implementation of them[^13] - any language could make the same protocol calls. If a library is utilizing this protocol, it is doing the same things when binding to an unnamed prepared statement as when binding to a named prepared statement[^14].

As long as your code uses parameterized queries, “turning off” prepared statements for PgBouncer is safe, even if it seems a bit unnerving. There is a [PR to allow PgBouncer to track prepared statements](https://github.com/pgbouncer/pgbouncer/pull/757), so maybe this won’t cause people like me as much heartburn in the future 🥲.

&lt;h3 id=&#34;throughput&#34;&gt;Pool throughput / Long running queries 🏃‍♂️&lt;/h3&gt;

We’ve got two types of connections to Postgres: active and idle. Idle connections are the backbone of poolers - having idle connections means we’ve got capacity to swap around transactions for connected clients. What about active connections?

An active connection means that connection is actively tied up by the database. For that timespan, the connection cannot be swapped out to do anything else until its operation completes. We know that active connections get expensive quickly, and we also know that most managed services range somewhere from 50 to 500 allowed total, non-pooled connections. 
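
If you want to see that split on a live database, one quick way (a sketch - `pg_stat_activity` columns vary slightly across Postgres versions) is to group sessions by state:

	SELECT state, count(*)
	FROM pg_stat_activity
	WHERE backend_type = &#39;client backend&#39;
	GROUP BY state;
	-- Illustrative output: a handful of &#39;active&#39; rows,
	--    and a large pile of &#39;idle&#39; ones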

Using a max PgBouncer connection pool of 10k and Render’s managed Postgres service with a max of 397 total connections means we’d have:

10000 / 397 = ~25 pooled client connections per real connection

Using Supabase’s 50 connections the spread is even higher:

10000 / 50 = ~_200_ pooled client connections per real connection

That means that for every long running operation, you are potentially down 200 connections worth of pooling.

These numbers are very back of the napkin and of course do not represent the true scaling capability and connection handling of a real pooler. But the point is this:

- Active connections are very valuable to a pooler
- Long running queries disproportionally impact concurrency

As an example, you’re using Render Postgres fronted by PgBouncer and you’ve got 10k available connections backed by the max of 397 Postgres connections. Let’s say a new feature is introduced for populating some graph data on your app’s landing page. It’s powered by a new query that looks great, has indexes, and seems well optimized. It’s even run against some load testing and representatively sized data as a QA check. It gets deployed to production and _OOF, it’s taking 15 seconds per query_ 🐌. Users are logging in or navigating to the landing page all the time so within moments you’ve had thousands of hits to this query. Obviously this is going to get quickly rolled back, but what does it mean for your pool in the meantime?

![](https://media1.giphy.com/media/137TKgM3d2XQjK/giphy.gif?cid=6c09b952ebbb45e59739e8c9dd3ca08d23031a7fe573cd54&amp;rid=giphy.gif&amp;ct=g)

It means you’re maxed out. Your pooler being there means at least you’re less likely to start erroring out right away, but transaction mode can’t fix a stuck query. For each of those 15 second chunks of time your concurrency basically went from 10k back down to 397.

This is not the general behavior you’ll see when using PgBouncer unless you’ve really got some intermittent trouble with runaway queries. But it does emphasize an important point to remember: these are not real Postgres connections. Your upper bound on long running, active queries is always constrained by your actual pool of real Postgres connections.

#### Guarding against slow queries
- [Log your slow queries](https://www.crunchydata.com/blog/logging-tips-for-postgres-featuring-your-slow-queries) using `log_min_duration_statement` (see the sketch after this list). This option lets you set a threshold, and if queries take longer than that threshold Postgres will log the offending query. This won’t help the sudden mass slow query situation mentioned above, but it helps to keep an eye on overall app query health
- Use [streaming queries](https://www.postgresql.org/docs/current/libpq-single-row-mode.html) sparingly. In most client libraries you can set your query to run in “single row mode”. This means you retrieve your rows one at a time instead of getting one big result set at once. This is helpful for efficiency with very large result sets but is slower than a full result set query, and probably means you are running queries large enough to be slower in the first place
- Use [statement timeouts](#statement-timeouts). This is tricky, especially when pooling, but see that section for ideas on how to approach it
- Spread out reads across read replicas
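
As promised, a minimal sketch of the logging option - the one second threshold is arbitrary, and on a managed service you’d set this through their configuration tooling rather than `ALTER SYSTEM`:

	-- Log any statement that runs longer than one second
	ALTER SYSTEM SET log_min_duration_statement = &#39;1s&#39;;
	-- Reload so the new setting takes effect without a restart
	SELECT pg_reload_conf();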

&lt;h3 id=&#34;session-level-locks&#34;&gt;Session Level Advisory Locks 🔐&lt;/h3&gt;

Session level advisory locks work fine in PgBouncer.

![](https://i.gifer.com/7DWJ.gif)

Sorry 🙈.

If you’ve read the previous sections you’ve already picked up on the pattern: “session” anything means it probably doesn’t work in [transaction mode](#transaction-mode). But what does that matter to you?

Advisory locks are a great option for creating simple, cross process, cross server application mutexes based on a provided integer key. Unlike traditional locks you use/encounter elsewhere in Postgres which are tied to tables or rows, advisory locks can be created independent of tables to control application level concerns. There are plenty of other tools you could use for this job outside of Postgres, but since Postgres is already part of your tech stack it’s a convenient and simple option.

Across languages a common use case for session level advisory locks is to hold a lock while database migrations (ie, DDL) are being run. For example:

	-- 1234 is arbitrary, it can be any integer
	SELECT pg_advisory_lock(1234);
	SET lock_timeout TO &#39;1s&#39;;
	ALTER TABLE my_table...;
	INSERT INTO migrations VALUES (1234567);
	-- If we don&#39;t explicitly unlock here, the lock will be held until this 
    --    connection is closed
	SELECT pg_advisory_unlock(1234);

If another connection went to acquire the same lock, it would be blocked:

	-- This will block indefinitely until the other connection is closed, 
    --    or calls pg_advisory_unlock(1234)
	SELECT pg_advisory_lock(1234);

This is largely an attempt to improve consistency of migration tracking, and help coordinate multi process deploys:

- Continuous deployment with the potential to trigger multiple deployments in succession
- Propagating code changes to multiple servers with deploy scripts automatically triggering migrations in each context

By waiting to acquire a lock at the Postgres level, each process waits for the first lock owner to finish before continuing, coordinating each process based on a shared lock key.

#### Once more, with &lt;strike&gt;feeling&lt;/strike&gt; PgBouncer
Now for the obligatory example of trying the same thing when connected to PgBouncer 🫠:

	-- Grab the lock on connection 1
	SELECT pg_advisory_lock(1234);
	-- Connection 1 goes back into pool
	-- ...
	-- Try to unlock on connection 2, which does not own the 1234 lock
	SELECT pg_advisory_unlock(1234);
	-- WARNING: you don&#39;t own a lock of type ExclusiveLock

We try to unlock, but because we’re on a different connection we can’t. The lock stays locked for as long as connection 1 stays alive, which means no one else can acquire that lock unless that connection naturally closes at some point or is explicitly `pg_terminate_backend`ed 😓 (`pg_cancel_backend` only cancels the current query - it won’t release a session level lock).
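
If you do end up in that state, here’s a recovery sketch - it assumes the lock key fits in 32 bits, since Postgres stores the advisory key across the `classid`/`objid` columns:

	-- Find whichever backend is holding advisory lock 1234 and kill it
	SELECT pg_terminate_backend(pid)
	FROM pg_locks
	WHERE locktype = &#39;advisory&#39;
	  AND objid = 1234
	  AND granted;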

#### More session advisory lock use cases
Outside of migrations, advisory locks can serve other use cases:

- Application mutexes on sensitive operations like [ledger updates](https://rclayton.silvrback.com/distributed-locking-with-postgres-advisory-locks)
- [Leader election](https://jeremydmiller.com/2020/05/05/using-postgresql-advisory-locks-for-leader-election/) for maintaining a single but constant daemon operation across servers
- Exactly once run job controls for Postgres based job systems like [GoodJob](https://github.com/bensheldon/good_job) and [Que](https://github.com/que-rb/que)

If these things sound interesting or useful, they are! But only if you connect directly to Postgres. 
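
To make the leader election idea concrete, a minimal sketch (on a direct connection - the key `42` is arbitrary):

	-- Returns true for exactly one connection at a time; everyone
	--    else gets false immediately instead of blocking
	SELECT pg_try_advisory_lock(42) AS is_leader;
	-- The winner holds leadership until it unlocks or disconnects;
	--    the rest can retry on an interval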

#### Transaction level locks
Advisory locks do have a transaction based companion: 

	-- Process 1
	BEGIN;
	SELECT pg_advisory_xact_lock(1234);
	
	-- Process 2 
	-- Blocks while process 1 is in the transaction
	SELECT pg_advisory_lock(1234);
	
	-- Back in Process 1
	SET LOCAL lock_timeout TO &#39;1s&#39;;
	ALTER TABLE my_table...;
	INSERT INTO migrations VALUES (1234567);
	COMMIT; -- automatically unlocks on commit or rollback
	-- Process 2 now can acquire the lock
	
	-- If you need to manually unlock while still in the transaction 
	-- SELECT pg_advisory_xact_unlock(1234);

You could use it as a replacement for certain scenarios, like the above migration operating transactionally. For custom purposes, it’s a good alternative! 

Unfortunately most migration tooling, things like leader election, and request or job lifetime locks, all use or require a longer lived lock than a single transaction could reasonably provide.

#### Turn off advisory migration locks
If you need to run migrations against PgBouncer, in Rails you can turn them off by setting `advisory_locks: false` in `database.yml`. Other migration tools likely have something similar. Do it at your _own_ peril 🤷🏻‍♂️

#### Maintaining a separate direct connection to Postgres
If the lock is critical, but the operations past the lock fan out and acquire multiple connections, you could potentially have two pieces:

- A direct connection to Postgres where you acquire a session level advisory lock
- Your normal [code level connection pooling](#framework-pool) using your PgBouncer connections so it can capitalize on the scaling opportunities provided there

There’s an obvious downside - you’re consuming an extra direct connection and potentially impacting [throughput](#throughput) - but it’s an alternative available if needed.

&lt;h3 id=&#34;listen-notify&#34;&gt;Listen / Notify 📣&lt;/h3&gt;

Postgres comes out of the box with a handy pub/sub feature called [LISTEN](https://www.postgresql.org/docs/current/sql-listen.html)/[NOTIFY](https://www.postgresql.org/docs/current/sql-notify.html).

You simply call:

	LISTEN channel_name;

And that connection will receive `NOTIFY` events:

	NOTIFY channel_name, &#39;hi there!&#39;;

Like session level advisory locks, there are more robust pub/sub solutions out there. But the Postgres implementation works well, and you already have it available in your stack.

Looking at the example, you’ll notice that the `LISTEN` call is just a single statement, and it activates the listener for the current session. What have we said so many times already? Sessions bad. Transactions good... kind of.

#### kind of?
Similar to prepared statements, the docs are misleading when it comes to `LISTEN`/`NOTIFY`.

PgBouncer officially lists `LISTEN`/`NOTIFY` as an unsupported feature in transaction mode, which is not precisely true. `LISTEN` does not work in transaction mode, but `NOTIFY` does. 

`NOTIFY` is a single statement, and doesn’t rely on any session semantics. It’s also transactional[^15]:

	BEGIN;
	NOTIFY channel_name, &#39;hi!&#39;;
	ROLLBACK; -- no notification is sent

Both `NOTIFY` formats (inside and outside a transaction) work fine with transaction mode pooling. If you want to use pub/sub, you just need to make sure your `LISTEN`er is connected directly to Postgres. Since [it can be hard to tell if you’re connected to Postgres or PgBouncer](#transparency) this is somewhat tricky, unfortunately. 

I’ve built implementations that `LISTEN` on a direct Postgres connection and `NOTIFY` through PgBouncer. There’s not much writing on this approach, but I’ve found it to work well.
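
A sketch of that split, with a hypothetical `jobs` channel:

	-- On a dedicated, direct-to-Postgres connection (session state matters):
	LISTEN jobs;
	
	-- Everywhere else, over pooled PgBouncer connections:
	NOTIFY jobs, &#39;job 123 enqueued&#39;;
	-- or transactionally, so the notification only fires on commit:
	BEGIN;
	NOTIFY jobs, &#39;job 124 enqueued&#39;;
	COMMIT;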

&lt;h3 id=&#34;single-threaded&#34;&gt;The single thread 🪡&lt;/h3&gt;

In contrast to the multi process monster that is Postgres, PgBouncer runs on a paltry single process with a single thread.

This means that no matter how capable a server is, PgBouncer will only ever utilize a single CPU core - once [you’ve maxed out that core](https://news.ycombinator.com/item?id=17187436), you can’t scale that single instance any further.

A popular option is to [load balance PgBouncer instances](https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers). Otherwise, almost every alternative to PgBouncer (like Odyssey, PgCat and Supavisor) utilizes multiple cores. 

If you’re using a managed Postgres service (like Crunchy Data, Supabase, Neon or Heroku), your default option is going with PgBouncer as a connection pooler - so it will be up to those services to offer a load balanced option.

&lt;h3 id=&#34;pg_dump&#34;&gt;pg_dump 🚮&lt;/h3&gt;

If you’re running `pg_dump` against PgBouncer, it’s probably by mistake.

As far as I can tell, `pg_dump` is broken when run against PgBouncer. See [https://github.com/pgbouncer/pgbouncer/issues/452](https://github.com/pgbouncer/pgbouncer/issues/452). 

The answer here is to make sure you’re using a direct connection to Postgres for utility operations like `pg_dump`.

&lt;h3 id=&#34;unavailable&#34;&gt;Other unavailable features 🫥&lt;/h3&gt;

![](https://media0.giphy.com/media/VCZgfe90H1tMTAW6n4/giphy.gif?cid=6c09b9526e2d6dfd051e5257c6dbce5ac862293219ad6e76&amp;rid=giphy.gif&amp;ct=g)

There are some remaining features which transaction mode is incompatible with as well[^16]. I have less or no experience with these:

- `WITH HOLD CURSOR` - a `WITH HOLD` cursor continues to exist outside of a transaction, which seems like it could have handy use cases, but I’ve never personally used it in my day to day
- `PRESERVE ROWS`/`DELETE ROWS` temp tables - temporary tables are a session level feature, so they will not work properly; preserve/delete rows are modifiers on how those temporary tables behave on commit, and are unsupported
- `LOAD` statement - this is for loading shared libraries into Postgres, so it makes sense this is not something you should be doing through a pooler. I haven’t actually tried, so I’m not sure if PgBouncer would stop you, but it requires superuser privileges so it’s very unlikely that’s what your PgBouncer user has

PgBouncer documents a simple “[SQL feature map for pooling modes](https://www.pgbouncer.org/features.html)” where you can see all the features mentioned in this post.

&lt;h2 id=&#34;linting&#34;&gt;Linting 🧶&lt;/h2&gt;

Aside from having identified potential issues - what can we do to avoid them in an automated way?

Surprisingly, not much exists. And by not much, I mean I’ve found nothing outside of advice.

It makes me feel a bit like I’m exaggerating the importance of these issues. Maybe I’m the oddball that has actually encountered many of them in real production usage and had to address them. I’ve had statement timeouts and lock timeouts misapplied. I’ve had to deal with rearranging connections because of code using a session advisory lock and `LISTEN`/`NOTIFY`, or drop libraries that use them. I’ve had to remember to turn off prepared statements in my ORM to avoid named prepared statement errors.

The implications can feel small, but they can be surprising, and particularly around migrations they can cause serious downtime.

We lint everywhere. As engineers we try to automate away as many mistakes as possible with linting and specs. As development teams grow, the importance of automation becomes critical to scaling because otherwise someone somewhere is going to do the wrong thing and it won’t get caught.

Some ideas that would be great to see:

- PgBouncer optional process that detects bad queries and logs them
- RDS connection pinning behavior
- Static analysis tools for app queries
- Runtime extension to client libraries
- Making sure your development flow runs PgBouncer locally, to try and encounter this behavior before it happens in production

In the rails world there are [several](https://github.com/braintree/pg_ha_migrations) [active](https://github.com/doctolib/safe-pg-migrations) [gems](https://github.com/ankane/strong_migrations) devoted to keeping a codebase safe from issues that would cause downtime while migrating tables (ie, zero downtime). But across ecosystems I could not find anything related to protecting against PgBouncer issues.

As a step in this direction, I’ve published a (currently experimental) gem for use in Rails/ActiveRecord apps called [pg\_pool\_safe\_query](https://rubygems.org/gems/pg_pool_safe_query). It will log warnings if SQL is run that is incompatible with PgBouncer and raise an error if advisory locks and prepared statements are not disabled.

&lt;h2 id=&#34;future-improvements&#34;&gt;Can we improve connections without a pooler?&lt;/h2&gt;

A more recent development in Postgres 14 was improvements to [snapshot scalability](https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/analyzing-the-limits-of-connection-scalability-in-postgres/ba-p/1757266), which seem to have resulted in big improvements in efficiently [maintaining more idle connections](https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/improving-postgres-connection-scalability-snapshots/ba-p/1806462#first-performance-improvements) in Postgres. 

It’s exciting to see effort being applied to increasing connection efficiency in Postgres itself. The author of that snapshot scalability improvement lines up with my own frustrations:

- Ideally Postgres would better handle traffic spikes without requiring a pooler
- Poolers cut out useful database features 
- Postgres itself would ideally move towards architecture changes across several key areas, eventually culminating in a larger move towards a lighter weight process/thread/async model which better aligns with the C10k problem out of the box

Most of the work in the industry _seems_ to concentrate on building better poolers, rather than improving the internals of Postgres connection handling itself[^17]. Outside of PgBouncer you’ve got RDS Proxy, Odyssey, PgCat, Supavisor, PgPool II and I’m sure others. All have their own benefits but suffer from the same transactional scaling limitations. 

In fairness to the _incredible_ work that goes into Postgres - every performance improvement they make in every new version is also a connection scalability improvement. If the queries, indexes, plans, and processes make big performance gains with each version, then fewer connections can do more.

&lt;h2 id=&#34;alternatives&#34;&gt;PgBouncer alternatives&lt;/h2&gt;

There are alternatives to PgBouncer, but the same transaction limitations apply to all of them: each has a transaction mode (or operates exclusively in transaction mode) that offers the best scaling. Once in transaction mode you can’t support most session level features anymore, and you’re banking on the fact that database connections spend more time idle than active.

They all have their own unique benefits in comparison, but have the same fundamental transaction limitations.

- [Supavisor](https://github.com/supabase/supavisor)
- [PgCat](https://github.com/postgresml/pgcat)
- [Odyssey](https://github.com/yandex/odyssey)
- [Pg Pool II](https://pgpool.net/mediawiki/index.php/Main_Page)
- [RDS Proxy](https://aws.amazon.com/rds/proxy/)

## Am I finally done with this post?
I think I’ve said enough.

Postgres is great. PgBouncer is important. Know what can go wrong and account for it.

🐘 ✌🏼 🐘 

[^1]:	This [article from Brandur](https://brandur.org/postgres-connections) details some additional nuances of handling connections and pools, but these three are the higher level version of it

[^2]:	Technically it doesn’t have to be a single instance, it could be a round robin of multiple PgBouncers, but from a client perspective you connect to a single one

    [https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers](https://www.crunchydata.com/blog/postgres-at-scale-running-multiple-pgbouncers)

[^3]:	[It’s even lower](https://elements.heroku.com/addons/heroku-postgresql) on their lower powered options. It goes from 20, to 120, to 400, then 500 once you’re around their $400/mo plans.

    Supabase has no standard plans with [more than 50 connections](https://supabase.com/blog/supabase-pgbouncer).

    [Render.com’s managed Postgres offering](https://render.com/docs/databases#connecting-to-your-database) is based on memory available on each plan: 6 gigs or less is 97 connections, less than 10 gigs is 197 connections and over 10 gigs is 397 connections.

    This isn’t totally unreasonable - managing more connections requires more cores and more memory in a process based model especially. But at their highest tiers these services don’t exceed 500 available connections.

    More generalized services like Azure and Amazon RDS will let you go as high as you like, but that’ll go badly.

[^4]:	Which is very exciting to see concentrated work on improving this aspect in Postgres internals! 🤘🏼

[^5]:	This more recent Crunchy Data article on making sure your Postgres app is production ready implies the old standard of 500 connections is no longer accurate, so I’d be curious to know more specifics, since most resources still emphasize these numbers: [https://www.crunchydata.com/blog/is-your-postgres-ready-for-production](https://www.crunchydata.com/blog/is-your-postgres-ready-for-production)

[^6]:	AFAIK this is largely true of any database, and MySQL also has connection pooling solutions, but it does _seem_ to be particularly necessary with Postgres 

[^7]:	There are a couple caveats to this statement. Just having a dedicated low latency pool is an improvement so may slightly help concurrency. PgBouncer can also proxy multiple databases so you could increase read concurrency at least this way.

    The queueing behavior of poolers can also be a benefit since you can wait for a connection to be available for longer, vs Postgres instantly rejecting the attempt: [https://www.percona.com/blog/connection-queuing-in-pgbouncer-is-it-a-magical-remedy/](https://www.percona.com/blog/connection-queuing-in-pgbouncer-is-it-a-magical-remedy/)

[^8]:	They do mention this in [their docs](https://www.pgbouncer.org/features.html):

    \&gt; “Note that “transaction” pooling breaks client expectations of the server by design and can be used only if the application cooperates by not using non-working features.”

[^9]:	[https://github.com/pgbouncer/pgbouncer/issues/653](https://github.com/pgbouncer/pgbouncer/issues/653)

    [https://github.com/pgbouncer/pgbouncer/issues/249](https://github.com/pgbouncer/pgbouncer/issues/249)

    It also just seems to be something they’re not interested in doing

[^10]:	The idea is you fail and continue to retry. See this article for some framework agnostic approaches to retries: [https://postgres.ai/blog/20210923-zero-downtime-postgres-schema-migrations-lock-timeout-and-retries](https://postgres.ai/blog/20210923-zero-downtime-postgres-schema-migrations-lock-timeout-and-retries)

[^11]:	I might have five nickels, but still, it happens. Also again I am grateful for anyone taking the time to write up content and share their expertise.

[^12]:	That’s a topic for another time...

[^13]:	In addition to the Ruby pg gem, it’s used by the Python [psycopg](https://github.com/psycopg/psycopg) lib and the Node [node-libpq](https://github.com/brianc/node-libpq) package (and I’m sure many others). So it seems like most client libraries handle things safely enough at the protocol level to turn off prepared statements 

    If you are using Go with the pure Go lib/pq driver [see this issue](https://github.com/lib/pq/issues/889) for how to properly handle unnamed statements. The [rust sqlx library](https://github.com/launchbadge/sqlx/issues/368) seems to have a similar issue. Seems that if a library does not use libpq they end up in a bit of pain when trying to work with PgBouncer

[^14]:	Named prepared statements can boost performance for repetitious queries because they bypass the Parse call on subsequent runs. That’s their primary benefit in comparison to unnamed statements

[^15]:	`LISTEN` can be [called in a transaction as well](https://www.postgresql.org/docs/current/sql-listen.html), but all that means is the session level listen won’t be triggered until the transaction commits, and won’t start listening at all if a rollback is triggered

[^16]:	I do find myself asking “what is the point of nice features if you can’t use them at scale because of transaction mode pooling”? Not being able to use certain features at scale should never preclude them from being built - but it’s a disappointing reality

[^17]:	I’m a _little_ afraid to have made this statement and the potential for someone to come back at me angry about this being an oversimplification 😅
</source:markdown>
    </item>
    
    <item>
      <title>Making Tanstack Table 1000x faster with a 1 line change</title>
      <link>https://jpcamara.com/2023/03/07/making-tanstack-table.html</link>
      <pubDate>Tue, 07 Mar 2023 17:10:00 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2023/03/07/making-tanstack-table.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/2ab8d73f1c.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;A few months back I was working on a Javascript frontend for a large dataset using &lt;a href=&#34;https://tanstack.com/table/v8&#34;&gt;Tanstack Table&lt;/a&gt;. The relevant constraints were:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Up to 50k rows of content&lt;/li&gt;
&lt;li&gt;Grouped by up to 3 columns&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Using react and virtualized rendering, showing 50k rows was performing well. But when the Tanstack Table grouping feature was enabled, I was seeing slowdowns on a few thousand rows, and &lt;em&gt;huge&lt;/em&gt; slowdowns on 50k rows.&lt;/p&gt;
&lt;p&gt;It might have gone unnoticed if it was 100ms slower, or even 500ms slower. But in the worst case renders would go from less than a second without grouping up to &lt;em&gt;30-40 seconds&lt;/em&gt; with it.&lt;/p&gt;
&lt;h2 id=&#34;tracking-down-the-issue&#34;&gt;Tracking down the issue&lt;/h2&gt;
&lt;p&gt;Initially I tried using the chrome Javascript profiler, but it can be tough to use when performance is so slow. The profiler adds noticeable overhead to your code, and since the code took 30-40 seconds already, it was basically unusable&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Unable to use the profiler, I reached for an old, simple standby: &lt;code&gt;console.time&lt;/code&gt;&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;. This is a convenient way to see how long a section of code takes, logged to your console:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;expensive code&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#a6e22e&#34;&gt;thisIsExpensive&lt;/span&gt;();
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;expensive code&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#75715e&#34;&gt;// console.time
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   expensive code: 1000ms
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;A side note about optimizing code&lt;/strong&gt;: as programmers we are &lt;em&gt;full&lt;/em&gt; of ideas about what needs to be optimized and are &lt;em&gt;terrible&lt;/em&gt; at getting it right.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We make educated guesses at what is important to optimize and where the problem is, and until we measure we’re usually wrong. I try now to wrap everything possible in a benchmark to make sure I’m even in the right place, then start narrowing the benchmark inside of the code from there.&lt;/p&gt;
&lt;p&gt;When tracking down a performance issue this would be a general outline:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#a6e22e&#34;&gt;elements&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;forEach&lt;/span&gt;(() &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;methodCall&amp;#39;&lt;/span&gt;);
  &lt;span style=&#34;color:#a6e22e&#34;&gt;methodCall&lt;/span&gt;(() &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
    &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;build&amp;#39;&lt;/span&gt;);
	&lt;span style=&#34;color:#a6e22e&#34;&gt;build&lt;/span&gt;();
	&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;build&amp;#39;&lt;/span&gt;);
  });
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;methodCall&amp;#39;&lt;/span&gt;);
});
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#75715e&#34;&gt;// build      49ms
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// methodCall 50ms
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// build      51ms
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// methodCall 52ms
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// everything 102ms
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Back to the table rendering&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;As usual, before measuring, all of my guesses at the potential performance problem were wrong.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;My own code was fine, so this was a case where it was actually a library bug, which was a surprise. There was no code path of mine that was performing poorly - all of the time was spent in the library. When using Tanstack Table in React, all of the logic happens in a pretty centralized place - the &lt;code&gt;useReactTable&lt;/code&gt; hook - so it was easy to see the time was all spent there&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#a6e22e&#34;&gt;customCode&lt;/span&gt;();
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;useReactTable&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#a6e22e&#34;&gt;useReactTable&lt;/span&gt;(...);
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;useReactTable&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
&lt;span style=&#34;color:#75715e&#34;&gt;// useReactTable 31500 ms
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// everything    31537 ms
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;ul&gt;
&lt;li&gt;One of the &lt;em&gt;nicest&lt;/em&gt; things about developing Javascript with packages is that at any time I can open up the &lt;code&gt;node_modules&lt;/code&gt; folder and play around with third party code. In this case I was able to modify the Tanstack Table source code directly to add some timing information.&lt;/li&gt;
&lt;li&gt;Turning on grouping was when everything slowed down, so it made the most sense to start timing that code&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This is an abbreviated version of the grouped row source code, with my first pass at timing what I thought were the likeliest culprits. Pay attention primarily to &lt;code&gt;console.time&lt;/code&gt; statements - you don’t have to understand everything going on.&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;getGroupedRowModel&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;extends&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;RowData&lt;/span&gt;&amp;gt;() {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
  &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;	
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;grouping filter&amp;#39;&lt;/span&gt;)
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;existing&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;grouping&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;filter&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt;
    &lt;span style=&#34;color:#a6e22e&#34;&gt;table&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;getColumn&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;)
  )
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;grouping filter&amp;#39;&lt;/span&gt;)
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupUpRecursively&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; (
    &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;Row&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt;&amp;gt;[],
    &lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;,
    &lt;span style=&#34;color:#a6e22e&#34;&gt;parentId?&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;string&lt;/span&gt;
  ) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;existing&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;length&lt;/span&gt;) {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
      &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt;
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;subRows&lt;/span&gt;) {
        &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;subRows&amp;#39;&lt;/span&gt;)
        &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;subRows&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupUpRecursively&lt;/span&gt;(
          &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;subRows&lt;/span&gt;, 
          &lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;
        )
        &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;subRows&amp;#39;&lt;/span&gt;)
      }
      &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;
    });
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;string&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;existing&lt;/span&gt;[&lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt;]&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rowGroupsMap&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupBy&lt;/span&gt;(
      &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;, 
      &lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;
    )
	
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;aggregatedGroupedRows&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array.&lt;span style=&#34;color:#66d9ef&#34;&gt;from&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;rowGroupsMap&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;entries&lt;/span&gt;()).&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;(([&lt;span style=&#34;color:#a6e22e&#34;&gt;groupingValue&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;groupedRows&lt;/span&gt;], &lt;span style=&#34;color:#a6e22e&#34;&gt;index&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
      &lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;id&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;:&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;groupingValue&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;
      &lt;span style=&#34;color:#a6e22e&#34;&gt;id&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;parentId&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;?&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;parentId&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;gt;&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;id&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;:&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;id&lt;/span&gt;
      &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(
        &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;aggregatedGroupedRows groupUpRecursively&amp;#39;&lt;/span&gt;
      )
      &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;subRows&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupUpRecursively&lt;/span&gt;(
        &lt;span style=&#34;color:#a6e22e&#34;&gt;groupedRows&lt;/span&gt;, 
        &lt;span style=&#34;color:#a6e22e&#34;&gt;depth&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;+&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,
        &lt;span style=&#34;color:#a6e22e&#34;&gt;id&lt;/span&gt;
      )
      &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(
        &lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;aggregatedGroupedRows groupUpRecursively&amp;#39;&lt;/span&gt;
      )
      &lt;span style=&#34;color:#75715e&#34;&gt;//...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;      }
    }
  }
  &lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;everything&amp;#39;&lt;/span&gt;);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;My hunch was that the &lt;code&gt;groupUpRecursively&lt;/code&gt; function was to blame - it made logical sense that tens of thousands of recursive calls could cause a slowdown (&lt;strong&gt;spoiler&lt;/strong&gt;: as usual, I was wrong 😑).&lt;/p&gt;
&lt;p&gt;The first pass was a bust - it logged thousands of the &lt;code&gt;subRows&lt;/code&gt; timers. Every iteration was fast and there were too many of them to be useful, so I cut that timer.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;console.time
 subRows: 0 ms
   at map (packages/table-core/src/utils/getGroupedRowModel.ts:48:25)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)

console.time
 subRows: 0 ms
   at map (packages/table-core/src/utils/getGroupedRowModel.ts:48:25)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)
       at Array.map (&amp;lt;anonymous&amp;gt;)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Removing that, I started to get closer. I was accounting for &lt;em&gt;most&lt;/em&gt; of the time but I had two problems: I wasn’t accounting for &lt;em&gt;all&lt;/em&gt; of the time (&lt;code&gt;everything&lt;/code&gt; was 33 seconds and &lt;code&gt;groupUpRecursively&lt;/code&gt; was only 23 seconds) and the chunk of time I was logging was too large to usefully identify the problem code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;console.time
  grouping filter: 1 ms
    at fn (packages/table-core/src/utils/getGroupedRowModel.ts:22:17)

console.time
  aggregatedGroupedRows groupUpRecursively: 23248 ms
    at map (packages/table-core/src/utils/getGroupedRowModel.ts:71:23)
        at Array.map (&amp;lt;anonymous&amp;gt;)
        at Array.map (&amp;lt;anonymous&amp;gt;)
        at Array.map (&amp;lt;anonymous&amp;gt;)

console.time
  everything: 33509 ms
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;I realized I had missed a function call - a little unassuming function called &lt;code&gt;groupBy&lt;/code&gt; - so I added a &lt;code&gt;console.time&lt;/code&gt; block there next:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;time&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;groupBy&amp;#39;&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rowGroupsMap&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupBy&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;)
&lt;span style=&#34;color:#a6e22e&#34;&gt;console&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;timeEnd&lt;/span&gt;(&lt;span style=&#34;color:#e6db74&#34;&gt;&amp;#39;groupBy&amp;#39;&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Got it! Almost the entirety of the 31 seconds was concentrated into 3 calls to &lt;code&gt;groupBy&lt;/code&gt;.&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;console.time
  grouping filter: 2 ms
    at fn (packages/table-core/src/utils/getGroupedRowModel.ts:22:17)

console.time
  groupBy: 10279 ms
    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)

console.time
  groupBy: 10868 ms
    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)
        at Array.map (&amp;lt;anonymous&amp;gt;)

console.time
  groupBy: 10244 ms
    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)
        at Array.map (&amp;lt;anonymous&amp;gt;)
        at Array.map (&amp;lt;anonymous&amp;gt;)

console.time
  aggregatedGroupedRows groupUpRecursively: 21159 ms
    at map (packages/table-core/src/utils/getGroupedRowModel.ts:71:23)
        at Array.map (&amp;lt;anonymous&amp;gt;)
        at Array.map (&amp;lt;anonymous&amp;gt;)

console.time
  everything: 31537 ms
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;For each grouped column, it was calling &lt;code&gt;groupBy&lt;/code&gt; and each call took roughly 10 seconds.&lt;/p&gt;
&lt;p&gt;So what the heck was going on in the &lt;code&gt;groupBy&lt;/code&gt; function that was causing such a massive slowdown? Can you pick it out?&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupBy&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;extends&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;RowData&lt;/span&gt;&amp;gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;Row&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt;&amp;gt;[], &lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;string&lt;/span&gt;) {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupMap&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;new&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Map&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;any&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;,&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Row&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;lt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;TData&lt;/span&gt;&amp;gt;[]&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt;()
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;reduce&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;getValue&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;
	&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;get&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;)
	&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;) {
	  &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
	} &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
	  &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
	}
	&lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;
  }, &lt;span style=&#34;color:#a6e22e&#34;&gt;groupMap&lt;/span&gt;)
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;I started chopping up the function. I switched the &lt;code&gt;Map&lt;/code&gt; to be an object literal in case that was causing some kind of memory overhead, and tried changing to a &lt;code&gt;for&lt;/code&gt; loop instead of &lt;code&gt;reduce&lt;/code&gt; in case the number of iterations plus closures was causing an issue&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
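&lt;p&gt;Roughly what those attempts looked like - a sketch rather than the exact edits I made in &lt;code&gt;node_modules&lt;/code&gt;:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;// Attempt 1: object literal instead of a Map - no improvement
const groupMap: Record&amp;lt;string, Row&amp;lt;TData&amp;gt;[]&amp;gt; = {}

// Attempt 2: a for loop instead of reduce - no improvement either
for (let i = 0; i &amp;lt; rows.length; i++) {
  const row = rows[i]
  const resKey = `${row.getValue(columnId)}`
  const previous = groupMap[resKey]
  if (!previous) {
    groupMap[resKey] = [row]
  } else {
    // the spread stayed, so (as it turned out) the real problem stayed too
    groupMap[resKey] = [...previous, row]
  }
}
&lt;/code&gt;&lt;/pre&gt;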
&lt;p&gt;Those had no effect so next I started commenting out lines of the function. When I commented out this line, all of a sudden everything started finishing instantly:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// map.set(resKey, [...previous, row])
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h3 id=&#34;what-was-the-purpose-of-that-line-and-why-was-it-so-slow&#34;&gt;What was the purpose of that line and why was it so slow?&lt;/h3&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;getValue&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;
&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;get&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;)
&lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
} &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;On each iteration of the &lt;code&gt;reduce&lt;/code&gt; call, the code:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Used the value of that column cell as a map key. Let’s say the value is the string “New York”&lt;/li&gt;
&lt;li&gt;If there was no value associated with “New York”, it would set a value of the current &lt;code&gt;row&lt;/code&gt; wrapped in an array&lt;/li&gt;
&lt;li&gt;If there was already a value, it would use the Javascript &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Operators/Spread_syntax#spread_in_array_literals&#34;&gt;spread operator&lt;/a&gt; to concatenate the current &lt;code&gt;row&lt;/code&gt; onto the end of the previous array value&lt;/li&gt;
&lt;li&gt;This means that on each iteration of &lt;code&gt;reduce&lt;/code&gt;, the spread operator was creating a new, incrementally larger array, getting slower with each iteration&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// 1st iteration
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;[...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; [&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;]
&lt;span style=&#34;color:#75715e&#34;&gt;// 2nd iteration
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;[...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; [&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;]
&lt;span style=&#34;color:#75715e&#34;&gt;// 3rd iteration
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;[...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; [&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;]
&lt;span style=&#34;color:#75715e&#34;&gt;// ...
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// 50,000th iteration
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;[...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;] &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; [&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;,...&lt;span style=&#34;color:#ae81ff&#34;&gt;49998&lt;/span&gt;,&lt;span style=&#34;color:#ae81ff&#34;&gt;49999&lt;/span&gt;]
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;h2 id=&#34;what-does-spread-do-behind-the-scenes&#34;&gt;What does spread do behind the scenes?&lt;/h2&gt;
&lt;p&gt;In our case the spread operator takes an &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Iteration_protocols#built-in_iterables&#34;&gt;iterable object&lt;/a&gt; and loops over it to create a new array. It’s probably more nuanced than this, but the spread operator is likely &lt;a href=&#34;https://frontendmasters.com/courses/algorithms/&#34;&gt;O(n)&lt;/a&gt;. That means the larger the array gets, the longer it takes to spread its values into a new array.&lt;/p&gt;
&lt;p&gt;I’m &lt;em&gt;certain&lt;/em&gt; there are additional efficiencies provided in the language internals but the following code is essentially equivalent&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;a&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; [&lt;span style=&#34;color:#ae81ff&#34;&gt;1&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;2&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;3&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;4&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;5&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;6&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;7&lt;/span&gt;];
	
&lt;span style=&#34;color:#75715e&#34;&gt;// spread
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;b&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; [...&lt;span style=&#34;color:#a6e22e&#34;&gt;a&lt;/span&gt;, &lt;span style=&#34;color:#ae81ff&#34;&gt;8&lt;/span&gt;];
	
&lt;span style=&#34;color:#75715e&#34;&gt;// manual loop
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;c&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; [];
	
&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;a&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;length&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;++&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;c&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;a&lt;/span&gt;[&lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;]);
}
&lt;span style=&#34;color:#a6e22e&#34;&gt;c&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#ae81ff&#34;&gt;8&lt;/span&gt;);
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Which means that the original version of that &lt;code&gt;groupBy&lt;/code&gt; code was equivalent to this:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// original code
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [...&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
	
&lt;span style=&#34;color:#75715e&#34;&gt;// manual spread
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;length&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;++&lt;/span&gt;) {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;tempPrevious&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; [];
  &lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;j&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;j&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;length&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;j&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;++&lt;/span&gt;) {
    &lt;span style=&#34;color:#a6e22e&#34;&gt;tempPrevious&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;[&lt;span style=&#34;color:#a6e22e&#34;&gt;j&lt;/span&gt;]);
  }
  &lt;span style=&#34;color:#a6e22e&#34;&gt;tempPrevious&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;[&lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;]);
  &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;tempPrevious&lt;/span&gt;;
  &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;tempPrevious&lt;/span&gt;);
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Seeing the manual spread, one thing sticks out immediately: we have a nested loop.&lt;/p&gt;
&lt;p&gt;A nested loop is a good shorthand for identifying one of the slower forms of algorithmic complexity: a “Big O” complexity of O(n^2)&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt;. This means that for an array size of 50,000, the number of iterations required would be 50,000^2, or 2.5 &lt;em&gt;billion&lt;/em&gt; iterations 😱.&lt;/p&gt;
&lt;p&gt;In our specific case the array started out empty and grew to a size of 50,000, so it wasn’t &lt;em&gt;quite&lt;/em&gt; as bad. Technically it was still a complexity of O(n^2), but &lt;em&gt;practically&lt;/em&gt; it was O(n^2/2)&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;. Either way it still meant:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;1,249,975,000, or 1 &lt;em&gt;billion&lt;/em&gt; 249 &lt;em&gt;million&lt;/em&gt; iterations&lt;/li&gt;
&lt;li&gt;&lt;em&gt;49,999&lt;/em&gt; discarded arrays allocated for &lt;em&gt;1,249,925,001&lt;/em&gt; entries&lt;/li&gt;
&lt;li&gt;Times three &lt;code&gt;groupBy&lt;/code&gt; calls that means &lt;em&gt;3,749,925,000&lt;/em&gt; iterations, &lt;em&gt;149,997&lt;/em&gt; discarded arrays and &lt;em&gt;3,749,775,003&lt;/em&gt; allocated entries&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Pour one out for the garbage collector and Javascript runtime 🙅🏻‍♂️☠️🙅🏻‍♂️. Honestly, for how many iterations and how much garbage we accumulated, it was operating pretty well 🫠.&lt;/p&gt;
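&lt;p&gt;If you want to feel that quadratic growth yourself, here’s a minimal standalone sketch you can run in Node - the absolute numbers will vary by machine, but the shape won’t:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;function buildWithSpread(n: number): number[] {
  let acc: number[] = []
  for (let i = 0; i &amp;lt; n; i++) {
    acc = [...acc, i] // copies the whole accumulator every iteration: O(n^2)
  }
  return acc
}

function buildWithPush(n: number): number[] {
  const acc: number[] = []
  for (let i = 0; i &amp;lt; n; i++) {
    acc.push(i) // appends in place: O(n) overall
  }
  return acc
}

for (const n of [1_000, 10_000, 50_000]) {
  console.time(`spread ${n}`)
  buildWithSpread(n)
  console.timeEnd(`spread ${n}`)

  console.time(`push ${n}`)
  buildWithPush(n)
  console.timeEnd(`push ${n}`)
}
&lt;/code&gt;&lt;/pre&gt;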
&lt;p&gt;So the question was - what could be done to improve it?&lt;/p&gt;
&lt;h3 id=&#34;spread-is-an-immutable-pattern-so-should-the-code-be-kept-immutable&#34;&gt;Spread is an immutable pattern, so should the code be kept immutable?&lt;/h3&gt;
&lt;p&gt;Aside from providing a convenient syntax for combining iterables, the spread operator also allows us to keep code immutable. This can be beneficial because avoiding mutations means we never change data out from under other code that is using it.&lt;/p&gt;
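&lt;p&gt;Here’s a contrived sketch of the kind of bug that immutability protects against:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;const selectedRows = ['a', 'b']
const snapshot = selectedRows // some other code holds a reference

selectedRows.push('c') // mutation: snapshot silently sees the change
console.log(snapshot.length) // 3

const next = [...selectedRows, 'd'] // spread: a brand new array
console.log(snapshot.length) // still 3 - existing references are unaffected
&lt;/code&gt;&lt;/pre&gt;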
&lt;p&gt;If this code needed to stay immutable, there were some other options we could try for better performance that would behave the same as the spread operator:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#75715e&#34;&gt;// Array.from()
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~10 seconds, just as slow 😒
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; Array.&lt;span style=&#34;color:#66d9ef&#34;&gt;from&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;)
&lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;)
	
&lt;span style=&#34;color:#75715e&#34;&gt;// slice()
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~10 seconds, just as slow 🙃
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;slice&lt;/span&gt;()
&lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;)
	
&lt;span style=&#34;color:#75715e&#34;&gt;// concat(row)
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~10 seconds, just as slow 🧐
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;concat&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;))
	
&lt;span style=&#34;color:#75715e&#34;&gt;// hand written for loop
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~14 seconds, slower than spread! 😱 
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; []
&lt;span style=&#34;color:#66d9ef&#34;&gt;for&lt;/span&gt; (&lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#ae81ff&#34;&gt;0&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;&amp;lt;&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;length&lt;/span&gt;; &lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;&lt;span style=&#34;color:#f92672&#34;&gt;++&lt;/span&gt;) {
  &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;[&lt;span style=&#34;color:#a6e22e&#34;&gt;i&lt;/span&gt;])
}
&lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;arr&lt;/span&gt;)
	
&lt;span style=&#34;color:#75715e&#34;&gt;// concat([row])
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~4 seconds 🏃‍♂️ 
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;concat&lt;/span&gt;([&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;]))
	
&lt;span style=&#34;color:#75715e&#34;&gt;// using the `immer` node package
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;//   ~100ms 🚀 
&lt;/span&gt;&lt;span style=&#34;color:#75715e&#34;&gt;&lt;/span&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;produce&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;groupMap&lt;/span&gt;, (&lt;span style=&#34;color:#a6e22e&#34;&gt;draft&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;reduce&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`fixedKey`&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;let&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;get&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;)
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;) {
      &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
    } &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
      &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;
  }, &lt;span style=&#34;color:#a6e22e&#34;&gt;draft&lt;/span&gt;)
})
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;Array.from&lt;/code&gt;, &lt;code&gt;slice&lt;/code&gt;, and &lt;code&gt;concat&lt;/code&gt; all had the same performance characteristics as the spread operator. There were a few surprises however:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A manual &lt;code&gt;for&lt;/code&gt; loop was the slowest option, coming in at around 14 seconds. The native code versions of these methods are clearly much more optimized than the manual loop we can write&lt;/li&gt;
&lt;li&gt;&lt;code&gt;concat&lt;/code&gt;, when handed an array as the argument instead of a plain object, was consistently faster than spread on any v8 platform (node/chrome). It took half the time! Either there was a hard-to-find mistake in my code or the runtime had a special optimization for this use case. Still too slow, however&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;https://github.com/immerjs/immer&#34;&gt;Using &lt;code&gt;immer&lt;/code&gt;&lt;/a&gt; performed significantly faster than all other options. &lt;code&gt;immer&lt;/code&gt; provides regular Javascript interfaces on immutable data using &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Proxy&#34;&gt;proxies&lt;/a&gt; and &lt;a href=&#34;https://blog.klipse.tech/javascript/2021/02/26/structural-sharing-in-javascript.html&#34;&gt;structural sharing&lt;/a&gt; to make efficient changes to data structures without changing the original source object. However in this case the main benefit of the &lt;code&gt;immer&lt;/code&gt; approach is that we were not copying any arrays - just &lt;code&gt;push&lt;/code&gt;ing directly to them because they were created inline in our function&lt;/li&gt;
&lt;/ul&gt;
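&lt;p&gt;To make the structural sharing idea concrete, here’s a tiny &lt;code&gt;immer&lt;/code&gt; sketch, separate from the table code:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;import { produce } from 'immer'

const state = { a: [1, 2, 3], b: [4, 5, 6] }

const next = produce(state, draft =&amp;gt; {
  draft.a.push(99) // looks like a mutation, but is recorded on a proxy
})

console.log(next === state)     // false - the changed root is a new object
console.log(next.a === state.a) // false - the modified array was copied
console.log(next.b === state.b) // true  - the untouched array is shared
&lt;/code&gt;&lt;/pre&gt;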
&lt;p&gt;But did we even need immutability at all? Was the &lt;code&gt;immer&lt;/code&gt; approach good enough? Could we get even faster?&lt;/p&gt;
&lt;h3 id=&#34;using-push-directly&#34;&gt;Using push directly&lt;/h3&gt;
&lt;p&gt;For performance, you can’t beat mutations.&lt;/p&gt;
&lt;p&gt;Mutations modify data in place, without copying and creating duplicated data. Looking over our &lt;code&gt;groupBy&lt;/code&gt; function, there wasn’t any benefit to using immutable operations. Everything was created in the function so we could safely mutate it before we returned it. I don’t think the original code was written with immutability in mind - using the spread syntax was just convenient.&lt;/p&gt;
&lt;p&gt;When using array &lt;code&gt;push&lt;/code&gt;, we are effectively running with an algorithmic complexity of O(1), amortized. That means as our array grows, the cost of inserting new elements does not. It’s effectively constant time: it’s as fast for 1 item as it is for 50,000.&lt;/p&gt;
&lt;p&gt;Here’s the final version of the fix, now 1000x faster for my use case:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#66d9ef&#34;&gt;function&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupBy&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;extends&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;RowData&lt;/span&gt;&amp;gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;Row&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;TData&lt;/span&gt;&amp;gt;[], &lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;: &lt;span style=&#34;color:#66d9ef&#34;&gt;string&lt;/span&gt;) {
  &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;groupMap&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#66d9ef&#34;&gt;new&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Map&lt;/span&gt;&amp;lt;&lt;span style=&#34;color:#f92672&#34;&gt;any&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;,&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;Row&lt;/span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&amp;lt;&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;TData&lt;/span&gt;&amp;gt;[]&lt;span style=&#34;color:#f92672&#34;&gt;&amp;gt;&lt;/span&gt;()
	
  &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;rows&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;reduce&lt;/span&gt;((&lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;, &lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;) &lt;span style=&#34;color:#f92672&#34;&gt;=&amp;gt;&lt;/span&gt; {
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;${&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;getValue&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;columnId&lt;/span&gt;)&lt;span style=&#34;color:#e6db74&#34;&gt;}&lt;/span&gt;&lt;span style=&#34;color:#e6db74&#34;&gt;`&lt;/span&gt;
    &lt;span style=&#34;color:#66d9ef&#34;&gt;const&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt; &lt;span style=&#34;color:#f92672&#34;&gt;=&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;get&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;)
    &lt;span style=&#34;color:#66d9ef&#34;&gt;if&lt;/span&gt; (&lt;span style=&#34;color:#f92672&#34;&gt;!&lt;/span&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;) {
      &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;.&lt;span style=&#34;color:#66d9ef&#34;&gt;set&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;resKey&lt;/span&gt;, [&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;])
    } &lt;span style=&#34;color:#66d9ef&#34;&gt;else&lt;/span&gt; {
      &lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
    }
    &lt;span style=&#34;color:#66d9ef&#34;&gt;return&lt;/span&gt; &lt;span style=&#34;color:#a6e22e&#34;&gt;map&lt;/span&gt;
  }, &lt;span style=&#34;color:#a6e22e&#34;&gt;groupMap&lt;/span&gt;)
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;Can you spot the difference? It’s this one simple change:&lt;/p&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;pre tabindex=&#34;0&#34; style=&#34;color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4&#34;&gt;&lt;code class=&#34;language-ts&#34; data-lang=&#34;ts&#34;&gt;&lt;span style=&#34;color:#a6e22e&#34;&gt;previous&lt;/span&gt;.&lt;span style=&#34;color:#a6e22e&#34;&gt;push&lt;/span&gt;(&lt;span style=&#34;color:#a6e22e&#34;&gt;row&lt;/span&gt;)
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;p&gt;&lt;code&gt;previous.push(row)&lt;/code&gt; mutates the existing array and brings our time down from 10 seconds per &lt;code&gt;groupBy&lt;/code&gt; down to around 10ms, 1000x faster than the original code 🚀.&lt;/p&gt;
&lt;p&gt;The takeaway here is that while spread is an expressive and convenient language feature, it’s important to see it for what it is at its core: a &lt;code&gt;for&lt;/code&gt; loop. If you see a spread inside of a loop, you’ll want to rewrite it if there is any chance it will operate on a large set of values.&lt;/p&gt;
&lt;h3 id=&#34;source-code&#34;&gt;Source code&lt;/h3&gt;
&lt;p&gt;You can see the full PR here &lt;a href=&#34;https://github.com/TanStack/table/pull/4495&#34;&gt;https://github.com/TanStack/table/pull/4495&lt;/a&gt;. Because it was such a small fix, the majority of the PR is just a spec to guard against a future performance regression.&lt;/p&gt;
&lt;p&gt;And here’s the &lt;a href=&#34;https://replit.com/@JPCamara/Tanstack-GroupBy-Performance?s=app&#34;&gt;source code&lt;/a&gt; for each attempt, runnable on the Replit platform. If you run it there the timings are much slower because it runs on a lower-powered CPU than my local machine, but as a relative measure the results are consistent with the numbers in this post.&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I also find it can be hard to get a good sense of a profile when profiling React code, since certain internal React code paths are hit so often that the results are tough to decipher.&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;It’s non-standard, but in the future I need to remember to reach for &lt;code&gt;console.profile&lt;/code&gt; as well.&lt;/p&gt;
&lt;p&gt;That could have been helpful here for profiling smaller blocks of code rather than deciphering the whole React render stack.&lt;/p&gt;
&lt;p&gt;Considering how slow the code was, it still may not have been usable anyways.&lt;/p&gt;
&lt;p&gt;You can learn more about it here: &lt;a href=&#34;https://developer.mozilla.org/en-US/docs/Web/API/console/profile&#34;&gt;https://developer.mozilla.org/en-US/docs/Web/API/console/profile&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;What I’ve found is that Javascript engines have gotten so optimized that a &lt;code&gt;forEach&lt;/code&gt;/&lt;code&gt;map&lt;/code&gt;/&lt;code&gt;reduce&lt;/code&gt; call can be as efficient as a hand-written &lt;code&gt;for&lt;/code&gt; loop. They’re almost never the performance problem in Javascript, least of all in a large set of iterations where the JIT can aggressively optimize.&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I should note that the performance was at its worst for poorly distributed groupings. By that I mean if you had a grouped column with only one value, then the grouping is concentrated into one key with all 50,000 values. If there were lots of unique values in the column, the arrays were smaller and built more quickly.&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The V8 internals are, unsurprisingly, very complicated. Logic for spread seems to exist in a variety of places because it’s such a core feature, so I wasn’t able to track down a specific implementation.&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;If “Big O” complexity doesn’t mean anything to you, there are lots of great resources online for it, and it’s a great concept to be familiar with. The basic idea is that it helps you identify how costly a particular piece of code will be to run.&lt;/p&gt;
&lt;p&gt;For a deep dive into it, &lt;a href=&#34;https://frontendmasters.com/courses/algorithms/&#34;&gt;https://frontendmasters.com/courses/algorithms/&lt;/a&gt; is a great free resource&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;In terms of algorithmic complexity, O(n^2) and O(n^2/2) are considered equivalent, since you generally drop the constant (in this case, 1/2).&lt;/p&gt;
&lt;p&gt;But from a practical perspective, if you’re going to have a slow algorithm you’d still rather have it be half as slow as its full potential slowness 😅&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2023/2ab8d73f1c.png)

A few months back I was working on a Javascript frontend for a large dataset using [Tanstack Table](https://tanstack.com/table/v8). The relevant constraints were:

- Up to 50k rows of content
- Grouped by up to 3 columns

Using React and virtualized rendering, showing 50k rows was performing well. But when the Tanstack Table grouping feature was enabled, I was seeing slowdowns on a few thousand rows, and _huge_ slowdowns on 50k rows.

It might have gone unnoticed if it was 100ms slower, or even 500ms slower. But in the worst case renders would go from less than a second without grouping up to _30-40 seconds_ with it.

## Tracking down the issue
Initially I tried using the Chrome Javascript profiler, but it can be tough to use when performance is so slow. The profiler adds noticeable overhead to your code, and since the code took 30-40 seconds already, it was basically unusable[^1].

Unable to use the profiler, I reached for an old, simple standby: `console.time`[^2]. This is a convenient way to see how long a section of code takes, logged to your console: 

```ts
console.time(&#39;expensive code&#39;);
thisIsExpensive();
console.timeEnd(&#39;expensive code&#39;);
// console.time
//   expensive code: 1000ms
```

&gt; **A side note about optimizing code**: as programmers we are _full_ of ideas about what needs to be optimized and are _terrible_ at getting it right.

We make educated guesses at what is important to optimize and where the problem is, and until we measure we’re usually wrong. I try now to wrap everything possible in a benchmark to make sure I’m even in the right place, then start narrowing the benchmark inside of the code from there. 

When tracking down a performance issue this would be a general outline:

```ts
console.time(&#39;everything&#39;);
elements.forEach(() =&gt; {
  console.time(&#39;methodCall&#39;);
  methodCall(() =&gt; {
    console.time(&#39;build&#39;);
    build();
    console.timeEnd(&#39;build&#39;);
  });
  console.timeEnd(&#39;methodCall&#39;);
});
console.timeEnd(&#39;everything&#39;);
// build      49ms
// methodCall 50ms
// build      51ms
// methodCall 52ms
// everything 102ms
```

&gt; **Back to the table rendering**

As usual, before measuring, all of my guesses at the potential performance problem were wrong.

- My own code was fine - this was actually a library bug, which was a surprise. There was no code path of mine that was performing poorly; all of the time was spent in the library. When using Tanstack Table in React, all of the logic happens in a pretty centralized place - the `useReactTable` hook - so it was easy to see the time was all there:

```ts
console.time(&#39;everything&#39;);
customCode();
console.time(&#39;useReactTable&#39;);
useReactTable(...);
console.timeEnd(&#39;useReactTable&#39;);
console.timeEnd(&#39;everything&#39;);
// useReactTable 31500 ms
// everything    31537 ms
```

- One of the _nicest_ things about developing Javascript with packages is that at any time I can open up the `node_modules` folder and play around with third party code. In this case I was able to modify the Tanstack Table source code directly to add some timing information.
- Turning on grouping was when everything slowed down, so it made the most sense to start timing that code 

This is an abbreviated version of the grouped row source code, with my first pass at timing what I thought were the likeliest culprits. Pay attention primarily to `console.time` statements - you don’t have to understand everything going on.

```ts
function getGroupedRowModel&lt;TData extends RowData&gt;() {
  console.time(&#39;everything&#39;);
  //...
	
  console.time(&#39;grouping filter&#39;)
  const existingGrouping = grouping.filter(columnId =&gt;
    table.getColumn(columnId)
  )
  console.timeEnd(&#39;grouping filter&#39;)

  const groupUpRecursively = (
    rows: Row&lt;TData&gt;[],
    depth = 0,
    parentId?: string
  ) =&gt; {
    if (depth &gt;= existingGrouping.length) {
      return rows.map(row =&gt; {
        row.depth = depth
        //...
        if (row.subRows) {
          console.time(&#39;subRows&#39;)
          row.subRows = groupUpRecursively(
            row.subRows,
            depth + 1
          )
          console.timeEnd(&#39;subRows&#39;)
        }
        return row
      })
    }

    const columnId: string = existingGrouping[depth]!
    const rowGroupsMap = groupBy(
      rows,
      columnId
    )

    const aggregatedGroupedRows = Array.from(rowGroupsMap.entries()).map(([groupingValue, groupedRows], index) =&gt; {
      let id = `${columnId}:${groupingValue}`
      id = parentId ? `${parentId}&gt;${id}` : id
      console.time(
        &#39;aggregatedGroupedRows groupUpRecursively&#39;
      )
      const subRows = groupUpRecursively(
        groupedRows,
        depth + 1,
        id
      )
      console.timeEnd(
        &#39;aggregatedGroupedRows groupUpRecursively&#39;
      )
      //...
    })
    //...
  }
  console.timeEnd(&#39;everything&#39;);
}
```

My hunch was that the `groupUpRecursively` function was to blame - it made logical sense that tens of thousands of recursive calls could cause a slowdown (**spoiler**: as usual, I was wrong 😑).

The first pass was a bust - it logged thousands of the `subRows` timers. Every iteration was fast and there were too many of them to be useful, so I cut it:

	console.time
	 subRows: 0 ms
	   at map (packages/table-core/src/utils/getGroupedRowModel.ts:48:25)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)
	
	console.time
	 subRows: 0 ms
	   at map (packages/table-core/src/utils/getGroupedRowModel.ts:48:25)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)
	       at Array.map (&lt;anonymous&gt;)

Removing that, I started to get closer. I was accounting for _most_ of the time but I had two problems: I wasn’t accounting for _all_ of the time (`everything` was 33 seconds and `groupUpRecursively` was only 23 seconds) and the chunk of time I was logging was too large to usefully identify the problem code:

	console.time
	  grouping filter: 1 ms
	    at fn (packages/table-core/src/utils/getGroupedRowModel.ts:22:17)
	
	console.time
	  aggregatedGroupedRows groupUpRecursively: 23248 ms
	    at map (packages/table-core/src/utils/getGroupedRowModel.ts:71:23)
	        at Array.map (&lt;anonymous&gt;)
	        at Array.map (&lt;anonymous&gt;)
	        at Array.map (&lt;anonymous&gt;)
	
	console.time
	  everything: 33509 ms

I realized I had missed a function call - a little unassuming function called `groupBy` - so I added a `console.time` block there next:

```ts
console.time(&#39;groupBy&#39;)
const rowGroupsMap = groupBy(rows, columnId)
console.timeEnd(&#39;groupBy&#39;)
```

Got it! Almost the entirety of the 31 seconds was concentrated into 3 calls to `groupBy`.

	console.time
	  grouping filter: 2 ms
	    at fn (packages/table-core/src/utils/getGroupedRowModel.ts:22:17)
	
	console.time
	  groupBy: 10279 ms
	    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)
	
	console.time
	  groupBy: 10868 ms
	    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)
	        at Array.map (&lt;anonymous&gt;)
	
	console.time
	  groupBy: 10244 ms
	    at groupUpRecursively (packages/table-core/src/utils/getGroupedRowModel.ts:59:19)
	        at Array.map (&lt;anonymous&gt;)
	        at Array.map (&lt;anonymous&gt;)
	
	console.time
	  aggregatedGroupedRows groupUpRecursively: 21159 ms
	    at map (packages/table-core/src/utils/getGroupedRowModel.ts:71:23)
	        at Array.map (&lt;anonymous&gt;)
	        at Array.map (&lt;anonymous&gt;)
	
	console.time
	  everything: 31537 ms

For each grouped column, it was calling `groupBy` and each call took roughly 10 seconds.

So what the heck was going on in the `groupBy` function that was causing such a massive slowdown? Can you pick it out?

```ts
function groupBy&lt;TData extends RowData&gt;(rows: Row&lt;TData&gt;[], columnId: string) {
  const groupMap = new Map&lt;any, Row&lt;TData&gt;[]&gt;()
	
  return rows.reduce((map, row) =&gt; {
    const resKey = `${row.getValue(columnId)}`
    const previous = map.get(resKey)
    if (!previous) {
      map.set(resKey, [row])
    } else {
      map.set(resKey, [...previous, row])
    }
    return map
  }, groupMap)
}
```

I started chopping up the function. I switched the `Map` to be an object literal in case that was causing some kind of memory overhead and tried changing to a `for` loop instead of `reduce` in case the amount of iterations plus closures was causing an issue[^3]. 
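
To make that concrete, here’s a minimal sketch of the kind of variant I tried (the name `groupByWithForLoop` is my own, purely for illustration). Note that the spread survives both changes, which in hindsight is exactly why neither one helped:

```ts
// object literal + for loop, but still spreading on every insert
function groupByWithForLoop&lt;TData extends RowData&gt;(rows: Row&lt;TData&gt;[], columnId: string) {
  const groupMap: Record&lt;string, Row&lt;TData&gt;[]&gt; = {}
  for (let i = 0; i &lt; rows.length; i++) {
    const row = rows[i]!
    const resKey = `${row.getValue(columnId)}`
    const previous = groupMap[resKey]
    if (!previous) {
      groupMap[resKey] = [row]
    } else {
      groupMap[resKey] = [...previous, row] // the spread is still here
    }
  }
  return groupMap
}
```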

Those had no effect so next I started commenting out lines of the function. When I commented out this line, all of a sudden everything started finishing instantly:

```ts
// map.set(resKey, [...previous, row])
```

### What was the purpose of that line and why was it so slow?

```ts
const resKey = `${row.getValue(columnId)}`
const previous = map.get(resKey)
if (!previous) {
  map.set(resKey, [row])
} else {
  map.set(resKey, [...previous, row])
}
```

On each iteration of the `reduce` call, the code:

- Used the value of that column cell as a map key. Let’s say the value is the string “New York”
- If there was no value associated with “New York”, it would set a value of the current `row` wrapped in an array 
- If there was already a value, it would use the Javascript [spread operator](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Operators/Spread_syntax#spread_in_array_literals) to concatenate the current `row` onto the end of the previous array value
- This means that on each iteration of `reduce`, the spread operator was creating a new, incrementally larger array, getting slower with each iteration[^4] 

```ts
// 1st iteration
[...previous, row] =&gt; [1,2]
// 2nd iteration
[...previous, row] =&gt; [1,2,3]
// 3rd iteration
[...previous, row] =&gt; [1,2,3,4]
// ...
// 50,000th iteration
[...previous, row] =&gt; [1, 2, ...49999, 50000]
```

## What does spread do behind the scenes?
In our case the spread operator takes an [iterable object](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Iteration_protocols#built-in_iterables) and loops over it to create a new array. It’s probably more nuanced than this, but the spread operator is likely [O(n)](https://frontendmasters.com/courses/algorithms/): as the array grows, spreading it into a new array takes proportionally longer.

I’m _certain_ there are additional efficiencies provided in the language internals but the following code is essentially equivalent[^5]:

```ts
const a = [1, 2, 3, 4, 5, 6, 7];
	
// spread
const b = [...a, 8];
	
// manual loop
let c = [];
	
for (let i = 0; i &lt; a.length; i++) {
  c.push(a[i]);
}
c.push(8);
```

Which means that the original version of that `groupBy` code was equivalent to this:

```ts
// original code
map.set(resKey, [...previous, row])
	
// manual spread
for (let i = 0; i &lt; rows.length; i++) {
  const tempPrevious = [];
  for (let j = 0; j &lt; previous.length; j++) {
    tempPrevious.push(previous[j]);
  }
  tempPrevious.push(rows[i]);
  previous = tempPrevious;
  map.set(resKey, tempPrevious);
}
```

Seeing the manual spread, one thing sticks out immediately: we have a nested loop.

A nested loop is a good shorthand for identifying one of the slower forms of algorithmic complexity: a “Big O” complexity of O(n^2)[^6]. This means that for an array size of 50,000, the number of iterations required would be 50,000^2, or 2.5 _billion_ iterations 😱.

In our specific case the array started out empty and grew to a size of 50,000, so it wasn’t _quite_ as bad. Technically it was still a complexity of O(n^2), but _practically_ it was O(n^2/2)[^7]. Either way it still meant:

- 1,249,975,000, or 1 _billion_ 249 _million_ iterations (sanity-checked in the sketch below)
- _49,999_ discarded arrays allocated for _1,249,925,001_ entries
- Times three `groupBy` calls that means _3,749,925,000_ iterations, _149,997_ discarded arrays and _3,749,775,003_ allocated entries
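
The iteration total is just the triangular number formula, which you can verify with a few lines of arithmetic (the variable names here are my own):

```ts
// the kth spread copies an array of size k, so one worst-case groupBy does
// 1 + 2 + ... + (n - 1) = n * (n - 1) / 2 copy iterations
const n = 50_000
const iterationsPerGroupBy = (n * (n - 1)) / 2
console.log(iterationsPerGroupBy)     // 1249975000 (~1.25 billion)
console.log(iterationsPerGroupBy * 3) // 3749925000, across the three groupBy calls
```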

Pour one out for the garbage collector and Javascript runtime 🙅🏻‍♂️☠️🙅🏻‍♂️. Honestly, for how many iterations and how much garbage we accumulated, it was operating pretty well 🫠.

So the question was - what could be done to improve it?

### Spread is an immutable pattern, so should the code be kept immutable?
Aside from providing a convenient syntax for combining iterables, the spread operator also allows us to keep code immutable. That’s beneficial because avoiding mutations means we never change data out from under other code that holds a reference to it.
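
As a tiny, self-contained illustration of the hazard immutability avoids (hypothetical values, nothing from the library):

```ts
const original = [1, 2, 3]
const alias = original        // a second reference to the *same* array

alias.push(4)                 // mutation: visible through both references
console.log(original)         // [1, 2, 3, 4]

const copy = [...original, 5] // spread: a brand new array
console.log(original)         // still [1, 2, 3, 4]
console.log(copy)             // [1, 2, 3, 4, 5]
```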

If this code was written to be immutable, there were some other options we could try to improve performance which would behave the same as the spread operator:

```ts
// Array.from()
//   ~10 seconds, just as slow 😒
const arr = Array.from(previous)
arr.push(row)
map.set(resKey, arr)
	
// slice()
//   ~10 seconds, just as slow 🙃
const arr = previous.slice()
arr.push(row)
map.set(resKey, arr)
	
// concat(row)
//   ~10 seconds, just as slow 🧐
map.set(resKey, previous.concat(row))
	
// hand written for loop
//   ~14 seconds, slower than spread! 😱 
let arr = []
for (let i = 0; i &lt; previous.length; i++) {
  arr.push(previous[i])
}
arr.push(row)
map.set(resKey, arr)
	
// concat([row])
//   ~4 seconds 🏃‍♂️ 
map.set(resKey, previous.concat([row]))
	
// using the `immer` node package
//   ~100ms 🚀 
return produce(groupMap, (draft) =&gt; {
  return rows.reduce((map, row) =&gt; {
    const resKey = `fixedKey`
    let previous = map.get(resKey)
    if (!previous) {
      map.set(resKey, [row])
    } else {
      previous.push(row)
    }
    return map
  }, draft)
})
```

`Array.from`, `slice`, and `concat` all had the same performance characteristics as the spread operator. There were a few surprises however:

- A manual `for` loop was the slowest option, coming in at around 14 seconds. The native code versions of these methods are clearly much more optimized than the manual loop we can write
- `concat`, when handed an array argument (`concat([row])`) instead of a bare object (`concat(row)`), was consistently faster than spread on any V8 platform (Node/Chrome). It took half the time! Either there was a hard-to-find mistake in my code, or the runtime has a special optimization for this use case. Still too slow, however
- [Using `immer`](https://github.com/immerjs/immer) performed significantly faster than all other options. `immer` provides regular Javascript interfaces on immutable data using [proxies](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Proxy) and [structural sharing](https://blog.klipse.tech/javascript/2021/02/26/structural-sharing-in-javascript.html) to make efficient changes to data structures without changing the original source object. However in this case the main benefit of the `immer` approach is that we were not copying any arrays - just `push`ing directly to them because they were created inline in our function

But did we even need immutability at all? Was the `immer` approach good enough? Could we get even faster?

### Using push directly
For performance, you can’t beat mutations. 

Mutations modify data in place, without copying and creating duplicated data. Looking over our `groupBy` function, there wasn’t any benefit to using immutable operations. Everything was created in the function so we could safely mutate it before we returned it. I don’t think the original code was written with immutability in mind - using the spread syntax was just convenient.

When using array `push`, we are effectively running with an algorithmic complexity of O(1) (amortized - the array occasionally has to grow its backing storage, but on average each insert is constant time). That means as our array grows, the cost of inserting new elements does not: it’s as fast for 1 item as it is for 50,000.
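
If you want to feel the difference yourself, here’s a small standalone benchmark sketch (the names and row count are mine, not from the library) - on any recent engine the push version should finish orders of magnitude faster:

```ts
// accumulate 50,000 items into one array, two ways
const allRows = Array.from({ length: 50_000 }, (_, i) =&gt; i)

console.time(&#39;spread&#39;)        // O(n^2): copies the whole array on every insert
let viaSpread: number[] = []
for (const row of allRows) {
  viaSpread = [...viaSpread, row]
}
console.timeEnd(&#39;spread&#39;)

console.time(&#39;push&#39;)          // amortized O(1) per insert, O(n) overall
const viaPush: number[] = []
for (const row of allRows) {
  viaPush.push(row)
}
console.timeEnd(&#39;push&#39;)
```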

Here’s the final version of the fix, now 1000x faster for my use case:

```ts
function groupBy&lt;TData extends RowData&gt;(rows: Row&lt;TData&gt;[], columnId: string) {
  const groupMap = new Map&lt;any, Row&lt;TData&gt;[]&gt;()
	
  return rows.reduce((map, row) =&gt; {
    const resKey = `${row.getValue(columnId)}`
    const previous = map.get(resKey)
    if (!previous) {
      map.set(resKey, [row])
    } else {
      previous.push(row)
    }
    return map
  }, groupMap)
}
```

Can you spot the difference? It’s this one simple change:

```ts
previous.push(row)
```

`previous.push(row)` mutates the existing array and brings our time down from 10 seconds per `groupBy` to around 10ms, 1000x faster than the original code 🚀.

The takeaway here is that while spread is an expressive and convenient language feature, it’s important to see it for what it is at its core: a `for` loop. If you see a spread inside of a loop, you’ll want to rewrite it if there is any chance it will operate on a large set of values.

### Source code
You can see the full PR here [https://github.com/TanStack/table/pull/4495](https://github.com/TanStack/table/pull/4495). Because it was such a small fix, the majority of the PR is just a spec to guard against a future performance regression.

And here’s the [source code](https://replit.com/@JPCamara/Tanstack-GroupBy-Performance?s=app) for each attempt, runnable on the Replit platform. If you run it there the timings are much slower because it runs on a lower-powered CPU than my local machine, but as a relative measure the results are consistent with the numbers in this post.

[^1]:	I also find it can be hard to get a good sense of a profile when profiling React code, since certain internal React code paths are hit so often that the results are tough to decipher

[^2]:	It’s non-standard, but in the future I need to remember to reach for `console.profile` as well.

    That could have been helpful here for profiling smaller blocks of code rather than deciphering the whole React render stack.

    Considering how slow the code was, it still may not have been usable anyways.

    You can learn more about it here: [https://developer.mozilla.org/en-US/docs/Web/API/console/profile](https://developer.mozilla.org/en-US/docs/Web/API/console/profile)

[^3]:	What I’ve found is that Javascript engines have gotten so optimized that a `forEach`/`map`/`reduce` call can be as efficient as a hand-written `for` loop. They’re almost never the performance problem in Javascript, least of all in a large set of iterations where the JIT can aggressively optimize.

[^4]:	I should note that the performance was at its worst for poorly distributed groupings. By that I mean if you had a grouped column with only one value, then the grouping is concentrated into one key with all 50,000 values. If there were lots of unique values in the column, the arrays were smaller and built more quickly.

[^5]:	The V8 internals are, unsurprisingly, very complicated. Logic for spread seems to exist in a variety of places because it’s such a core feature, so I wasn’t able to track down a specific implementation.

[^6]:	If “Big O” complexity doesn’t mean anything to you, there are lots of great resources online for it, and it’s a great concept to be familiar with. The basic idea is that it helps you identify how costly a particular piece of code will be to run.

    For a deep dive into it, [https://frontendmasters.com/courses/algorithms/](https://frontendmasters.com/courses/algorithms/) is a great free resource 

[^7]:	In terms of algorithmic complexity, O(n^2) and O(n^2/2) are considered equivalent, since you generally drop the constant (in this case, 1/2).

    But from a practical perspective, if you’re going to have a slow algorithm you’d still rather have it be half as slow as its full potential slowness 😅
</source:markdown>
    </item>
    
    <item>
      <title>Kicking the social media habit with “one sec”</title>
      <link>https://jpcamara.com/2023/01/31/kicking-the-social.html</link>
      <pubDate>Thu, 09 Feb 2023 17:30:54 -0500</pubDate>
      
      <guid>http://jpcamara.micro.blog/2023/01/31/kicking-the-social.html</guid>
      <description>&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/fc557bc79e.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;Twitter. Oh how I could stroll that infinite corridor of information.&lt;/p&gt;
&lt;p&gt;Ever since having kids my personal time has been squeezed into a tiny ball. Still, I’d finish putting the kids to bed, open my phone, and where would I end up? On Twitter for 45 minutes. An hour. Hour 30. I’d sit there feeding whatever meager morsels of free time I had left to that tiny blue bird, it gobbling them up like some kind of attention soaked black oil sunflower seeds&lt;sup id=&#34;fnref:1&#34;&gt;&lt;a href=&#34;#fn:1&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Twitter was my go-to. I could certainly get lost on Instagram, or TikTok. But something about those platforms felt more wasteful. I wasn’t doing anything “worthwhile” there. Being on Twitter I could create warped justifications that I was actually learning things from people. Many times I was. But it was just as easy to get lost in pointless wanderings, sucked into controversy I didn’t care about 5 minutes earlier. And even when I learned something, was it the best use of my time?&lt;/p&gt;
&lt;p&gt;During one of these info dump excursions, I saw a tweet about an app called “&lt;a href=&#34;https://one-sec.app&#34;&gt;one sec&lt;/a&gt;”. It got me curious - taking a deep breath, delaying distracting apps&amp;hellip; adding friction?&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/1757e85a0d.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;h2 id=&#34;-isnt-that-basically-screen-time&#34;&gt;&amp;hellip; isn’t that basically Screen Time?&lt;/h2&gt;
&lt;p&gt;It sounded interesting, but was it really any different from screen time limits?&lt;/p&gt;
&lt;p&gt;I have had screen time limits on social media since that feature first released on iOS. At worst it is a minor annoyance that I hardly notice myself dismissing. At best it occasionally saves me after already wandering aimlessly for ages.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/74c1e8b95c.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;My rock solid screen time workflow:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You’ve reached your limit!&lt;/li&gt;
&lt;li&gt;tap “Remind me in 15 minutes”&lt;/li&gt;
&lt;li&gt;You’ve reached your limit!&lt;/li&gt;
&lt;li&gt;tap for 15 more&lt;/li&gt;
&lt;li&gt;You’ve reached your limit!&lt;/li&gt;
&lt;li&gt;Ugh, I just want to finish reading one thing - I can probably do it in 1 minute?&lt;/li&gt;
&lt;li&gt;You’ve reached your limit!&lt;/li&gt;
&lt;li&gt;&amp;hellip;give up and tap “Ignore limit for today”&lt;sup id=&#34;fnref:2&#34;&gt;&lt;a href=&#34;#fn:2&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;So what about “one sec” was going to make that any different?&lt;/p&gt;
&lt;p&gt;The difference I found with “one sec” is specifically related to what they advertise: &lt;strong&gt;&lt;em&gt;added friction&lt;/em&gt;&lt;/strong&gt;. In full they describe it as “added friction makes distracting apps less appealing.”&lt;/p&gt;
&lt;p&gt;Screen time does not make apps less appealing. Screen time is almost completely frictionless. How much friction does quickly tapping a screen cause? I’ve extended my screen time so many times over the years I probably could tap each option by muscle memory.&lt;/p&gt;
&lt;p&gt;“One sec” is &lt;em&gt;all&lt;/em&gt; friction. Here’s an example of what it’s like when you try to open an app connected to “one sec”:&lt;/p&gt;
&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/1f0f8c7e10.gif&#34; width=&#34;323&#34; height=&#34;700&#34; alt=&#34;&#34; /&gt;
&lt;p&gt;That’s actually sped up a bit - the default is a 6-10 second animation. See how you are forced to sit and wait? That waiting is the key. It’s the friction they describe. I literally &lt;em&gt;cannot&lt;/em&gt; get into that app any faster, and then I still have to decide whether I move forward once I finish waiting.&lt;/p&gt;
&lt;p&gt;You sit there for 10 seconds - it’s intentionally aggravating, and it forces you to question how committed you are to opening the app. Does it really matter to me?&lt;/p&gt;
&lt;p&gt;Here’s my new workflow with “one sec”:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Open app&lt;/li&gt;
&lt;li&gt;“One sec” pops open&lt;/li&gt;
&lt;li&gt;Take a deep breath. Wait for the screen to reveal my options&lt;/li&gt;
&lt;li&gt;Phew, still waiting&lt;/li&gt;
&lt;li&gt;This is only 10 seconds? Do I really want to open the app this badly?&lt;/li&gt;
&lt;li&gt;Ok, moment of truth. Close or continue on?&lt;/li&gt;
&lt;li&gt;Here I usually just drop out, but sometimes I “Continue to ‘app’”. Now I still have to make one more decision. What’s the reason I’m using it?&lt;/li&gt;
&lt;li&gt;Not worth it, I’m out 👋🏼&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/89d2edbf86.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;For the rare times I actually stick around, I love the intention prompt. If the initial friction of waiting isn’t enough, here is one more layer of confronting my habit. You have a base set of intentions, and I’ve created a custom prompt for the one thing I ever do anymore on Instagram.&lt;/p&gt;
&lt;p&gt;I don’t remember if this was on by default, but if it isn’t, you should absolutely enable it through “Intention Tracking”.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/505cbf1b28.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;h2 id=&#34;is-there-anything-more-to-it-than-friction&#34;&gt;Is there anything more to it than friction?&lt;/h2&gt;
&lt;p&gt;Why would friction be so much more effective than screen time limits? Why would it be more powerful than just willing yourself to leave social media?&lt;/p&gt;
&lt;p&gt;This excerpt from James Clear’s “Atomic Habits” sums it up really well:&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://jamesclear.com/power-of-environment&#34;&gt;Your environment is more important than your willpower&lt;/a&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It is important to remember that the environment drives our good behaviors as well as our bad ones. People who seem to stick to good habits with ease are often benefitting from an environment that makes those behaviors easier.&lt;/p&gt;
&lt;p&gt;Atomic Habits - Chapter 12&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;“One sec” &lt;em&gt;changes&lt;/em&gt; your environment. Your typical phone environment is every app instantly available with a tap. With “one sec”, you are forced to evaluate that tap instead of mindlessly hopping around.&lt;/p&gt;
&lt;p&gt;“One sec” has also been analyzed in two different academic studies about its impact on social media usage:&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://one-sec.app/max-planck-study/&#34;&gt;Social media usage cut in half through “one sec” intervention&lt;/a&gt;&lt;sup id=&#34;fnref:3&#34;&gt;&lt;a href=&#34;#fn:3&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;h3 id=&#34;friction-causes-behavioral-change&#34;&gt;Friction causes behavioral change&lt;/h3&gt;
&lt;p&gt;By taking out the instant gratification, the urge for unintentional social media usage fades away over time: user’s brain now connects opening Instagram with something unpleasant, having to wait 10 seconds during the breathing intervention.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The takeaway: instant gratification (tap in and space out) is replaced with an unpleasant experience (waiting and being forced to acknowledge your own motivators). You are actually &lt;em&gt;re-wiring&lt;/em&gt; your brain through this change. I can attest to that - the thought of opening apps and waiting through the “one sec” interstitial is pretty unpleasant, and stops me before I even consider opening them.&lt;/p&gt;
&lt;h2 id=&#34;how-do-i-use-it&#34;&gt;How do I use it?&lt;/h2&gt;
&lt;p&gt;For such a powerful app, the setup is a bit clunky.&lt;/p&gt;
&lt;p&gt;As far as I can tell this isn’t the fault of “one sec” - it’s likely not possible to block one app with another app just by having it installed. If it were, you could completely block a person’s phone by convincing them to download your app.&lt;/p&gt;
&lt;p&gt;To enable “one sec” you need the &lt;a href=&#34;https://apps.apple.com/us/app/one-sec-delay-distracting-apps/id1532875441&#34;&gt;one sec&lt;/a&gt; app and the &lt;a href=&#34;https://apps.apple.com/us/app/shortcuts/id1462947752&#34;&gt;Shortcuts&lt;/a&gt; app. You create a personal automation that triggers “one sec” any time you go to open a specific application.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/062785fb14.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;As of writing this I’m up to 8 automations connecting to “one sec”:&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/06763c912f.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The biggest benefit you’ll get out of it is by using the “pro” version, which currently costs me about $16/yr. It’s absolutely worth it, but you can also use the free version to try out the behavior against a single app and see how you like it.&lt;/p&gt;
&lt;p&gt;“One sec” displays the number of times it’s prevented you from opening each app, and estimates time saved. It’s interesting to analyze preventions, though it’s saved me way more time than it takes credit for.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/4957b9d917.jpg&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;p&gt;The &lt;a href=&#34;https://one-sec.app&#34;&gt;one sec site&lt;/a&gt; has great documentation, and the app offers videos on setup, so you can read and watch those once you download it.&lt;/p&gt;
&lt;h2 id=&#34;what-ive-been-doing-instead&#34;&gt;What I’ve been doing instead&lt;/h2&gt;
&lt;p&gt;The clearest change for me is that I’m even writing this. It’s been years since I’ve written anything, and I’ve invested some of my time into a new writing workflow.&lt;/p&gt;
&lt;p&gt;As well I’ve been learning a new programming language, reading books, exercising, and just generally formulating uses of my time that align with my values.&lt;/p&gt;
&lt;p&gt;Social media, even when not all consuming, is a time suck. You &lt;em&gt;probably&lt;/em&gt; have alternatives that would make you happier if only you left these platforms&lt;sup id=&#34;fnref:4&#34;&gt;&lt;a href=&#34;#fn:4&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;4&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;Oh and I’ve been &lt;strong&gt;sleeping&lt;/strong&gt; better. I have fewer options to distract me if I wake up at night - it’s a beautiful thing.&lt;/p&gt;
&lt;h2 id=&#34;throwing-the-baby-out-with-the-bath-water5&#34;&gt;Throwing the baby out with the bath water&lt;sup id=&#34;fnref:5&#34;&gt;&lt;a href=&#34;#fn:5&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;5&lt;/a&gt;&lt;/sup&gt;&lt;/h2&gt;
&lt;p&gt;Social media may be a huge time suck, but it’s not 100% useless. I think most people find some slices of learning, inspiration and connection on it somewhere.&lt;/p&gt;
&lt;p&gt;For me, Twitter was often a valuable learning resource. It was a useful tool for keeping up with technology trends that matter to me personally and professionally. Unfortunately, I can’t disentangle the good (learning) from the bad (aimless wandering). Social media is designed to suck us in and keep us there as long as possible - I can’t fight that programming, I can only avoid it.&lt;/p&gt;
&lt;p&gt;As a way to keep up with folks I follow without using Twitter directly, I’ve been using &lt;a href=&#34;https://mailbrew.com&#34;&gt;Mailbrew&lt;/a&gt;. It allows me to set up a digest based on my timeline, and I can also choose specific Twitter handles to keep up with. It doesn’t recreate the experience of spontaneous discovery, but that means it also doesn’t trigger the accompanying &lt;a href=&#34;https://rationalwiki.org/wiki/Variable_ratio&#34;&gt;variable ratio schedule&lt;/a&gt;&lt;sup id=&#34;fnref:6&#34;&gt;&lt;a href=&#34;#fn:6&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;6&lt;/a&gt;&lt;/sup&gt; with it.&lt;/p&gt;
&lt;h2 id=&#34;going-even-deeper-with-one-sec&#34;&gt;Going even deeper with “one sec”&lt;/h2&gt;
&lt;p&gt;In writing about “one sec”, I’ve dug into the app quite a bit. I’ve found that “one sec” is deeply configurable and filled with additional features. Blocking apps was my main draw but it has a variety of other features including:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Blocking websites with a safari extension&lt;/li&gt;
&lt;li&gt;Hooking into focus modes to completely block apps during those times&lt;/li&gt;
&lt;li&gt;Starting a “block session” in the app which blocks every app and website you have configured&lt;/li&gt;
&lt;li&gt;Triggering “don’t get lost” notifications if you’ve gone through to an app and have been there for a while&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;“One sec” can do even more and I’ve found myself expanding not only what it enables but also what apps I apply it to. I now have periods where I apply “one sec” to email, Slack and web browsers. I have my &lt;a href=&#34;https://tutorials.one-sec.app/focus-filters&#34;&gt;sleep focus&lt;/a&gt; set up to completely block them so I can’t wake up in the middle of the night and just hop over to bad habits.&lt;/p&gt;
&lt;p&gt;It’s starting to feel like a distraction reducing superpower.&lt;/p&gt;
&lt;h2 id=&#34;next-steps&#34;&gt;Next steps&lt;/h2&gt;
&lt;p&gt;Eventually, having subjected all apps on my phone to triggering “one sec”, I will finally setup “one sec” to trigger “one sec” whenever it opens. This will likely initiate an infinite loop that causes my phone to melt completely&lt;sup id=&#34;fnref:7&#34;&gt;&lt;a href=&#34;#fn:7&#34; class=&#34;footnote-ref&#34; role=&#34;doc-noteref&#34;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
&lt;p&gt;In this highly probable scenario, maybe I’ll just switch back to a flip phone?&lt;/p&gt;
&lt;p&gt;More likely I’ll buy a smartphone again&amp;hellip; but I know the first app I install will be “one sec”.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/cddf478f3a.png&#34; alt=&#34;&#34;&gt;&lt;/p&gt;
&lt;section class=&#34;footnotes&#34; role=&#34;doc-endnotes&#34;&gt;
&lt;hr&gt;
&lt;ol&gt;
&lt;li id=&#34;fn:1&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I suppose I’m being presumptuous in assuming the Twitter bird likes black oil sunflower seeds. The birds in my yard go wild for the stuff, so seems like a strong likelihood&amp;#160;&lt;a href=&#34;#fnref:1&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:2&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I do wonder if screen limits are even meant to &lt;em&gt;actually&lt;/em&gt; stop you, or just check off a box on device usability for Apple. All screen time tracking usually does is make you think “wow.. I use this phone too much”. There’s no behavior change. Information in isolation does not change behavior.&lt;/p&gt;
&lt;p&gt;This is a bit like mandates to make it clearer to consumers what the nutrition facts are for different foods so people can make “informed decisions”. This helps some people do that, but for most people it’s meaningless information that they completely ignore (or notice, but eat/drink it anyways and just feel bad about it later).&lt;/p&gt;
&lt;p&gt;It’s highly anecdotal but I’ve never met &lt;em&gt;anyone&lt;/em&gt; who found screen limits to be helpful.&amp;#160;&lt;a href=&#34;#fnref:2&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:3&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;This is from the “one sec” blog, which is a good read in general.&lt;/p&gt;
&lt;p&gt;Honestly some of their blog post illustrations are fantastic - much nicer than I expected such a utilitarian app to have. The referenced post in particular looks great, so here it is again &lt;a href=&#34;https://one-sec.app/max-planck-study/&#34;&gt;https://one-sec.app/max-planck-study/&lt;/a&gt;&amp;#160;&lt;a href=&#34;#fnref:3&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:4&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;If not, then get your social media on! Who am I to judge if it makes you genuinely happy?&amp;#160;&lt;a href=&#34;#fnref:4&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:5&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;The sentiment is right, but figures of speech are so weird. Also I like to think no one has ever actually done this 😰&amp;#160;&lt;a href=&#34;#fnref:5&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:6&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;&lt;a href=&#34;https://rationalwiki.org/wiki/Variable_ratio&#34;&gt;https://rationalwiki.org/wiki/Variable_ratio&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;I &lt;em&gt;think&lt;/em&gt; this tracks. Essentially using Twitter for me is like using a slot machine. I get hits of value, but on a variable schedule. Because I don’t know when the next hit will occur, but I know it probably &lt;em&gt;will&lt;/em&gt;, I keep going indefinitely.&amp;#160;&lt;a href=&#34;#fnref:6&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li id=&#34;fn:7&#34; role=&#34;doc-endnote&#34;&gt;
&lt;p&gt;I am genuinely curious whether it’s possible and if Apple or the “one sec” developer have handled it, but I’m too scared to try it 😅 I know my phone won’t &lt;em&gt;melt&lt;/em&gt;, but could I crash it?&amp;#160;&lt;a href=&#34;#fnref:7&#34; class=&#34;footnote-backref&#34; role=&#34;doc-backlink&#34;&gt;&amp;#x21a9;&amp;#xfe0e;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/section&gt;
</description>
      <source:markdown>![](https://cdn.uploads.micro.blog/98548/2023/fc557bc79e.jpg)

Twitter. Oh how I could stroll that infinite corridor of information.

Ever since having kids my personal time has been squeezed into a tiny ball. Still, I’d finish putting the kids to bed, open my phone, and where would I end up? On Twitter for 45 minutes. An hour. Hour 30. I’d sit there feeding whatever meager morsels of free time I had left to that tiny blue bird, it gobbling them up like some kind of attention soaked black oil sunflower seeds[^1].

Twitter was my go-to. I could certainly get lost on Instagram, or TikTok. But something about those platforms felt more wasteful. I wasn’t doing anything “worthwhile” there. Being on Twitter I could create warped justifications that I was actually learning things from people. Many times I was. But it was just as easy to get lost in pointless wanderings, sucked into controversy I didn’t care about 5 minutes earlier. And even when I learned something, was it the best use of my time?

During one of these info dump excursions, I saw a tweet about an app called “[one sec](https://one-sec.app)”. It got me curious - taking a deep breath, delaying distracting apps... adding friction? 

![](https://cdn.uploads.micro.blog/98548/2023/1757e85a0d.jpg)

## ... isn’t that basically Screen Time?
It sounded interesting, but was it really any different from screen time limits?

I have had screen time limits on social media since that feature first released on iOS. At worst it is a minor annoyance that I hardly notice myself dismissing. At best it occasionally saves me after already wandering aimlessly for ages.

![](https://cdn.uploads.micro.blog/98548/2023/74c1e8b95c.jpg)

My rock solid screen time workflow:

- You’ve reached your limit!
- tap “Remind me in 15 minutes”
- You’ve reached your limit!
- tap for 15 more
- You’ve reached your limit!
- Ugh, I just want to finish reading one thing - I can probably do it in 1 minute?
- You’ve reached your limit!
- ...give up and tap “Ignore limit for today”[^2]

So what about “one sec” was going to make that any different?

The difference I found with “one sec” is specifically related to what they advertise: **_added friction_**. In full they describe it as “added friction makes distracting apps less appealing.” 

Screen time does not make apps less appealing. Screen time is almost completely frictionless. How much friction does quickly tapping a screen cause? I’ve extended my screen time so many times over the years I probably could tap each option by muscle memory.

“One sec” is _all_ friction. Here’s an example of what it’s like when you try to open an app connected to “one sec”:

&lt;img src=&#34;https://cdn.uploads.micro.blog/98548/2023/1f0f8c7e10.gif&#34; width=&#34;323&#34; height=&#34;700&#34; alt=&#34;&#34; /&gt;

That’s actually sped up a bit - the default is a 6-10 second animation. See how you are forced to sit and wait? That waiting is the key. It’s the friction they describe. I literally _cannot_ get into that app any faster, and then I still have to decide whether I move forward once I finish waiting. 

You sit there for 10 seconds - it’s intentionally aggravating, and it forces you to question how committed you are to opening the app. Does it really matter to me?

Here’s my new workflow with “one sec”:

- Open app
- “One sec” pops open
- Take a deep breath. Wait for the screen to reveal my options
- Phew, still waiting
- This is only 10 seconds? Do I really want to open the app this badly?
- Ok, moment of truth. Close or continue on?
- Here I usually just drop out, but sometimes I “Continue to ‘app’”. Now I still have to make one more decision. What’s the reason I’m using it?
- Not worth it, I’m out 👋🏼 

![](https://cdn.uploads.micro.blog/98548/2023/89d2edbf86.jpg)

For the rare times I actually stick around, I love the intention prompt. If the initial friction of waiting isn’t enough, here is one more layer of confronting my habit. You have a base set of intentions, and I’ve created a custom prompt for the one thing I ever do anymore on Instagram.

I don’t remember if this was on by default, but if it isn’t, you should absolutely enable it through “Intention Tracking”.

![](https://cdn.uploads.micro.blog/98548/2023/505cbf1b28.jpg)

## Is there anything more to it than friction?
Why would friction be so much more effective than screen time limits? Why would it be more powerful than just willing yourself to leave social media?

This excerpt from James Clear’s “Atomic Habits” sums it up really well:

[Your environment is more important than your willpower](https://jamesclear.com/power-of-environment)

&gt; It is important to remember that the environment drives our good behaviors as well as our bad ones. People who seem to stick to good habits with ease are often benefitting from an environment that makes those behaviors easier.
&gt; 
&gt; Atomic Habits - Chapter 12

“One sec” _changes_ your environment. Your typical phone environment is every app instantly available with a tap. With “one sec”, you are forced to evaluate that tap instead of mindlessly hopping around.

“One sec” has also been analyzed in two different academic studies about its impact on social media usage:

[Social media usage cut in half through “one sec” intervention](https://one-sec.app/max-planck-study/)[^3]

&gt; ### Friction causes behavioral change
&gt; By taking out the instant gratification, the urge for unintentional social media usage fades away over time: user’s brain now connects opening Instagram with something unpleasant, having to wait 10 seconds during the breathing intervention.

The takeaway: instant gratification (tap in and space out) is replaced with an unpleasant experience (waiting and being forced to acknowledge your own motivators). You are actually _re-wiring_ your brain through this change. I can attest to that - the thought of opening apps and waiting through the “one sec” interstitial is pretty unpleasant, and stops me before I even consider opening them.

## How do I use it?
For such a powerful app, the setup is a bit clunky.

As far as I can tell this isn’t the fault of “one sec” - it’s likely not possible to block one app with another app just by having it installed. If it were, you could completely block a person’s phone by convincing them to download your app.

To enable “one sec” you need the [one sec](https://apps.apple.com/us/app/one-sec-delay-distracting-apps/id1532875441) app and the [Shortcuts](https://apps.apple.com/us/app/shortcuts/id1462947752) app. You create a personal automation that triggers “one sec” any time you go to open a specific application. 

![](https://cdn.uploads.micro.blog/98548/2023/062785fb14.jpg)

As of writing this I’m up to 8 automations connecting to “one sec”:

![](https://cdn.uploads.micro.blog/98548/2023/06763c912f.jpg)

The biggest benefit you’ll get out of it is by using the “pro” version, which currently costs me about $16/yr. It’s absolutely worth it, but you can also use the free version to try out the behavior against a single app and see how you like it.

“One sec” displays the number of times it’s prevented you from opening each app, and estimates time saved. It’s interesting to analyze preventions, though it’s saved me way more time than it takes credit for.

![](https://cdn.uploads.micro.blog/98548/2023/4957b9d917.jpg)

The [one sec site](https://one-sec.app) has great documentation, and the app offers videos on setup, so you can read and watch those once you download it.

## What I’ve been doing instead 
The clearest change for me is that I’m even writing this. It’s been years since I’ve written anything, and I’ve invested some of my time into a new writing workflow.

As well I’ve been learning a new programming language, reading books, exercising, and just generally formulating uses of my time that align with my values. 

Social media, even when not all consuming, is a time suck. You _probably_ have alternatives that would make you happier if only you left these platforms[^4]. 

Oh and I’ve been **sleeping** better. I have fewer options to distract me if I wake up at night - it’s a beautiful thing.

## Throwing the baby out with the bath water[^5]
Social media may be a huge time suck, but it’s not 100% useless. I think most people find some slices of learning, inspiration and connection on it somewhere.

For me, Twitter was often a valuable learning resource. It was a useful tool for keeping up with technology trends that matter to me personally and professionally. Unfortunately, I can’t disentangle the good (learning) from the bad (aimless wandering). Social media is designed to suck us in and keep us there as long as possible - I can’t fight that programming, I can only avoid it.

As a way to keep up with folks I follow without using Twitter directly, I’ve been using [Mailbrew](https://mailbrew.com). It allows me to set up a digest based on my timeline, and I can also choose specific Twitter handles to keep up with. It doesn’t recreate the experience of spontaneous discovery, but that means it also doesn’t trigger the accompanying [variable ratio schedule](https://rationalwiki.org/wiki/Variable_ratio)[^6] with it.

## Going even deeper with “one sec”
In writing about “one sec”, I’ve dug into the app quite a bit. I’ve found that “one sec” is deeply configurable and filled with additional features. Blocking apps was my main draw but it has a variety of other features including:

- Blocking websites with a safari extension
- Hooking into focus modes to completely block apps during those times
- Starting a “block session” in the app which blocks every app and website you have configured
- Triggering “don’t get lost” notifications if you’ve gone through to an app and have been there for a while

“One sec” can do even more and I’ve found myself expanding not only what it enables but also what apps I apply it to. I now have periods where I apply “one sec” to email, Slack and web browsers. I have my [sleep focus](https://tutorials.one-sec.app/focus-filters) set up to completely block them so I can’t wake up in the middle of the night and just hop over to bad habits.

It’s starting to feel like a distraction reducing superpower.

## Next steps
Eventually, having subjected all apps on my phone to triggering “one sec”, I will finally setup “one sec” to trigger “one sec” whenever it opens. This will likely initiate an infinite loop that causes my phone to melt completely[^7]. 

In this highly probable scenario, maybe I’ll just switch back to a flip phone? 

More likely I’ll buy a smartphone again... but I know the first app I install will be “one sec”.

![](https://cdn.uploads.micro.blog/98548/2023/cddf478f3a.png)


[^1]:	I suppose I’m being presumptuous in assuming the Twitter bird likes black oil sunflower seeds. The birds in my yard go wild for the stuff, so seems like a strong likelihood

[^2]:	I do wonder if screen limits are even meant to _actually_ stop you, or just check off a box on device usability for Apple. All screen time tracking usually does is make you think “wow.. I use this phone too much”. There’s no behavior change. Information in isolation does not change behavior.

    This is a bit like mandates to make it clearer to consumers what the nutrition facts are for different foods so people can make “informed decisions”. This helps some people do that, but for most people it’s meaningless information that they completely ignore (or notice, but eat/drink it anyways and just feel bad about it later).

    It’s highly anecdotal but I’ve never met _anyone_ who found screen limits to be helpful.

[^3]:	This is from the “one sec” blog, which is a good read in general.

    Honestly some of their blog post illustrations are fantastic - much nicer than I expected such a utilitarian app to have. The referenced post in particular looks great, so here it is again [https://one-sec.app/max-planck-study/](https://one-sec.app/max-planck-study/)

[^4]:	If not, then get your social media on! Who am I to judge if it makes you genuinely happy?

[^5]:	The sentiment is right, but figures of speech are so weird. Also I like to think no one has ever actually done this 😰

[^6]:	[https://rationalwiki.org/wiki/Variable\_ratio](https://rationalwiki.org/wiki/Variable_ratio)

    I _think_ this tracks. Essentially using Twitter for me is like using a slot machine. I get hits of value, but on a variable schedule. Because I don’t know when the next hit will occur, but I know it probably _will_, I keep going indefinitely.

[^7]:	I am genuinely curious whether it’s possible and if Apple or the “one sec” developer have handled it, but I’m too scared to try it 😅 I know my phone won’t _melt_, but could I crash it?
</source:markdown>
    </item>
    
  </channel>
</rss>
