In roughly chronological order:
I’ve developed a real affection for Rubocop over the last couple of years. (Sorry to my old coworkers and friends at Planning Center, who put up with my complaining about it back then!) What I’ve come to appreciate is:
For example, someone once wrote `Promise.all(...) do` instead of `Promise.all(...).then do`. The old code didn’t work at all. We added a Cop with an autocorrect implementation, so we could upgrade any mistakes automatically!

We have some GraphQL/GraphQL-Batch code for making authorization checks. It looks like this:
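Roughly, the check looked like this (a hedged sketch: `MiniPromise` stands in for GraphQL-Batch's promise class, and everything besides `authorized?` and `can_see?` is assumed):

```ruby
# A minimal promise stand-in, only for illustration:
class MiniPromise
  def initialize(&block)
    @block = block
  end

  def then(&fn)
    MiniPromise.new { fn.call(@block.call) }
  end

  def sync
    @block.call
  end
end

class BaseObjectType
  # Load the record (batched, in the real code), then check visibility
  # synchronously inside the promise:
  def self.authorized?(object, context)
    MiniPromise.new { object }.then do |record|
      context[:viewer].can_see?(record)
    end
  end
end
```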
The `authorized?` check returns a `Promise` (for GraphQL-Batch), and inside that promise, `.can_see?` returns `true` or `false` (synchronously).
However, to improve data access, we wanted to implement a new authorization code path:
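Based on the description, the new code path probably looked something like this (a hedged sketch; everything except `async_can_see?` is assumed):

```ruby
class BaseObjectType
  def self.authorized?(object, context)
    # Returns a Promise<true|false>; batching happens under the hood.
    context[:viewer].async_can_see?(object)
  end
end
```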
This new code path would improve the database access under the hood to use our batch loading system.
After implementing the codepath, how could we update the ~1000 call sites to use the new method?
The easiest solution would be find-and-replace, but that doesn’t quite work because of boolean logic with Promises. Some of our authorization checks combined two checks like this:
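A hedged sketch of such a combined check (`CommentType` and the record names are assumed, not from the post):

```ruby
class CommentType
  def self.authorized?(object, context)
    viewer = context[:viewer]
    # Two synchronous checks, combined with boolean logic:
    viewer.can_see?(object.record) && viewer.can_see?(object.other_record)
  end
end
```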
If we updated that to `async_can_see?`, that code would break, because `async_can_see?` always returns a `Promise`, which is truthy. That is:
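To see why, here's a minimal demonstration with a stand-in promise class (for illustration only):

```ruby
# A Promise object is truthy regardless of the value it will resolve to.
class MiniPromise
  def initialize(&block)
    @block = block
  end

  def sync
    @block.call
  end
end

a = MiniPromise.new { false }
b = MiniPromise.new { false }

!!(a && b)            # => true  (both objects are truthy)
!!(a.sync && b.sync)  # => false (the resolved values)
```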
That code always returns true, even if one of the promises would resolve to `false`. (The Ruby `Promise` object is truthy, and we don’t have access to the returned value until we call `promise.sync`.)
So, we have to figure out which code paths can be automatically upgraded.
Roughly, the answer is:
If an authorization returns the value of `.can_see?`, then we can replace that call with `.async_can_see?`.
This is true because GraphQL-Ruby is happy to receive `Promise<true|false>` – it will use its batching system to resolve it as late as possible.
So, how can we find cases when `.can_see?` is used as a return value? There are roughly two possibilities:

- explicit `return`s, which we don’t use often
- implicit returns

This post covers that second case, implicit returns. We want to find implicit returns which are just calls to `.can_see?`, and automatically upgrade them. (Some calls will be left over; we’ll upgrade those by hand.)
We assume that any code which is more complicated than just a call to `.can_see?` can’t be migrated, because it might depend on the synchronous return of `true|false`. We’ll revisit those by hand.
I knew I wanted two things:

- New code should use `async_can_see?` whenever possible
- Existing code should be upgraded to `async_can_see?` whenever it’s possible

Rubocop will do both of these things:

- Reporting offenses will require the new method in new code, addressing the first goal
- `def autocorrect` will fix existing violations, addressing the second goal

But it all depends on implementing the check well: can I find implicit returns? Fortunately, I only need to find them well enough: it doesn’t have to find every possible Ruby implicit return; it only has to find the ones actually used in the codebase!
By trial and error, here’s what I ended up with:
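As a rough sketch of the cop's core check (not the actual cop, which would be written against RuboCop's node APIs), here's how finding a `.can_see?` implicit return might look using Ruby's built-in AST parser; the helper names are assumptions:

```ruby
# Does this method's implicit return consist of just a `.can_see?` call?
def implicit_can_see_return?(source)
  ast  = RubyVM::AbstractSyntaxTree.parse(source)
  defn = find_node(ast, :DEFN)
  return false unless defn
  body = defn.children.last.children.last           # the method body
  return false if body.nil?
  body = body.children.last if body.type == :BLOCK  # last statement = implicit return
  body.type == :CALL && body.children[1] == :can_see?
end

# Depth-first search for the first node of a given type.
def find_node(node, type)
  return node if node.type == type
  node.children.each do |child|
    next unless child.is_a?(RubyVM::AbstractSyntaxTree::Node)
    found = find_node(child, type)
    return found if found
  end
  nil
end
```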
With this cop, `rubocop -a` will upgrade the easy cases in existing code, then I’ll track down the harder ones by hand.
I think the implementation could be improved by:

- Handling explicit `return`s. It wasn’t important for me because there weren’t any in this code base.
- Handling `next`, which could probably be treated the same way, since it exits `then` blocks.
- Flagging every use of `.can_see?`, not only the easy ones. I expect that some usages are inevitable, but better to require a `rubocop:disable` in that case to mark that it’s not best-practice.

(Full disclosure: we haven’t shipped this refactor yet. But I enjoyed the work on it so far, so I thought I’d write up what I learned!)
Did you know that a `return` in one Ruby method could affect the flow of another method? I discovered it today while hunting a GraphQL-Ruby bugfix. You can get more reliable behavior with `ensure`, if it’s appropriate.
Let’s imagine a simple instrumentation system, where a method wraps a block of code and tags it with a name:
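A hedged reconstruction of the wrapper (the names are assumed):

```ruby
def instrument_event(event_name)
  puts "begin: #{event_name}"
  result = yield
  puts "end: #{event_name}"
  result  # return the block's value to the caller
end
```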
You could use this to instrument a method call, for example:
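A usage sketch (hypothetical names; the wrapper is repeated so the snippet stands alone):

```ruby
def instrument_event(event_name)
  puts "begin: #{event_name}"
  result = yield
  puts "end: #{event_name}"
  result
end

def do_stuff
  :done
end

def do_stuff_with_instrumentation
  instrument_event("do-stuff") do
    do_stuff
  end
end

do_stuff_with_instrumentation  # returns :done, after printing both messages
```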
It prints the `begin` message, then the `end` message.
But what if you return early from the block? For example:
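A sketch of the early-return version (names assumed):

```ruby
def instrument_event(event_name)
  puts "begin: #{event_name}"
  result = yield
  puts "end: #{event_name}"  # never reached when the block returns early!
  result
end

def do_stuff_with_instrumentation(early)
  instrument_event("do-stuff") do
    # `return` in a block exits do_stuff_with_instrumentation itself,
    # unwinding straight past the rest of instrument_event:
    return :early if early
    :normal
  end
end
```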
If you instrument it without returning from inside the block, it logs normally, printing both the `begin` and `end` messages.
But, if you return early, you only get half the log: the `begin` message prints, and nothing else.
Where’s the `end` message?
Apparently, the `return` inside the inner method (`#do_stuff_with_instrumentation`) broke out of its own method and out of `#instrument_event`. I don’t know why it works like that.
If you refactor the instrumentation to use `ensure`, it won’t have this issue. Here’s the refactor:
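A sketch of the `ensure`-based wrapper (names assumed):

```ruby
def instrument_event(event_name)
  puts "begin: #{event_name}"
  yield
ensure
  # Runs even when the block returns early (or raises):
  puts "end: #{event_name}"
end

def do_stuff_with_instrumentation
  instrument_event("do-stuff") do
    return :early
  end
end
```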
Then, it prints normally, with both the `begin` and `end` messages.
Of course, this also changes the behavior of the method when errors happen. The `ensure` code will be called even if `yield` raises an error. So, it might not always be the right choice. (I bet you could use `$!` to detect a currently-raised error, though.)
This recipe isn’t perfect: when using raw milk, a bit of the cream still separates to the top while it’s culturing. (I’d rather have it all mixed in, but I guess you could call it “cream top”!)
My sources are:
I’d like to add pictures someday, but for now, I recommend the pictures on Fankhauser’s blog.
Yogurt will only be as good as what you put in it. I generally sterilize everything I’ll need for the recipe:
To sterilize these items, I either:
Sometimes I forget an item at this step, in which case I wash it as well as I can and hope for the best. I’ve never had anything really spoil, but I did have a batch that had a bit of kefir taste to it! I assume it had some yeast contamination. I still ate it 🤷.
Fankhauser recommends this step for pasteurization purposes, to kill unwanted bacteria. Brod & Taylor’s recipe prescribes a higher temperature (195°F) and a longer holding time in order to mess with the proteins and get a thicker yogurt. (I really don’t know how it accomplishes that. I read that it “denatures whey proteins”, and anyhow, I’m convinced because this is the same temperature you use to make whey ricotta, so there must be something to it.)
Anyway, first, heat the milk to 190°F and then take it off of the heat for 20 minutes.
The real trick is not to burn the milk. Whenever I cook it in a pot on the stove, I burn it on the bottom, no matter how much I use low heat and stir. Burning the milk is a double-whammy: it caramelizes some milk sugar, giving a hint of weird taste to the yogurt, and it sticks like cement to the bottom of the pot (adding insult to injury).
Finally, I found an approach that doesn’t burn the milk. I put a 2-gallon pot of milk inside my canner (without a lid) and fill the canner with water, then use it like a giant double-boiler. Just like a double-boiler, the hot water will buffer the temperature (it can’t go above boiling), and as a bonus, it heats the milk faster and more evenly, since the pot is surrounded with hot water. When the milk reaches the target temperature, I take it out of the double-boiler and set it aside.
The milk is now ready to become delicious yogurt, but it’s too hot for yogurt cultures to survive. Cool the milk to 120°F by placing it in a sink of ice water. Stir frequently to equalize the temperature of the milk, and remove it when it reaches the target temperature.
Now, add the starter culture to the pot of milk. I’ve done it two ways:
Now that your yogurt is inoculated, distribute it into your jars (or whatever) to culture. I make 2 gallons at a time, so I use 8 quart jars.
Put your jars in a cooler, and fill the cooler with 120°F water. (Actually, I prep the cooler by adding some hot water first, then dumping it out and refilling it. I hope this warms up the cooler ahead of time.)
I used to really fuss with the water temperature, checking it from time to time and heating it back up to keep it close to 120°F. But then, I read in the Brod & Taylor recipe (linked above) that you might get a better texture by reducing the temperature after 1 hour.
I don’t follow that recipe, but I do close the cooler and forget about it. It cools off on its own and seems to turn out fine. I also read somewhere (don’t remember where?) that temperature variance helps complementary cultures (Bulgaricus and Thermophilus) get their own time to work on the milk. Something like, one of the bacteria does better on the higher range, while the other does better on the lower range.
I leave it for 8-16 hours, roughly all day or overnight. “Done” is a matter of taste. More culturing will make a tangier yogurt that separates whey more. Less time will make a sweeter yogurt with less whey separation.
I used to leave it for 24-ish hours, but I realized that the sweeter yogurt is tart enough and takes less time. Besides, I read on Brod & Taylor’s “How to Maintain a Yogurt Culture” that the culture will last longer if it has more lactose to digest while it’s in the fridge. I don’t follow all the steps in that article, but I hope it will keep my culture going strong for longer.
Dry off the jars and put them in the fridge. They’re sterilized and pasteurized, so besides the yogurt bacteria continuing to process the lactose, they don’t really spoil. I make yogurt once every month or two.
We end up eating yogurt as:
I’d like to try making frozen yogurt, but I haven’t yet!
GraphQL-Ruby has a new runtime module, `GraphQL::Execution::Interpreter`. It offers better performance and some new features.
In isolated benchmarks, the new runtime is about 50% faster. We saw about a 10% speedup in GitHub when we migrated.
You can opt in by adding to your schema:
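The opt-in looked something like this (the schema name is assumed):

```ruby
class MySchema < GraphQL::Schema
  use GraphQL::Execution::Interpreter
end
```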
But why rewrite?
Previously, each field evaluated by GraphQL-Ruby got its own instance of `GraphQL::Query::Context::FieldResolutionContext`. This was introduced so that fields using `graphql-batch`-style Promises could reliably access context values (like `ctx.path`) after returning from the resolver (ie, when the promise was synced).
The problem was, the bigger the response, the more `ctx` objects would be created – and most of the time (for example, plain scalar fields), they were never used by application code. So, we allocated, initialized, then GCed these objects for nothing!
In fact, it wasn’t for nothing. As time passed, I started using those context objects inside execution code. For example, null propagation was implemented by climbing up the tree of context objects. So you couldn’t just stop creating them – the runtime depended on them.
To remove this performance issue, I went back to creating a single `Query::Context` object and passing it to resolvers. If you’re using the new class-based API, you might have noticed that `self.context` is a `Query::Context`, not a `Query::Context::FieldResolutionContext`. I did it this way to pave the way for removing this bottleneck.
But what about access to runtime information?
For fields that want runtime info (like `path` or `ast_node`), they can opt into it with `extras: [...]`, for example:
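For example (the field name and type here are assumed):

```ruby
field :name, String, null: false, extras: [:path]
```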
By adding that configuration, the requested value will be injected into the resolver:
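The resolver then receives it as a keyword argument (a sketch; the method name and body are assumed):

```ruby
def name(path:)
  # `path` is injected by the runtime, e.g. ["user", "name"]
  puts path.inspect
  object.name
end
```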
`path` will be a frozen Array describing the current point in the GraphQL response.
Finally, since `FieldResolutionContext`s aren’t necessary for user code, we can rewrite execution to not create or use them anymore. Under the hood, `GraphQL::Execution::Interpreter` doesn’t create those `ctx` objects. Instead, null propagation is implemented manually and all necessary values are passed from method to method.
Years ago, someone requested the feature of rejecting a query before running it. They wanted to analyze the incoming query, and if it was too big or too complicated, reject it.
How could this be implemented? You could provide user access to the AST, but that would leave some difficult processing to user code, for example, merging fragments on interfaces.
So, I added `GraphQL::InternalRepresentation` as a normalized, pre-processed query structure. Before running a query, the AST was transformed into a tree of `irep_node`s. Users could analyze that structure and reject queries if desired.
In execution code, why throw away the result of that preprocessing? The runtime also used `irep_node`s to save re-calculating fragment merging. In fact, even static validation used the `irep_node` tree. At some point, rather than re-implement fragment merging, I decided to hook into that rewritten tree to implement `FragmentsWillMerge`. After all, why throw away that work?
(As it turns out, someone should fire the GraphQL-Ruby maintainer. These layers of code were not well-isolated!!)
Building `irep_node`s was slow and often a waste

Since the `irep_node` tree was built for analysis, it generated branches for every possible combination of interfaces, objects, and unions. This meant that, even for a query returning very simple data, the pre-processing step might be very complex.
To make matters worse, the complexity of this preprocessing would grow as the schema grew. The more implementers an interface has, the longer it takes to calculate the possible branches in a fragment.
Not only was the work complex, but it also couldn’t be cached. This is because, while building the `irep_node` tree, `@skip` and `@include` would be evaluated with the current query variables. If nodes were skipped, they were left out of the `irep_node` tree.

This means that, for the same query in your code base, you couldn’t reuse the `irep_node` tree, since the values for those query variables might be different from one execution to the next. Boo, hiss!
I want to empower people to use GraphQL-Ruby in creative ways, but throwing a wacky, custom data structure in the mix doesn’t make it easy. I think an easier execution model will encourage people to learn how it works and build cool new stuff!
The new runtime evaluates the AST directly. Runtime features (`@skip` and `@include`, for example) are implemented at, well, runtime!

Since you can’t use the `irep_node` tree for analysis anymore, the library includes a new module, `GraphQL::Analysis::AST`, for preprocessing queries. Shout out to @xuorig for this module!
For GitHub, we moved a lot of analyzer behavior to runtime. We did this because it’s easier to maintain and requires less GraphQL-specific knowledge to understand and modify. Although the client experience is slightly different, it’s still good.
For example, we had an analyzer to check that pagination parameters (eg `first` and `last`) were valid. We moved this to runtime, adding it to our connection tooling.
GraphQL::Execution::Lookahead

`irep_node`s were useful for looking ahead in a query to see what fields would be selected next. (Honestly, they weren’t that good, but they were the only thing we had, besides using the AST directly.)
To support that use, we now have `extras: [:lookahead]`, which will inject an instance of `GraphQL::Execution::Lookahead`, with an API explicitly for checking fields later in the query.
As part of the change with removing `FieldResolutionContext`, the new runtime doesn’t support proc-style resolvers (`->(obj, args, ctx) {...}`). Besides `ctx`, the `args` objects (`GraphQL::Query::Arguments`) are not created by the interpreter either. Instead, the interpreter uses plain hashes.
Instead of procs, methods on Object type classes should be used.
This means that proc-based features are also not supported. Field instrumenters and middlewares won’t be called; a new feature called field extensions should be used instead.
`.to_graphql` is almost out

When the class-based schema API was added to GraphQL-Ruby, there was a little problem. The class-based API was great for developers, but the execution API expected legacy-style objects. The bridge was crossed via a compatibility layer: each type class had a `def self.to_graphql` method which returned a legacy-style object based on that class. Internally, the class and legacy object were cached together.
The interpreter doesn’t use those legacy objects, only classes. So, any type extensions that you’ve built will have to be supported on those classes.
The catch is, I’m not 100% sure that uses of legacy objects have all been migrated. In GitHub, we transitioned by delegating methods from the legacy objects to their source classes, and I haven’t removed those delegations yet. So, there might still be uses of legacy objects 😅.
In a future version, I want to remove the use of those objects completely!
I hope this post has clarified some of the goals and approaches toward adding the new runtime. I’m already building new features for it, like custom directives and better subscription support. If you have a question or concern, please open an issue to discuss!
Schumacher was a post-WWII British economist. He advised the British National Coal Board (the nationalized coal company) and the rebuilding of postwar Germany.
The book was published in 1973, the same year as the oil crisis, which raised some questions about our dependence on imported petroleum.
Schumacher returns again and again to a quote from John Maynard Keynes:
For at least another hundred years we must pretend to ourselves and to everyone that fair is foul and foul is fair; for foul is useful and fair is not. Avarice and usury and precaution must be our gods for a little longer still.
In context, Keynes is claiming that eventually, people will give up the love of money because they find that it doesn’t satisfy. But first, a capitalist order must create material abundance.
For Schumacher, this triggers a series of conclusions:
As a modern person, I ask myself, are we arriving at Keynes’s expected conclusion? He wrote:
When the accumulation of wealth is no longer of high social importance, there will be great changes in the code of morals.
I wonder when that “When…” is/was expected to arrive.
Schumacher’s understanding of social malaise and environmental destruction boils down to a claim about organizational structure, something like: when an enterprise becomes so big that ownership becomes isolated from execution, it becomes inhuman (in the sense that decision-making can’t be made with human-to-human perspective), and as a result, workers are reduced to cogs, natural resources are reduced to consumable inputs, and the like.
Schumacher assumes (observes?) that when humans make decisions regarding their neighbors and hometowns, they are more likely to consider the non-economic factors of their decisions. For example, a non-economic factor might be a beautiful landscape, a creatively engaging endeavor, or caring for something (or someone) who can’t care for itself. For a profit-oriented calculation at headquarters, these factors might be weighed less heavily.
To simplify one of Schumacher’s maxims to address this issue, he suggests that nothing should be centralized if it can be decentralized. He acknowledges that some order is required in order for our larger societal goals to be accomplished, but also, he warns that too much order stunts many non-quantifiable joys in life. So, by decentralizing, you can engage human entrepreneurial spirit in a way that is healthy for the localities it impacts.
A large section of the book (“The Third World”) addresses development in poor countries. Schumacher criticizes the dominant mode of development, namely, the installation of capital-intensive heavy industries in large cities. He cites several problems:
Schumacher espouses a different approach, “intermediate technology”. In this approach, gradual improvements in technology are applied to slowly raise the level of economic activity in a community. For example, if human labor is readily available, a more low-capital, labor-intensive solution might be preferred to a high-capital, low-labor solution. For instance, consumer goods might be made by hand instead of by machine, since that will employ more local people and can adapt to a greater variety of inputs. Additionally, several enterprises should be fostered, and they should target local markets, so that newly-employed people can participate in commerce with one another.
I have left out specific examples; the book is full of them.
You can see how this hangs on Schumacher’s conviction that work can be good for people. In “Buddhist Economics”, he highlights some possible benefits of employment, for example:
All these goods focus on the people involved in the enterprise, not the capital or products. Capital should be used in service to the people, not people in service to capital.
In “The Greatest Resource - Education”, Schumacher points out that any economic theory rests on a “meta-economics”, that is, a set of assumptions about what things are and what they mean. He describes it as
ideas that would make the world, and [our] own lives, intelligible to [us]; when a thing is intelligible, you have a sense of participation; when a thing is unintelligible, you have a sense of estrangement.
Importantly, these things are absorbed and transmitted without our active recognition of it. Our minds are “furnished” by our communities without our awareness (much less our permission!).
When it comes to our own “meta-economics”, Schumacher outlines several of our assumptions about the world:
For Schumacher, since these ideas give rise to our economics, they can also be understood as the source of our social ills. Since they’re responsible for our sense of powerlessness, our quickness to consume and destroy the earth, and our dissatisfaction with these things, we can’t expect more of the same to remedy those ills.
Schumacher also prescribes a suite of assumptions which he thinks will give rise to the kind of economics he espouses:
Schumacher points to examples of these assumptions in several philosophical traditions outside post-Enlightenment Western thought.
(This post sat as a draft for a long time. I filled in the last part after returning the book.)
At the end of the book, Schumacher provides a positive theory of large organizations. Since I’ve returned the book, I’ll write the maxim that really stuck with me: anything that can be decentralized should be decentralized.
Toward the end of engaging peoples’ entrepreneurial spirit, underlings should be given as much autonomy and authority as possible. Besides doing better work, the boots-on-the-ground folks can make more appropriate consideration of local context.
Schumacher also imagined a really interesting relationship between government and big business. He pointed out that the current arrangement of taxation creates some bizarre incentives: since only profit is taxed, companies work to hide profit from the government. It’s a strange arrangement: the government builds the infrastructure for businesses to thrive, then puts itself in the position of a bandit, trying to recapture (via taxation) its fair share of the gain.
What if, instead, big companies had the local government as a 50% shareholder? Schumacher proposes that the government party would be observer-only, except in circumstances when the local common good would require some representation. But, the government party would be entitled to 50% of the profit instead of taxing the business. In an arrangement like that, the business is incentivized to grow its profits, and the government doesn’t have to fight to recapture its investment in business infrastructure.
I don’t have any experience in that kind of large-scale thinking, but I found it an interesting scenario to imagine.
Interestingly, Schumacher clearly builds his vision on a Christian understanding of humans and work. He sees humans as reflecting their creator’s nature: creative, social, loving, and capable. At its best, work is not a necessary evil, but instead, it’s a good part of culture where people can engage those attributes. That perspective orients his thoughts towards goals other than “putting food on the table” (although sustenance is a goal), for example, engendering pride in one’s work, connections between neighbors, and the development and exercise of human skill.
TL;DR: I applied a thing I read in a textbook (trampolining), and it reduced the backtrace size and memory overhead of GraphQL-Ruby’s execution without hurting runtime speed.
You can see the diff and benchmark results here: https://github.com/rmosolgo/graphql-ruby/compare/1b306fad…eef73b1
It’s a bit funny, but it’s not totally clear to me what the book is trying to get at here. In the book, they talk about control context or continuations in a way that I would talk about “stack frames”. I think the problem is this: when you implement a programming language as an interpreter, you end up with recursive method calls, and that recursion builds up a big stack in the host language. This is bad because it hogs memory.
I can definitely imagine that this is a problem in Ruby, although I haven’t measured it. GraphQL-Ruby uses recursion to execute GraphQL queries, and I can imagine that those recursive backtrace frames hog memory: each frame keeps references to its local variables (its `binding`), which, since it’s still on the stack, can’t be GCed. So, Ruby holds on to a lot of objects which could be garbage collected if the library was written better.

Besides that, the long backtrace adds a lot of noise when debugging.
In the book, they say, “move your recursive calls to tail position, then, assuming your language has tail-call optimization, you won’t have this problem.” Well, my language doesn’t have tail-call optimization, so I do have this problem! (Ok, it’s an option.)
Luckily for me, they describe a technique for solving the problem without tail-call optimization. It’s called trampolining, and it works roughly like this:
When a method would make a recursive call, instead, return a `Bounce`. Then, the top-level method, which previously received the `FinalValue` of the interpreter’s work, should be extended to accept either a `FinalValue` or a `Bounce`. In the case of a `FinalValue`, it returns the value as previously. In the case of a `Bounce`, it re-enters the interpreter using the “bounced” value.
Using this technique, a previously-recursive method now returns, giving the caller some information about how to take the next step.
Let’s give it a try.
I want to test impact in two ways: memory consumption and backtrace size. I want to measure these values during GraphQL execution, so what better way to do it but build a GraphQL schema!
You can see the whole benchmark, but in short, we’ll run a deeply-nested query, and at the deepest point, measure the backtrace size and the number of live objects in the heap:
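The benchmark query might look something like this (a hypothetical shape, not the actual benchmark):

```ruby
QUERY_STRING = <<~GRAPHQL
  {
    nest {
      nest {
        nest {
          backtraceSize
          objectCount
        }
      }
    }
  }
GRAPHQL
```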
Where the fields are implemented by:
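The measurement fields can be implemented with plain Ruby introspection (a hedged sketch; the real benchmark may differ):

```ruby
# How deep is the Ruby call stack right now?
def backtrace_size
  caller.size
end

# How many live objects are in the heap?
def object_count
  GC.start
  counts = ObjectSpace.count_objects
  counts[:TOTAL] - counts[:FREE]
end
```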
We’ll use these measurements to assess the impact of the refactor.
To begin with, the interpreter is implemented as a set of recursive methods. The methods do things like:
These methods are recursive in the case of fields that return GraphQL objects. The first method resolves a field and calls the second method; then the second method, in order to prepare an object as a GraphQL response, calls back to the first method, to resolve selections on that object. For example, execution might work like this:
Do you see how the same procedure is being applied over and over, in a nested way? That’s implemented with recursive calls in GraphQL-Ruby.
We can run our test to see how the Ruby execution context looks in this case. The result: a backtrace size of 282, and a live object count of 812. This is the baseline for backtrace size and object count, which we’re using to measure memory overhead in GraphQL execution. (This describes behavior at this commit.)
As a requirement for the final refactor, we have to do some code reorganization. In the current code, the recursive calls require some setup and teardown around them. For example, we track the GraphQL “path”, which is the list of fields that describe where we are in the response. Here’s a field with its “path”:
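For example (hypothetical response data):

```ruby
# The `name` field here sits at path ["user", "name"]:
response = { "data" => { "user" => { "name" => "Robert" } } }
path = ["user", "name"]
response.dig("data", *path)  # => "Robert"
```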
In the code, it looks something like this:
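The shape was something like this (reconstructed; names besides `@path` and `execute_recursively` are assumed):

```ruby
class Interpreter
  def initialize
    @path = []
  end

  def execute_field(field_name)
    @path.push(field_name)
    execute_recursively(field_name)
    @path.pop  # tail position: *this* value is what the method returns
  end

  def execute_recursively(field_name)
    :result  # stand-in for the real recursive work
  end
end

Interpreter.new.execute_field("user")  # => "user" (the popped value!)
```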
The problem is, if I want to refactor `execute_recursively` to become a `Bounce`, it won’t do me any good, because the value of `execute_recursively` isn’t returned from the method. It’s not the last call in the method, so its value isn’t returned. Instead, the value of `@path.pop` is returned. (It’s not used for anything.)

This is to say: `@path.pop` is in tail position, the last call in the method. But I want `execute_recursively` to be in tail position.
The easiest way to “fix” that would be to refactor the method to return the value of `execute_recursively`:
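That refactor might look like this (sketch):

```ruby
class Interpreter
  def initialize
    @path = []
  end

  def execute_field(field_name)
    @path.push(field_name)
    result = execute_recursively(field_name)
    @path.pop
    result  # now the recursive value is returned...
  end

  def execute_recursively(field_name)
    :result  # stand-in for the real recursive work
  end
end

Interpreter.new.execute_field("user")  # => :result
```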
The problem is, when `execute_recursively` is refactored to be a `Bounce`:
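A sketch of that broken intermediate state, with a minimal `Bounce`, shows the problem concretely (all names assumed except `Bounce` and `@path`):

```ruby
class Bounce
  def initialize(object, method_name, *arguments)
    @object = object
    @method_name = method_name
    @arguments = arguments
  end

  def continue
    @object.send(@method_name, *@arguments)
  end
end

class Interpreter
  def initialize
    @path = []
  end

  def execute_field(field_name)
    @path.push(field_name)
    bounce = Bounce.new(self, :execute_recursively, field_name)
    @path.pop  # pops before the bounce ever runs!
    bounce
  end

  def execute_recursively(field_name)
    @path.dup  # by the time the bounce runs, the push is already popped
  end
end

Interpreter.new.execute_field("user").continue  # => [] -- the path data is gone
```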
By the time the bounce is actually executed, `path` won’t have the changes I need in it. The value is pushed and popped before the bounce is actually called.

The solution is to remove the need for `@path.pop`. This can be done by creating a new path and passing it as input.
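A sketch of the fixed version (assumed names):

```ruby
class Interpreter
  def execute_field(field_name, path)
    # Build a new path instead of mutating shared state; no pop needed:
    next_path = path + [field_name]
    execute_recursively(field_name, next_path)  # tail position!
  end

  def execute_recursively(field_name, path)
    path  # stand-in for the real recursive work
  end
end

Interpreter.new.execute_field("name", ["user"])  # => ["user", "name"]
```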
Now, `execute_recursively` is in tail position!
(The actual refactor is here: https://github.com/rmosolgo/graphql-ruby/commit/ef6e94283ecf280b14fe5417a4ee6896a06ebe69)
Now, we want to replace recursive calls with a bounce, where a bounce is an object with enough information to continue execution at a later point in time.
Since my recursive interpreter is implemented with a bunch of stateless methods (they’re stateless since the refactor above), I can create a Bounce class that will continue by calling the same method:
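A hedged reconstruction of the `Bounce` (the real class's details may differ): it captures a receiver, a method name, and arguments, so execution can continue later.

```ruby
class Bounce
  def initialize(object, method_name, *arguments)
    @object = object
    @method_name = method_name
    @arguments = arguments
  end

  # Re-enter the interpreter by calling the captured method.
  def continue
    @object.send(@method_name, *@arguments)
  end
end
```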
Then, I replace each tail-position recursive call with a `Bounce` that describes the same call.
Instead of growing the backtrace by calling another method, we’ll be shrinking the backtrace by returning from the current method with a Bounce.
You can see the refactor here: https://github.com/rmosolgo/graphql-ruby/commit/b8e51573652b736d67235080e8b450d6fc9cc92e
Let’s run the test again.
It’s a success! The `backtraceSize` decreased from 282 to 55. The `objectCount` decreased from 812 to 686.
“Trampolining” is the process of taking each bounce and continuing it. In my first implementation, `def trampoline` looked like this:
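The flawed first attempt might have looked like this (reconstructed, with a simplified block-based `Bounce` stand-in):

```ruby
class Bounce
  def initialize(&block)
    @block = block
  end

  def continue
    @block.call
  end
end

def trampoline(value)
  if value.is_a?(Bounce)
    trampoline(value.continue)  # a recursive call -- the deep stack is back!
  else
    value
  end
end
```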
My test indicated no improvement in memory overhead, so I frustratedly called it quits. While brushing my teeth before bed, it hit me! I had unwittingly re-introduced recursive method calls. So, I hurried downstairs and reimplemented `def trampoline` to use a `while` loop and a buffer of bounces, an approach which didn’t grow the Ruby execution context. Then the test result was much better.
Another consideration is the overhead of Bounces themselves. My first implementation creates a bounce before resolving each field. For very large responses, this will add a lot of overhead, especially when the field is a simple leaf value. This should be improved somehow.
It turns out that visitors to the website don’t care about backtrace size or Ruby heap size; they just care about waiting for webpages to load. Lucky for me, my benchmark includes some runtime measurements, and the results were basically the same before and after the refactor.
The runtime performance was very similar, almost within the margin of error. However, the consideration of Bounce overhead described above could cause worse performance in some cases.
This code isn’t quite ready for GraphQL-Ruby, but I think it’s promising: it reduces memory overhead and backtrace noise without hurting runtime performance.
However, one serious issue still needs to be addressed: what about the `Bounce`’s own overhead? Allocating a new object for every field execution is already a performance issue in GraphQL-Ruby, and I’m trying hard to remove it. So the implementation will need to be more subtle in that regard.
Here are a few thoughts about the trip.
Balkan Ruby was a big hit, and I had several personal favorites among the talks.
One of my favorite parts of the conference was the code challenges set up by Receipt Bank, one of the sponsors.
Every few hours, a new, wacky challenge would go live. Although I didn’t do well on them, I enjoyed working with a few new friends on different solutions, and seeing the creative things that other attendees submitted!
Sofia was great. A beautiful city with interesting architecture, tons of trees and tasty food.
The Nevski Cathedral was built in the early 1900s to celebrate Russia’s liberation of Bulgaria from the Ottomans:
Inside, a mural of Abraham and Isaac:
And St. Cyril and St. Methodius, creators of the Cyrillic alphabet, who are quite popular around here:
I really enjoyed the different cathedrals. There’s something cool about the different instructive artwork and “sacred” feeling of a beautiful building with incense burning. I wonder if modern American churches could do more to engage all of our senses.
Also, a bit of Soviet history found in a nearby park:
And here, some recently excavated Roman ruins, and the one remaining Turkish mosque downtown:
The mosque was right beside ruins of an old bathhouse. Apparently that’s why Sofia was founded here – there were hot springs on the road between Rome and Constantinople, so the Romans set up camp (and called it Serdica).
And a fairly typical meal during my time there, a shopska salad (veggies with cheese, oil, and vinegar):
I can’t say enough good things about the local dairy products. The cheese was soft and fresh and the yogurt was tart and refreshing.
My favorite part about programming conferences is meeting the smart, caring folks who make them possible, and Balkan Ruby was no exception.
The two main organizers, Genadi and Vestimir, were fantastic hosts (and experienced ones, since they got their start with Euruko a few years back). Besides that, I really enjoyed meeting the volunteers and learning a bit about life in Sofia.
One thing that stood out to me was the tradition behind the local liquor, rakia. It turns out that many families make it themselves, despite a law against owning stills. I’ve been reading that peach wine was the traditional alcoholic drink for the earliest European arrivals to my area, so I decided to give it a shot this summer!
A big bonus was when Vestimir played trail guide for our hike up the nearby mountain, Mt. Vitosha. It turned out to be a gray day, but we had a blast anyways.
Some pictures of the trail:
Beautiful! But you could say we were a bit underdressed XD
At the summit, we were happy to find a lodge where some food was served.
Some traditional bean soup, bread, lyutenitsa, cheese, tea, and rakia never tasted so good.
Enjoying a rest:
(Left-to-right: Andreas, Nynne, Sameer, Me and Vestimir)
And, pleasantly, we caught a nice view of Sofia on the way back down:
(You can even see the Nevski cathedral if you look closely!)
Balkan Ruby was a big hit on all fronts: great people, great city, great technical content. Especially as a dairy lover, I’ll take the next chance I get to go back! And I loved making some new friends, who I hope to see at future Ruby events.
Here are the different variables in Ruby:
(code sample omitted)
Here is how Ripper parses the above code:
(code sample omitted)
Let’s check out those nodes.
(code sample omitted)
A :vcall
is a bareword, either a local variable lookup or a method call on self. Used alone, this can only be determined at runtime, depending on the binding. If there’s a local variable, it will be used. My guess is that :vcall
is short for “variable/call”.
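Here’s what that looks like with Ripper from the standard library (a sketch; the exact position arrays may vary by Ruby version):

```ruby
require "ripper"

# A bare word with no prior assignment: Ripper emits :vcall, because
# it can't know whether this is a variable or a method call.
pp Ripper.sexp("foo")
# => [:program, [[:vcall, [:@ident, "foo", [1, 0]]]]]

# After an assignment in the same scope, the same bare word
# parses as a :var_ref instead:
pp Ripper.sexp("foo = 1; foo")[1][1]
# => [:var_ref, [:@ident, "foo", [1, 9]]]
```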
Interestingly, there is a single-expression case which could be disambiguated statically, but Ripper still uses :vcall
:
(code sample omitted)
(code sample omitted)
:var_ref
(presumably “variable reference”) is shared by many of these examples, and can always be resolved to a variable lookup, never a method call.
Its argument tells what kind of lookup to do (global, constant, instance, class), and what name to look up.
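For example, with Ripper from the standard library (a sketch showing the different token types inside :var_ref):

```ruby
require "ripper"

# The token wrapped by :var_ref names the kind of lookup:
pp Ripper.sexp("$x")[1][0]   # [:var_ref, [:@gvar, "$x", [1, 0]]]   (global)
pp Ripper.sexp("@x")[1][0]   # [:var_ref, [:@ivar, "@x", [1, 0]]]   (instance)
pp Ripper.sexp("@@x")[1][0]  # [:var_ref, [:@cvar, "@@x", [1, 0]]]  (class)
pp Ripper.sexp("X")[1][0]    # [:var_ref, [:@const, "X", [1, 0]]]   (constant)
```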
Some Ruby can be statically known to be a method call, not a variable lookup:
(code sample omitted)
In these cases, :fcall
, :call
and :command
are used to represent definite method sends.
Interestingly, :var_ref
is used for self
, too.
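A quick Ripper session illustrates all three definite-send nodes, plus self (standard library; a sketch):

```ruby
require "ripper"

# Parentheses make it definitely a method call:
pp Ripper.sexp("foo()")[1][0]
# [:method_add_arg, [:fcall, [:@ident, "foo", [1, 0]]], [:arg_paren, nil]]

pp Ripper.sexp("a.b")[1][0][0]    # :call    (explicit receiver)
pp Ripper.sexp("puts 1")[1][0][0] # :command (bareword with arguments)

# And self is a :var_ref wrapping a keyword token:
pp Ripper.sexp("self")[1][0]
# [:var_ref, [:@kw, "self", [1, 0]]]
```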
If you want to know more about the motivations behind this work, check out this previous post.
Below, I’ll cover:
GitHub’s type definitions are separated into folders by type, for example: objects/
, unions/
, enums/
(and mutations/
). I worked through them one folder at a time. The objects/
folder was big, so I did it twenty or thirty files at a time.
I had to do interfaces/
last because of the nature of the new class-based schema. Interface modules’ methods can’t be added to legacy-style GraphQL object types. So, by doing interfaces last, I didn’t have to worry about this compatibility issue.
Now that I remember it, I did the schema first, and by hand. It was a pretty easy upgrade.
When I started each section, I created a base class by hand. (There is some automated support for this, but I didn’t use it.) Then, I ran the upgrader on some files and tried to run the test suite. There were usually two kinds of errors:
More on these errors below.
After upgrading a section of the schema, I opened a PR for review from the team. This was crucial: since I was working at such a large scale, it was easy for me to miss the trees for the forest. My teammates caught a lot of things during the process!
After a review, the PR would be merged into master. Since GraphQL 1.8.0 supports incremental migration, I could work through the code in chunks without a long running branch or feature flags.
Here’s an overview of how the upgrader works. After reading the overview, if you want some specific examples, check out the source code.
The gem includes an auto-upgrader, spearheaded by the folks at HackerOne and refined during my use of it. It’s encapsulated in a class, GraphQL::Upgrader::Member
.
To use the upgrader, I added a Ruby script to the code base called graphql-update.rb
:
(code sample omitted)
This script has two basic parts:
GraphQL::Upgrader::Member
with a set of custom transformations

In your own script, you can write whatever supporting code you want. The key part from GraphQL-Ruby is:
(code sample omitted)
The upgrader is structured as a pipeline: each step accepts a big string of input and returns a big string of output. Sometimes, a step does nothing and so its returned string is the same as the input string. In general, the transforms consist of two steps:
parser
gem.)

You have a few options for customizing the transformation pipeline:
(The “pipeline” is just an array of instances or subclasses of GraphQL::Upgrader::Transform
.)
We’ll see cases of each below.
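That string-in/string-out shape can be sketched in plain Ruby. The transform classes and regexes below are invented for illustration (they respond to an apply(input_text) method, like the real transforms), and chaining the pipeline is just a reduce:

```ruby
# Each transform: big string in, big string out.
class TypesStringTransform
  # Replace a legacy `types.String` with the class-based `String`
  def apply(input_text)
    input_text.gsub("types.String", "String")
  end
end

class RemoveTrailingWhitespaceTransform
  def apply(input_text)
    input_text.gsub(/ +$/, "")
  end
end

pipeline = [TypesStringTransform.new, RemoveTrailingWhitespaceTransform.new]
source = "field :name, types.String  \n"
result = pipeline.reduce(source) { |text, transform| transform.apply(text) }
result # => "field :name, String\n"
```

A transform that has nothing to do simply returns its input unchanged, which is what makes re-running the pipeline safe.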
The upgrader accepts several types of transform pipelines:
(code sample omitted)
- type_transforms are run first, on the entire file.
- field_transforms are run second, but they receive parts of the type definition: the calls to field, connection, return_field, input_field, and argument. Fine-grained changes to field or argument definitions go here.
- clean_up_transforms are run last, on the entire file. For example, there’s a built-in RemoveExcessWhitespaceTransform which cleans up trailing spaces after other transforms have run.
- skip: has a special function: its #skip?(input) method is called, and if it returns true, the text is not transformed at all. This allows the transformer to be idempotent: by default, if you run it on the same file over and over, it will update the file only once.

Here are some custom transforms applied to our codebase.
We had a wrapper around ObjectType.define
which attached metadata, linking the object type to a specific Rails model. The helper was called define_active_record_type
. I wanted to take this:
(code sample omitted)
And make it this:
(code sample omitted)
Fortunately, this can be done with a pretty straightforward regular expression substitution. Here’s the transform:
(code sample omitted)
Then, in graphql-update.rb
, this transform was put first in the list:
(code sample omitted)
Also, for this to work, I added the def self.model_name(name)
helper to the base class.
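The original transform wasn’t preserved above, but a regex substitution of this kind might look roughly like the following. The class name, regex, and output layout are guesses for illustration, not the real implementation:

```ruby
# Hypothetical reconstruction of a define_active_record_type transform:
# rewrite the legacy one-liner into a class definition plus a
# model_name(...) call on the base class.
class ActiveRecordTypeToClassTransform
  def apply(input_text)
    input_text.sub(
      /^( *)(\w+) = define_active_record_type\(-> \{ (\w+) \}\) do$/,
      "\\1class \\2 < Types::BaseObject\n\\1  model_name \"\\3\""
    )
  end
end

before = "PostType = define_active_record_type(-> { Post }) do"
after = ActiveRecordTypeToClassTransform.new.apply(before)
puts after
# class PostType < Types::BaseObject
#   model_name "Post"
```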
We have a helper for adding URL fields called define_url_field
. I decided to rename this to url_fields
, since these days it creates two fields.
The arguments are the same, so it was a simple substitution:
(code sample omitted)
This transform didn’t interact with any other transforms, so I added it to clean_up_transforms
, so it would run last:
(code sample omitted)
We have a few DSL methods that, at the time, were easier to implement as keyword arguments. (Since then, the API has changed a bit. You can implement DSL methods on your fields by extending GraphQL::Schema::Field
and setting that class as field_class
on your base Object, Interface and Mutation classes.)
I wanted to transform:
(code sample omitted)
To:
(code sample omitted)
(Later, a built-in upgrader would change secretStuff
to secret_stuff
and types.String
to String, null: true
.)
To accomplish this, I reused a built-in transform, ConfigurationToKwargTransform
, adding it to field_transforms
:
(code sample omitted)
In fact, there were several configuration methods moved this way.
As I was working through the code, some files were tougher than others, so I decided to skip them. I settled on a magic comment:
(code sample omitted)
would cause a file to be skipped. To implement this, I made a custom skip class:
(code sample omitted)
And passed it as skip:
to the upgrader. Then, later, I removed the comment and tried again. (Fortunately, my procrastination paid off because the upgrader was improved in the meantime!)
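A skip class only needs a #skip?(input) method. Something like this sketch (the magic-comment text and class name here are invented, since the originals weren’t preserved above):

```ruby
class SkipMagicComment
  # Hypothetical marker; the real comment text was different.
  MAGIC_COMMENT = "# graphql-upgrade: skip"

  # Return true to leave the file completely untouched.
  def skip?(input_text)
    input_text.include?(MAGIC_COMMENT)
  end
end

skipper = SkipMagicComment.new
skipper.skip?("# graphql-upgrade: skip\nPostType = ...") # => true
skipper.skip?("PostType = ...")                          # => false
```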
As I worked, I improved the upgrader to cover as many cases as I could, but there are still a few cases that I had to upgrade by hand. I’ll list them here. If you’re really dragged down by them, consider opening an issue on GraphQL-Ruby to talk about fixing them. I’m sure they can be fixed, I just didn’t get to it!
If you want to fix one of these issues, try to replicate the issue by adding to an example spec/fixtures/upgrader
and then getting a failing test. Then, you could update the upgrader code to fix that broken test.
Arguments could be accessed by method to avoid typos. However, now, since arguments are a Ruby keyword hash, they don’t have methods corresponding to their keys.
Unfortunately, the upgrader doesn’t do anything about this, it just leaves them there and you get a NoMethodError
on Hash
.
This could almost certainly be fixed by improving this find-and-replace in ResolveProcToMethodTransform
:
(code sample omitted)
It only updates a few methods on args
, but I bet a similar find-and-replace could replace other method calls, too.
Sometimes, we take GraphQL arguments and pass them to helper methods:
(code sample omitted)
However when this was transformed to:
(code sample omitted)
It would break, because the new arguments
value is a Ruby hash with underscored, symbol keys. So, if Some::Helper
was using camelized strings to get values, it would stop working.
The upgrader can’t really do anything there, since it’s not analyzing the codebase. In my case, these were readily apparent because of failing tests, so I went and fixed them.
We have some fields that add to the "errors"
key and return values; they used ctx.add_error
to do so:
(code sample omitted)
When upgraded, it doesn’t work quite right:
(code sample omitted)
(If you don’t have to return a value, use raise
instead, then you can stop reading this part!)
The problem is that @context
is not a field-specific context anymore. Instead, it’s the query-level context. (This is a downside of the new API: we don’t have a great way to pass in the field context anymore.)
To address this kind of issue, field
accepts a keyword called extras:
, which contains an array of symbols. In the case above, we could use :execution_errors
:
(code sample omitted)
So, execution_errors
was injected into the field as a keyword. It is field-level, so adding errors there works as before.
Other extras are :irep_node
, :parent
, :ast_node
, and :arguments
. It’s a bit of a hack, but we need something for this!
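The injection mechanism behind extras: can be sketched in plain Ruby. This toy resolver is not GraphQL-Ruby’s implementation, and the values are placeholders; it just shows the idea of merging requested extras into the method’s keyword arguments:

```ruby
class ToyField
  # Placeholder values standing in for real field-level objects:
  AVAILABLE_EXTRAS = {
    execution_errors: "(an errors object)",
    ast_node:         "(an AST node)",
  }

  def initialize(extras: [])
    @extras = extras
  end

  # Call the resolver method, merging in only the requested extras:
  def resolve(receiver, method_name, **graphql_args)
    injected = @extras.to_h { |name| [name, AVAILABLE_EXTRAS.fetch(name)] }
    receiver.public_send(method_name, **graphql_args, **injected)
  end
end

class PostResolvers
  def errors_field(execution_errors:)
    "received #{execution_errors}"
  end
end

field = ToyField.new(extras: [:execution_errors])
field.resolve(PostResolvers.new, :errors_field)
# => "received (an errors object)"
```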
By default, connection arguments (like first
, after
, last
, before
) are not passed to the Ruby methods for implementing fields. This is because they’re generally used by the automagical (😖) connection wrappers, not the resolve functions.
But, sometimes you just need those old arguments!
If you use extras: [:arguments]
, the legacy-style arguments will be injected as a keyword:
(code sample omitted)
The upgrader does fine when the description is a "..."
or '...'
string. But in other cases, it was a bit wacky.
Strings built up with +
or \
always broke. I had to go back by hand and join them into one string.
Heredoc strings often worked, but only by chance. For example:
(code sample omitted)
Would be transformed to:
(code sample omitted)
This is valid Ruby, but a bit tricky. This could definitely be improved: since I started my project, GraphQL 1.8 was extended to support description
as a method as well as a keyword. So, the upgrader could be improved to leave descriptions in place if they’re fancy strings.
I hacked around with the parser
gem to transform resolve
procs into instance methods, but there’s a bug. A proc like this:
(code sample omitted)
Will be transformed to:
(code sample omitted)
Did you see how the comment was removed? I think I’ve somehow wrongly detected the start of the proc body, so that the comment was left out.
In my case, I re-added those comments by hand. But it could probably be fixed in GraphQL::Upgrader::ResolveProcToMethodTransform
.
I’m not sure why, but sometimes a hash of arguments like:
(code sample omitted)
would be reorganized to
(code sample omitted)
I didn’t look into it; I just fixed it by hand.
We have a DSL for making connections, like:
(code sample omitted)
Sometimes, when this connection was inside a proc, it would be wrongly transformed to:
(code sample omitted)
This was invalid Ruby, so the app wouldn’t boot, and I would fix it by hand.
Generating connection and edge types with the .connection_type
/.define_connection
and .edge_type
/.define_edge
methods will work fine with the new API, but if you want to migrate them to classes, you can do it.
It’s on my radar because I want to remove our DSL extensions, and that requires updating our custom connection edge types.
Long story short, it Just Worked™ with the class-based API. The approach was:
- A base class extending BaseObject
- A def self.inherited hook to add connection- and edge-related behaviors

So, I will share my base classes in case that helps. Someday it would be nice to upstream this to GraphQL-Ruby, but I’m not sure how to do it now.
Base connection class:
(code sample omitted)
Base edge class:
(code sample omitted)
We have several extensions to the GraphQL-Ruby .define
DSL, for example, visibility
controls who can see certain types and fields and scopes
maps OAuth scopes to GraphQL types.
The difficulty in porting extensions comes from the implementation details of the new API. For now, definition classes are factories for legacy-style type instances. Each class has a .to_graphql
method which is called once to return a legacy-style definition. To maintain compatibility, you have to either:
Eventually, legacy-style definitions will be phased out of GraphQL-Ruby, but for now, they both exist in this way in order to maintain backwards compatibility and gradual adoptability.
In the meantime, you can go between class-based and legacy-style definitions using .graphql_definition
and .metadata[:type_class]
, for example:
(code sample omitted)
.redefine
The easiest way to retain compatibility is to:
.to_graphql
to call super, and then pass the configuration to defn.redefine(...)
, then return the redefined type.

After my work on our code, I extracted this into a backport of accepts_definition. You can give that approach a try, for example:
(code sample omitted)
.metadata[:type_class]
An approach I haven’t tried yet, but will soon, is to move the “source of truth” to the class-based definition. The challenge here is that class-based definitions are not really used during validation and execution, so how can you reach configuration values on those classes?
The answer is that if a legacy-style type was derived from a class, that class is stored as metadata[:type_class]
. For example:
(code sample omitted)
So, you could update runtime code to read configurations from type_defn.metadata[:type_class]
.
Importantly, metadata[:type_class]
will be nil
if the type wasn’t derived from a class, so this approach is tough to use if some definitions are still using the .define
API.
I haven’t implemented this yet, but I will be doing it in the next few weeks so we can simplify our extensions and improve boot time.
I’m still wrapping up some loose ends in the codebase, but I thought I’d share these notes in case they help you in your upgrade. If you run into trouble on anything mentioned here, please open an issue on GraphQL-Ruby! I really want to support a smooth transition to this new API.
1.8.0
will have a new class-based API for defining your schema. Let’s investigate the design choices in the new API.
The new API is backwards-compatible and can coexist with type definitions in the old format. See the docs for details. 1.8.0.pre
versions are available on RubyGems now and are very stable – that’s what we’re running at GitHub!
Since starting at GitHub last May, I’ve entered into the experience of a huge-scale GraphQL system. Huge scale in lots of ways: huge schema, huge volume, and huge developer base. One of the problems that stood out to me (and to lots of us) was that GraphQL-Ruby simply didn’t help us be productive. Elements of schema definition hindered us rather than helped us.
So, our team set out on remaking the GraphQL-Ruby schema definition API. We wanted to address a few specific issues:
graphql-js
, the reference implementation.) Ruby developers couldn’t bring their usual practices into schema development; instead, they had to learn a bunch of new APIs and figure out how to work them together.Besides all that, we needed a safe transition, so it had to support a gradual adoption.
After trying a few different possibilities, the team decided to take a class-based approach to defining GraphQL schemas. I’m really thankful for their support in the design process, and I’m indebted to the folks at Shopify, who used a class-based schema definition system from the start (as a layer on top of GraphQL-Ruby) and presented their work early on.
In short, GraphQL types used to be singleton instances, built with a block-based API:
(code sample omitted)
Now, GraphQL types are classes, with a DSL implemented as class methods:
(code sample omitted)
Field resolution was previously defined using Proc literals:
(code sample omitted)
Now, field resolution is defined with an instance method:
(code sample omitted)
How does this address the issues listed above?
First, using classes reduces the “WTF” factor of GraphQL definition code. A seasoned Ruby developer might (rightly) smell foul play and reject GraphQL-Ruby on principle. (I was not seasoned enough to detect this when I designed the API!)
Proc literals are rare in Ruby, but common in GraphQL-Ruby’s .define { ... }
API. Their lexical scoping rules are different than method scoping rules, making it hard to remember what was and wasn’t in scope during field resolution (for example, what was self
?). To make matters worse, some of the blocks in the .define
API were instance_eval
’d, so their self
would be overridden. Practically, this meant that typos in development resulted in strange NoMethodError
s.
Proc literals also have performance downsides: they’re not optimized by CRuby, so they’re slower than method calls. Since they capture a lexical scope, they may also have unexpected impacts on memory footprint (any local variable may be retained, since it might be accessed by the proc). The solutions here are simple: just use methods, the way Ruby wants you to! 😬
In the new class-based API, there are no proc literals (although they’re supported for compatibility’s sake). There are some instance_eval
’d blocks (field(...) { }
, for example), but field resolution is just an instance method and the type definition is a normal class, so module scoping works normally. (Contrast that with the constant assignment in Types::Post = GraphQL::ObjectType.define { ... }
, where no module scope is used). Several hooks that were previously specified as procs are now class methods, such as resolve_type
and coerce_input
(for scalars).
Overriding !
is another particular no-no I’m correcting. At the time, I thought, “what a cool way to bring a GraphQL concept into Ruby!” This is because GraphQL non-null types are expressed with !
:
(code sample omitted)
So, why not express the concept with Ruby’s !
method (which is usually used for negation)?
(code sample omitted)
As it turns out, there are several good reasons why not!
- ! breaks the negation operator. ActiveSupport’s .present? didn’t work with type objects, because ! didn’t return false, it returned a non-null type.
- The ! operator throws people off. When a newcomer sees GraphQL-Ruby sample code, they have a WTF moment, followed by the dreadful memory (or discovery) that Ruby allows you to override !.

So, overriding !
didn’t deliver any value, but it did present a roadblock to developers and break some really essential code.
In the new API, nullability is expressed with the options null:
and required:
instead of with !
. (But, you can re-activate that override for compatibility while you transition to the new API.)
By switching to Ruby’s happy path of classes and methods, we can help Ruby developers feel more at home in GraphQL definitions. Additionally, we avoid some unfamiliar gotchas of procs and clear a path for removing the !
override.
Rails’ automatic constant loading is wonderful … until it’s not! GraphQL-Ruby didn’t play well with Rails’ constant loading especially when it came to cyclical dependencies, and here’s why.
Imagine a typical .define
-style type definition, like this:
(code sample omitted)
We’re assigning the constant Types::T
to the return value of .define { ... }
. Consequently, the constant is not defined until .define
returns.
Let’s expand the example to two type definitions:
(code sample omitted)
If T1
depends on T2
, and T2
depends on T1
, how can this work? (For example, imagine a Post
type whose author
field returns a User
, and a User
type whose posts
field returns a list of Post
s. This kind of cyclical dependency is common!) GraphQL-Ruby’s solution was to adopt a JavaScriptism, a thunk. (Technically, I guess it’s a functional programming-ism, but I got it from graphql-js
.) A thunk is an anonymous function used to defer the resolution of a value. For example, if we have code like this:
(code sample omitted)
GraphQL-Ruby would accept this:
(code sample omitted)
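In plain Ruby, a thunk is just a proc wrapped around the constant lookup. A minimal sketch:

```ruby
# The constant lookup is deferred until the proc is called, so this
# line works even though Types::User doesn't exist yet:
lazy_type = -> { Types::User }

module Types
  User = "the User type"
end

# "Calling the thunk" later resolves the constant, which exists by now:
lazy_type.call # => "the User type"
```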
Later, GraphQL-Ruby would .call
the proc and get the value. At that point, Types::User
would properly resolve to the correct type. This worked but it had two big downsides:
Proc
) in an unfamiliar context (a method argument), so it was frustrating and disorienting.

How does switching to classes resolve this issue? To ask the same question another way: how come we don’t experience this problem with normal Rails models?
Part of the answer has to do with how classes are evaluated. Consider two classes in two different files:
(code sample omitted)
Notice that Post
depends on User
, and User
depends on Post
. The difference is how these lines are evaluated, and when the constants become defined. Here’s the same code, with numbering to indicate the order that lines are evaluated:
(code sample omitted)
Since Types::Post
is initialized first, then built-up by the following lines of code, it’s available to Types::User
in the case of a circular dependency. As a result, the thunk is not necessary.
This approach isn’t a silver bullet – Types::Post
is not fully initialized by the time Types::User
needs it – but it reduces visual friction and generally plays nice with Rails out of the box.
I’ve used a naughty word here, but in fact, I’m talking about something very good. Have you ever been stuck with some dependency that didn’t quite fit your application? (Or, maybe you were stuck on an old version, or your app needed a new feature that wasn’t quite supported by the library.) Like it or not, sometimes the only way forward in a case like that is to hack it: reopen classes, redefine methods, mess with the inheritance chain, etc. Yes, those choices come with maintenance downsides, but sometimes they’re really the best way forward.
On the other hand, really flexible libraries are ready for you to come and extend them. For example, they might provide base classes for you to extend, with the assumption that you’ll override and implement certain methods. In that case, the same hacking techniques listed above have found their time to shine.
ActiveRecord::Base
is a great example of both cases: plenty of libraries hack methods right into the built-in class (for example, acts_as_{whatever}
), and also, lots of Rails apps use an ApplicationRecord
class for their application-specific customizations.
Since GraphQL-Ruby didn’t use the familiar arrangement of classes and methods, it was closed to this kind of extension. (Ok, you could do it, but it was a lot of work! And who wants to do that!?) In place of this, GraphQL-Ruby had yet-another-API for extending its DSL. Yet another thing to learn, with more Proc literals 😪.
Using classes simplifies this process because you can use familiar Ruby techniques to build your GraphQL schema. For example, if you want to share code between field resolvers, you can include
a module and call its methods. If you want to make shorthands for common cases in your app, you can use your Base
type classes. If you want to add special configuration to your types, you can use class methods. And, whenever that day should come, when you need to monkey-patch GraphQL-Ruby internals, I hope you’ll be able to find the right spot to do it!
GraphQL-Ruby is three years old now, and I’ve learned a LOT during that time! I’m really thankful for the opportunity to focus on developer productivity in the last few months, learning how I’ve prevented it and working on ways to improve it. I hope to keep working on topics like this – how to make GraphQL more productive for Ruby developers – in the next year, especially, so if you have feedback on this new API, please open an issue to share it!
I’m excited to see how this new API changes the way people think about GraphQL in Ruby, and I hope it will foster more creativity and stability.
Part of Ruby’s appeal is to be free of the cruft of its predecessors. So why is there so much interest in adding types to Ruby?
What are the benefits?
To experience a great type system in a Ruby-like language, I recommend Crystal.
Jeff Foster is a professor at the University of Maryland, College Park and works in the programming languages group. Along with his students, he’s been exploring Ruby type checkers for nine years! This year, he gave a presentation at StrangeLoop, Type Checking Ruby.
He described his various avenues of research over the years, and how they influenced one another, leading to a final question:
(code sample omitted)
His early work revolved around static type checking: annotations in the source code were given to a type checker, which used those annotations to assert that the Ruby code was correct.
This approach had a fundamental limitation: how can dynamically-created methods (like Talk#owner
above) be statically annotated?
This drove him and his team to develop RDL, a dynamic type checker. In RDL, types are declared using methods instead of annotations, for example:
(code sample omitted)
By using methods, it handles metaprogramming in a straightforward way. It hooks into Rails’ .belongs_to
and adds annotations for the generated methods, for example:
(code sample omitted)
(In reality, RDL uses conditions, not monkey-patching, to achieve this.)
In this approach, type information is gathered while the program runs, but the typecheck is deferred until the method is called. At that point, RDL checks the source code (static information) using the runtime data (dynamic information). For this reason, RDL is called “Just-in-Time Static Type Checking.”
You can learn more about RDL in several places:
Personally, I can’t wait to give RDL a try. At the conference, Jeff mentioned that type inference was on his radar. That would take RDL to the next level!
Not to read into it too far, but it looks like Stripe is exploring RDL 😎.
Soutaro Matsumoto also has significant academic experience with type checking Ruby, and this year, he presented some of his work at RubyKaigi in Type Checking Ruby Programs with Annotations.
He begins with an overview of type checking Ruby, and surveys the previous work in type inference. He also points out how requirements should be relaxed for Ruby:
Then, he introduces his recent project, Steep.
Steep’s approach is familiar, but new to Ruby. It has three steps:
.rbi
file which describes the types in your program, using a special type language, for example:

(code sample omitted)
(code sample omitted)
Some connections between Ruby source and the .rbi
files can be made automatically; others require explicit annotations.
Run the type checker:
$ steep check app/models/talk.rb
It reminds me a bit of the .h
/.c
files in a C project.
Soutaro is also presenting his work at this winter’s RubyConf.
Valentin works at JetBrains (creators of RubyMine) and presented his work on type-checking based on runtime data. His presentation, Automated Type Contracts Generation for Ruby, was really fascinating and offered a promising glimpse of what a Ruby type ecosystem could be.
Valentin started by covering RubyMine’s current type checking system:
obj.execute
, what method does it call?He also pointed out that even code coverage is not enough: 100% code coverage does not guarantee that all possible codepaths were run. For example, any composition of if
branches require a cross-product of codepaths, not only that each line is executed once. Besides that, code coverage does not analyze the coverage of your dependencies’ code (ie, RubyGems).
So, Valentin suggests getting more from our unit tests: what if we observed the running program, and kept notes about what values were passed around and how they were used? In this arrangement, that runtime data could be accumulated, then used for type checking.
Impressively, he introduced the implementation of this, first using a TracePoint, then digging into the Ruby VM to get even more granular data.
However, the gathered data can be very complicated. For example, how can we understand the input type of String#split
?
(code sample omitted)
Valentin showed how a classic technique, finite automata, can be used to reduce this information to a useful data structure.
Then, this runtime data can be used to generate type annotations (as YARD docs).
Finally, he imagines a type ecosystem for Ruby:
Personally, I think this is a great future to pursue:
You can see the project on GitHub: https://github.com/JetBrains/ruby-type-inference
There’s a lot of technically-savvy and academically-informed work on type checking Ruby! Many of the techniques preserve Ruby’s productivity and dynamism while improving the developer experience and confidence. What makes them unique is their use of runtime data, to observe the program in action, then make assertions about the source code.
react-rails
2.0 🎊.
Here are a few highlights. For the full list, see the changelog!
Webpacker was great to work with. react-rails
now supports webpacker for:
<%= react_component(...) %>
via require
server_rendering.js
)

A nice advantage of using webpacker is that you can load React.js from NPM instead of the react-rails
gem. This way, you aren’t bound to the React.js version which is included with the Ruby gem. You can pick any version you want!
To support frontends built with Node.js, react-rails
’s UJS driver is available on NPM as react_ujs
. It performs setup during require
, so these two are equal:
(code sample omitted)
If you’re prerendering your React components on the server, you can perform setup and teardown in your Rails controller. For example, you might use these hooks to populate a flux store.
First, add the per_request_react_rails_prerenderer
helper to your controller:
1 2 3 4 |
|
Then, you can access react_rails_prerenderer
in the controller action:
1 2 3 4 5 6 |
|
That way, you can properly prepare & clean up a JS VM for server rendering.
Previously, ReactRailsUJS
“automatically” detected which libraries you were using and hooked up to their events for rendering components.
It still checks for libraries during its initial load, but you can also re-check as needed:
1 2 |
|
This function removes previous event handlers, so it’s safe to call anytime. (This was added in 2.0.2
.)
See the changelog for bug fixes and a new default server rendering configuration.
Webpacker is great! Setup was smooth and the APIs were clear and convenient. I’m looking forward to using it more.
🍻 Here’s to another major version of react-rails
!
Rails knows to watch config/routes.rb
for changes and reload them when the files change. You can use the same mechanism to watch other files and take action when they change.
I used this feature for react-rails server rendering and for GraphQL::Pro static queries.
Every Rails app has a @reloader
, which is a local subclass of ActiveSupport::Reloader
. It’s used whenever you call reload!
in the Rails console.
It’s attached to a rack middleware which calls #run!
(which, in turn, calls the reload blocks if it detects changes).
You can add custom preparation hooks with config.to_prepare
:
1 2 3 4 5 |
|
When Rails detects a change, this block will be called. It’s implemented by registering the block with app.reloader
.
To add new conditions for which Rails should reload, you can add to the app.reloaders
array:
1 2 3 4 5 6 7 8 9 10 11 12 13 |
|
The object’s updated?
method will be called by the reloader. If any reloader returns true
, the middleware will run all to_prepare
blocks (via the call to @reloader.run!
).
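The whole arrangement can be sketched without Rails. In this toy version (`TinyReloader` is a made-up stand-in, not Rails' actual class), reloaders are polled with `updated?`, and the prepare blocks run when any of them reports a change:

```ruby
class TinyReloader
  def initialize
    @reloaders = []       # objects responding to #updated?
    @prepare_blocks = []  # blocks registered via to_prepare
  end

  def to_prepare(&block)
    @prepare_blocks << block
  end

  def add_reloader(reloader)
    @reloaders << reloader
  end

  # Like the middleware's call to #run!: if any reloader
  # reports a change, run every prepare block.
  def run!
    return false unless @reloaders.any?(&:updated?)
    @prepare_blocks.each(&:call)
    true
  end
end

reloader = TinyReloader.new
reloader.to_prepare { puts "Reloading!" }

# A fake checker that reports a change exactly once:
checker = Object.new
def checker.updated?
  already = @checked
  @checked = true
  !already
end

reloader.add_reloader(checker)
reloader.run!  # prints "Reloading!", returns true
reloader.run!  # returns false; nothing changed
```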
Rails includes a goodie for watching files. ActiveSupport::FileUpdateChecker
is great for:

- watching a list of files (`config/routes.rb` is watched this way)
- watching directories of files by extension (`app/**/*.rb` is watched this way)

You can create your own FileUpdateChecker
and add it to app.reloaders
to reload Rails when certain files change:
1 2 3 4 |
|
Some filesystems support an evented file watcher implementation, ActiveSupport::EventedFileUpdateChecker
. app.config.file_watcher
will return the proper filewatcher class for the current context.
1
|
|
react-rails
maintains a pool of V8 instances for server rendering React components. These instances are initialized with a bunch of JavaScript code, and whenever a developer changes a JavaScript file, we need to reload them with the new code. This requires two steps:
- an entry in `app.reloaders` to detect changes to JavaScript files
- a `to_prepare` hook to reload the JS instances

It looks basically like this:
1 2 3 4 5 6 7 8 9 10 11 12 13 |
|
The full implementation supports some customization. You can see similar (and more complicated) examples with routes reloading, i18n reloading and .rb
reloading.
Happy reloading!
In fact, loading a schema this way has been supported for a while, but 1.5.0 adds the ability to specify field resolution behavior.
Besides queries, GraphQL has an interface definition language (IDL) for expressing a schema’s structure. For example:
1 2 3 4 5 6 7 8 9 10 11 12 |
|
You can turn a definition into a schema with Schema.from_definition
:
1 2 |
|
(By the way, the IDL is technically in RFC stage.)
Schema.from_definition
also accepts a default_resolve:
argument. It expects one of two inputs:
- a `Hash<String => Hash<String => #call(obj, args, ctx)>>`; or
- any object that responds to `#call(type, field, obj, args, ctx)`
When you're using a hash, type names are the first-level keys, field names are the second-level keys, and the values are resolve functions (`#call(obj, args, ctx)`).

To get started, you can write the hash manually:
1 2 3 4 5 6 7 8 9 10 |
|
But you can also reduce a lot of boilerplate by using a hash with default values:
1 2 3 4 5 6 7 8 9 10 11 12 |
|
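For illustration, a default-valued hash might look like this. (A sketch only: the fallback convention of calling a same-named method on the object is my assumption, not necessarily the post's exact code.)

```ruby
Post = Struct.new(:title)

default_resolve = Hash.new do |types, type_name|
  types[type_name] = Hash.new do |fields, field_name|
    # Fallback: call a method of the same name on the object
    fields[field_name] = ->(obj, args, ctx) { obj.public_send(field_name) }
  end
end

# An explicit resolver can still be registered where the fallback won't do:
default_resolve["Query"]["post"] = ->(obj, args, ctx) { Post.new("Hello") }

# Every other field uses the method-call fallback:
default_resolve["Post"]["title"].call(Post.new("Snapshot testing"), {}, {})
# => "Snapshot testing"
```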
Isn’t that a nice way to set up a simple schema?
You can provide a single callable that responds to #call(type, field, obj, args, ctx)
. What a mouthful!
The advantage of that hefty method signature is that it’s enough to specify any resolution behavior you can imagine. For example, you could create a system where type modules were found by name, then methods were called by name:
1 2 3 4 5 6 7 8 9 10 11 |
|
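A hedged sketch of such a dispatcher, using a hypothetical `Resolvers` namespace and stand-in type/field objects that respond to `#name`:

```ruby
module Resolvers
  module Query
    def self.post(obj, args, ctx)
      { title: "Hello" }
    end
  end
end

# The single callable: find a module by type name, then
# send the field name as a method call.
default_resolve = ->(type, field, obj, args, ctx) {
  type_module = Resolvers.const_get(type.name)
  type_module.public_send(field.name, obj, args, ctx)
}

# Stand-ins for GraphQL type/field objects (they respond to #name):
FakeType = Struct.new(:name)
FakeField = Struct.new(:name)

default_resolve.call(FakeType.new("Query"), FakeField.new("post"), nil, {}, {})
# => { title: "Hello" }
```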
So, a single function combined with Ruby’s flexibility and power opens a lot of doors!
Doesn’t it remind you a bit of method dispatch? The arguments are:
| GraphQL Field Resolution | Method Dispatch |
|---|---|
| `type` | class |
| `field` | method |
| `obj` | receiver |
| `args` | method arguments |
| `ctx` | runtime state (cf `mrb_state`, `RedisModuleCtx`, or `ErlNifEnv`) |
Some schemas need other configurations in order to run:
- a `resolve_type` function to support union and interface types

To add these to a schema, use `.redefine`:
1 2 3 4 5 |
|
Rails has proven that “Convention over Configuration” can be a very productive way to start new projects, so I’m interested in exploring convention-based APIs on top of this feature.
In the future, I’d like to add support for schema annotations in the form of directives, for example:
1 2 3 |
|
These could be used to customize resolution behavior. Cool!
When modifying shared code or reconfiguring the schema, it can be hard to tell how it will really change. To help with this, set up a snapshot test for your GraphQL schema! This way, schema changes are captured in version control and show up during code review.
You can even track the schema from different contexts if you’re using GraphQL::Pro
’s authorization framework.
This approach was first described in GraphQL at Shopify.
Write a Rake task to get your schema’s definition and write it to a file:
1 2 3 4 5 6 7 8 9 10 |
|
You can run it from terminal:
1 2 |
|
This updates the file in your repo. Go ahead and check it in!
1 2 |
|
Any changes to the Ruby schema code must be reflected in the .graphql
file. You can give yourself a reminder by adding a test case which asserts that the GraphQL definition is up-to-date:
1 2 3 4 5 6 7 8 9 10 |
|
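Stripped of the GraphQL specifics, the assertion reduces to a string comparison against a checked-in file. (A sketch with illustrative names; the real test compares the schema's `to_definition` output against the dump.)

```ruby
require "tmpdir"

# Does the checked-in snapshot match the current definition string?
def schema_snapshot_matches?(path, current_definition)
  File.exist?(path) && File.read(path) == current_definition
end

path = File.join(Dir.mktmpdir, "schema.graphql")
File.write(path, "type Query {\n  post: Post\n}\n")

schema_snapshot_matches?(path, "type Query {\n  post: Post\n}\n")
# => true: the dump is up to date
schema_snapshot_matches?(path, "type Query {\n  post: Post\n  draft: Post\n}\n")
# => false: time to re-run the dump task
```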
If the definition is stale, you’ll get a failed test:
This reminder is helpful in development and essential during code review!
Now that your schema definition is versioned along with your code, you can see changes during code review:
If your schema looks different to different users, you can track multiple schema dumps. This is helpful if:
- you use the `:view` configuration of GraphQL::Pro's authorization
- you use `only:` / `except:` to manually filter your schema

Just provide the context:
argument to Schema.to_definition
as if you were running a query. (Also provide only:
/except:
if you use them.)
Print with a filter from the Rake task:
1 2 3 4 5 6 7 8 |
|
Test with a filter from the test case:
1 2 3 4 5 6 7 |
|
Now you can keep an eye on the schema from several perspectives!
graphql-ruby
1.5.0 will be released. Query execution will be ~70% faster than 1.3.0!
Let’s look at how we reduced the execution time between those two versions. Thanks to @theorygeek, who optimized the middleware chain and helped me pinpoint several other bottlenecks!
To track GraphQL execution overhead, I execute the introspection query on a fixture schema in graphql-ruby’s test suite.
On GraphQL 1.3.0, the benchmark ran around 22.5 iterations per second:
On master, it runs around 38 iterations per second:
That’s almost 1.7x faster!
1 2 |
|
So, how’d we do it?
To find where time was spent, I turned to ruby-prof. I wrapped GraphQL execution with profiling and inspected the result:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 |
|
A few things stood out:
Class#new
: this is time spent initializing new objects. I think initialization can also trigger garbage collection (if there’s not a spot on the free list), so this may include GC time.InstanceDefinable#ensure_defined
, which is part of graphql-ruby’s definition API. It’s all overhead to support the definition API, 😿.1748
times. Turns out, this is once per field in the response.25,403
seems like a lot of calls to Module#===
!Since Class#new
was the call with the most self
time, I thought I’d start there. What kind of objects are being allocated? We can filter the profile output:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
|
Lots of GraphQL internals! That’s good news though: those are within scope for optimization.
MiddlewareChain
was ripe for a refactor. In the old implementation, each field resolution created a middleware chain, then used it and discarded it. However, this was a waste of objects. Middlewares don’t change during query execution, so we should be able to reuse the same list of middlewares for each field.
This required a bit of refactoring, since the old implementation modified the array (with shift
) as it worked through middlewares. In the end, this improvement was added in 5549e0cf
. As a bonus, the number of created Array
s (shown by Array#initialize_copy
) also declined tremendously since they were used for MiddlewareChain
’s internal state. Also, calls to Array#shift
were removed, since the array was no longer modified:
1 2 3 4 |
|
🎉 !
The number of FieldResult
objects was also reduced. FieldResult
is used for execution bookkeeping in some edge cases, but is often unneeded. So, we could optimize by removing the FieldResult
object when we had a plain value (and therefore no bookkeeping was needed): 07cbfa89
A very modest optimization was also applied to GraphQL::Arguments
, reusing the same object for empty argument lists (4b07c9b4
) and reusing the argument default values on a field-level basis (4956149d
).
Some elements of a GraphQL schema don’t change during execution. As long as this holds true, we can cache the results of some calculations and avoid recalculating them.
A simple caching approach is to use a hash whose keys are the inputs and whose values are the cached outputs:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
|
The first lookup printed a message and returned a value, but the second lookup didn’t print anything. This is because the block wasn’t called; instead, the cached value was returned immediately.
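In miniature, the pattern looks like this (a reconstruction of the idea, not the post's exact snippet):

```ruby
# Keys are inputs; values are computed-once outputs stored by the default block.
EXPENSIVE_CACHE = Hash.new do |hash, key|
  puts "computing for #{key}..."
  hash[key] = key * 10  # stand-in for an expensive calculation
end

EXPENSIVE_CACHE[3]  # prints "computing for 3...", returns 30
EXPENSIVE_CACHE[3]  # prints nothing; returns the cached 30
```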
This approach was applied aggressively to GraphQL::Schema::Warden
, an object which manages schema visibility on a query-by-query basis. Since the visibility of a schema member would remain constant during the query, we could cache the results of visibility checks: first 1a28b104
, then 27b36e89
.
This was also applied to field lookup in 133ed1b1e
and to lazy_resolve
handler lookup in 283fc19d
.
yield
Instead of &block
Due to the implementation of Ruby’s VM, calling a block with yield
is much faster than block.call
. @theorygeek
migrated MiddlewareChain
to use that approach instead in 517cec34
.
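The two styles side by side (a minimal sketch; a tool like benchmark-ips can show the timing difference):

```ruby
def run_with_block(&block)
  block.call  # materializes the block as a Proc object first
end

def run_with_yield
  yield       # runs the block without materializing a Proc
end

run_with_block { 1 + 1 }  # => 2
run_with_yield { 1 + 1 }  # => 2
```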
In order to handle circular definitions, graphql-ruby’s .define { ... }
blocks aren’t executed immediately. Instead, they’re stored and evaluated only when a definition-dependent value is required. To achieve this, all definition-dependent methods were preceded by a call to ensure_defined
.
Maybe you remember that method from the very top of the profiler output above:
1 2 3 |
|
A fact about GraphQL::Schema
is that, by the time it is defined, all lazy definitions have been executed. This means that during query execution, calling ensure_defined
is always a waste!
I found a way to remove the overhead, but it was a huge hack. It works like this:
When a definition is added (with .define
):
1. find each definition-dependent method definition on the defined object and gather them into an array: `@pending_methods = method_names.map { |n| self.class.instance_method(n) }`
2. replace those methods with dummy methods that call `ensure_defined`
3. when `ensure_defined` runs, re-apply the methods from `@pending_methods`, overriding the dummy methods

This way, subsequent calls to definition-dependent methods don’t call `ensure_defined`. `ensure_defined` removed itself from the class definition after its work was done!
You can see the whole hack in 18d73a58
. For all my teasing, this is something that makes Ruby so powerful: if you can imagine it, you can code it!
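A toy version of the self-removing method trick (illustrative only; graphql-ruby's real version is hairier):

```ruby
class LazyDefined
  def initialize(&definition_block)
    @definition_block = definition_block
  end

  # Stub: pays the ensure_defined cost only on the first call.
  def name
    ensure_defined
    name  # re-dispatch; now hits the singleton method installed below
  end

  private

  def ensure_defined
    @definition_block.call(self)
    # Replace the stub for this object, so later calls skip ensure_defined:
    define_singleton_method(:name) { @name }
  end
end

thing = LazyDefined.new { |t| t.instance_variable_set(:@name, "Query") }
thing.name  # => "Query" (runs the deferred definition block)
thing.name  # => "Query" (straight to the installed method)
```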
Two minor releases later, the profile output is looking better! Here’s the output on master:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 |
|
Here are the wins:
ensure_defined
reduced by 100% 😆And, as shown in the benchmark above, 1.7x faster query execution!
There’s one caveat: these optimizations apply to the GraphQL runtime only. Real GraphQL performance depends on more than that. It includes application-specific details like database access, remote API calls and application code performance.
Here’s a quick experiment in running GraphQL field resolution in parallel, using the graphql gem.
I haven’t tried this extensively, but I had to satisfy my curiosity!
Let’s say we have a GraphQL schema which has long-running IO- or system-bound tasks. Here’s a silly example where the long-running task is sleep
:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
|
Let’s consider a query like this one:
1 2 3 4 5 6 7 8 9 10 11 12 13 |
|
How long will it take?
1 2 3 4 5 6 7 |
|
About 9 seconds: three sleep(3)
calls in a row.
The concurrent-ruby
gem includes Concurrent::Future
, which runs a block in another thread:
1 2 3 4 5 6 7 8 |
|
We can use it to put our sleep(3)
calls in different threads. There are two steps.
First, use a Concurrent::Future
in the resolve function:
1 2 3 4 5 6 |
|
Then, tell the Schema to handle Concurrent::Future
s by calling #value
on them:
1 2 3 4 |
|
Finally, run the same query again:
1 2 3 4 5 6 7 |
|
🎉 Three seconds! Since the sleep(3)
calls were in different threads, they were executed in parallel.
Ruby can run IO operations in parallel. This includes filesystem operations and socket reads (eg, HTTP requests and database operations).
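That point is easy to demonstrate with plain threads. (Concurrent::Future runs its block on a pool of threads like these.)

```ruby
start = Time.now
# Three "IO-bound" operations (stand-ins for HTTP or DB calls), each in a thread:
threads = 3.times.map { Thread.new { sleep(0.3) } }
threads.each(&:join)
elapsed = Time.now - start
# elapsed is ~0.3 seconds, not 0.9: the sleeps ran in parallel threads
```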
So, you could make external requests inside a Concurrent::Future
, for example:
1 2 3 |
|
Or, make a long-running database call inside a Concurrent::Future
:
1 2 3 |
|
Switching threads incurs some overhead, so multithreading won’t be worth it for very fast IO operations.
GraphQL doesn’t know which resolvers will finish first. Instead, it starts each one, then blocks until the first one is finished. This means that subsequent long-running fields may have to wait longer than they “really” need to. For example, consider this query:
1 2 3 4 5 6 |
|
Even with multithreading, this would take about 7 seconds to execute. First, GraphQL would wait for sleep(for: 5)
, then it would get to nestedSleep(for: 2)
, which would have already finished, then it would execute sleep(for: 2)
.
If your GraphQL schema is wrapping pre-existing HTTP APIs, using a technique like this could reduce your GraphQL response time.
graphql-ruby
is almost two years old! Today, I’m adding a new element to the project, GraphQL::Pro
.
I have three goals with GraphQL::Pro
:
Additionally, I’m starting a GraphQL Ruby newsletter.
Today, GraphQL::Pro
provides some integrations with third-party tools:
As time goes on, I’ll keep an eye out for other integrations that could be included in GraphQL::Pro
. (If you have a suggestion, I’d love to hear it!)
Some teams adopt GraphQL as a foundational element of their application. I’d like to provide them service (and peace of mind) as they build on that investment. GraphQL::Pro
customers have my ear for any performance issues, bugs or feature requests. They also have an assurance that I’ll continue to maintain and improve graphql-ruby
.
I really enjoy working on graphql-ruby
and I’m excited about the work to be done in 2017. But it’s no secret that open-source work can become an unrewarding, thankless grind. Charging money for GraphQL::Pro
provides me with a simple, concrete “reward” to continue the work. I hope this will be good for me, for the project, and for others who are invested in the project.
If any of this sounds good to you, you can buy GraphQL::Pro
at http://graphql.pro !
raise
is return
’s evil twin.
They both stop the execution of the current method. After a return
, nothing else is executed. After a raise
, nothing else is executed … maybe. The method may have a rescue
or ensure
clause which is executed after the raise
, so a reader must check for those.
They both change flow of control. return
gives control back to the caller. raise
may give control anywhere on the call stack, depending on the specific error and rescue
clauses. If all you see is a raise
, you can’t guess where it will be rescued!
They both send values to their new destination. return
provides the given value to the caller, who may capture the return value in a local variable. raise
provides the error object to the rescue
-er. return
can send any kind of value, but raise
can only send error objects.
They both create coupling across call stack frames. return
couples two adjacent call stack frames: caller depends on the return value. raise
→ rescue
couples far-removed stack frames: they may be adjacent, or they may be several frames removed from one another.
Sending values through a program by calling methods and return
-ing values is very predictable. If you return a different value, the caller will get a different value. To see where return values “go”, simply search for calls to that method.
Finding where raise
’d errors go is a bit more challenging. For example, this change:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
|
How can you tell if this is a safe refactor? Here are some considerations:
- When searching for affected `rescue`s, you have to keep the error’s ancestry in mind, finding bare `rescue`s, superclass-tagged `rescue`s and class-tagged `rescue`s.
- Those `rescue`s may consume the error object itself. For example, they may read its `#message` or other attached data. If you change any properties of the error object, you may break the assumptions of those `rescue`s.
- If the error will be `rescue`’d differently, you must also consider how execution flow will change in other methods. For example, some methods may be cut short because previously-`rescue`’d errors now propagate through them. Other methods which used to be cut short may now continue running, since errors are rescued in child method calls.
is located in a Ruby gem, these problems are even harder, because rescue
clauses may exist in your users’ code.
If your error patterns are well documented, ༼ つ ◕_◕ ༽つ 🏆
. Bravo, just don’t break your public API. Users might still make assumptions beyond the documentation, such as error ancestry or message values. Additionally, they could be monkey-patching library methods and applying rescue
-related assumptions to those patches.
If your error patterns aren’t documented, 💩 ノ༼ ◕_◕ ノ ༽
. You have no idea what assumptions users make about those errors! You can’t be sure your changes won’t break their code.
raise
can be replaced by return
. However, if you’re using raise
to traverse many levels of the call stack, the refactor will be intense. Take heart: previously you were hacking your way back up the call stack, now you’re creating a predictable, explicit flow through your program!
It’s worth repeating: don’t use exceptions for flow control.
Here are some techniques for expressing failures with return
.
1 2 3 4 5 6 7 8 9 10 11 |
|
Instead of raising a StandardError instance to the caller, use a Failure class to communicate failure. Additionally, use a Success class to communicate success. (This is similar to the “monad” technique, eg the dry-monads gem.)
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 |
|
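A sketch of the Success/Failure style, using the convert_file example (the method bodies here are hypothetical):

```ruby
Success = Struct.new(:value)
Failure = Struct.new(:error)

def convert_file(path)
  if path.end_with?(".md")
    Success.new("<h1>Converted!</h1>")
  else
    Failure.new("unsupported file type: #{path.inspect}")
  end
end

result = convert_file("notes.md")
case result
when Success then result.value  # the happy path
when Failure then result.error  # failure carries a message, unlike nil
end
```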
As a last resort, return nil
. Using nil
as an expression of failure has some downsides:
- nil can’t hold a message or any extra data
- in many cases, nil is a valid value

But, for simple operations, using nil
may be sufficient. Since it will be communicated via return
, refactoring it will be straightforward in the future!
raise
has its purposes.
raise
is a great way to signal that the program has reached a completely unexpected state and that it should exit. For example, in the convert_file
example above, we could use raise
to assert that we don’t receive an unexpected value from convert_file
:
1 2 3 4 5 6 7 8 9 |
|
Now, if the method ever returns some unexpected value, we’ll receive a loud failure. Some people use fail
in this case, which is also fine. However, the need to disambiguate raise
and fail
is a code smell: stop using raise
for non-emergencies!
raise
is also helpful for re-raising other errors. For example, if your library needs to log something when an error happens, it might need to capture the error, then re-raise it. For example:
1 2 3 4 5 6 7 8 9 10 11 12 13 |
|
This way, you can respond to the error without disrupting user code.
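A minimal sketch of that capture-and-re-raise pattern (`with_error_logging` is an illustrative name):

```ruby
def with_error_logging
  yield
rescue StandardError => err
  warn "[mylib] rescued: #{err.message}"  # respond to the error...
  raise                                   # ...then re-raise it unchanged
end

begin
  with_error_logging { raise ArgumentError, "boom" }
rescue ArgumentError => err
  err.message  # the original error reached "user code" intact
end
```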
In my own work, I’m transitioning away from raising errors and towards communicating failure by return values. This pattern is ubiquitous in languages like Go and Elixir. In Node.js, callbacks communicate errors in a similar way (callback arguments). I think Ruby code can benefit from this practice as well.
]]>