Sean Cribbs

Webmachine in Elixir Tutorial, Part 4

2016-08-17T00:00:00-05:00

In the previous installment, we displayed some dynamic content in our webpage that was loaded from an ets table. Even a social network themed around The Wire is no fun if you can’t add your favorite quotes to it. Let’s hook up a form so that we can submit to post new tweets, and a resource to accept that POST.

Creating tweets

Our HTML file already has portions of a form in it, but let’s add the ability to select your character.

      <div id="add-tweet-form" class="add-tweet-form">
        <select id="add-tweet-avatar" class="person">
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/f/f4/The_Wire_Jimmy_McNulty.jpg/250px-The_Wire_Jimmy_McNulty.jpg">Jimmy</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/1/15/The_Wire_Bunk.jpg/250px-The_Wire_Bunk.jpg">Bunk</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/6/6c/The_Wire_Kima.jpg/250px-The_Wire_Kima.jpg">Kima</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/7/73/The_Wire_Bubbles.jpg/250px-The_Wire_Bubbles.jpg">Bubbles</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/2/2f/The_Wire_Avon.jpg/250px-The_Wire_Avon.jpg">Avon</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/b/b7/The_Wire_Stringer_Bell.jpg/250px-The_Wire_Stringer_Bell.jpg">Stringer</option>
          <option value="http://upload.wikimedia.org/wikipedia/en/thumb/7/78/The_Wire_Omar.jpg/250px-The_Wire_Omar.jpg">Omar</option>
        </select>
        <input id="add-tweet-message" type="text" class="message" />
        <a id="add-tweet-submit" class="button post">POST!</a>
      </div>

Now we need some JavaScript to show and hide the “form”, and submit it.

  // Toggle the visibility of the form
  $('#add-tweet').click(function() {
    $('#add-tweet-form').toggle();
  });

  // Submit the form on click of the "POST!" button
  $('#add-tweet-submit').click(function() {
    var tweetMessageField = $('#add-tweet-message');
    var tweetMessageAvatar = $('#add-tweet-avatar');
    var tweetMessageForm = $('#add-tweet-form');
    var tweetMessage = tweetMessageField.val();
    var tweetAvatar = tweetMessageAvatar.val();

    $.ajax({
      type: 'POST',
      url: '/tweets',
      contentType: 'application/json',
      data: JSON.stringify({ tweet: {
        avatar: tweetAvatar,
        message: tweetMessage }}),
      success: function(d) {
        tweetMessageField.val('');
        tweetMessageForm.toggle();
      }
    });
  });

If you refresh the browser and click the button to “POST!”, you should get an error in the JavaScript console, specifically 405 Method Not Allowed. This is because our TweetList resource doesn’t accept POST!

We have two options here, we can modify our existing resource or create a new resource to handle the form submission. The difference will be whether we can tolerate the extra logic for accepting the form amongst the logic for producing a list of tweets, or whether we prefer to create a resource that only accepts the form. Personally, I could handle either way (and potentially change my mind later) but I’m going to choose the latter for clarity and to demonstrate another feature of Webmachine. Let’s create our new resource!

defmodule Tweeter.Resources.Tweet do
  # Boilerplate again! In this case the state is the contents of our
  # tweet, initially an empty identifier and attribute list.
  def init(_), do: {:ok, {nil, []}}
  def ping(req_data, state), do: {:pong, req_data, state}
end

Now, the whole goal of this resource was to accept POST requests, so we better allow them.

  # This resource supports POST for creating tweets. We colon-prefix
  # the POST atom to ensure the Elixir compiler doesn't treat it as
  # a module name:
  #
  #   iex> :io.format('~s~n', [POST])
  #   Elixir.POST
  #   :ok
  #   iex> :io.format('~s~n', [:POST])
  #   POST
  #   :ok
  #   iex> POST == :POST
  #   false
  def allowed_methods(req_data, state) do
    {[:POST], req_data, state}
  end

Now we can talk about what our options are for accepting the form. We can take two paths: first, we assume the accepting resource always exists and simply handles the POST in its own way; second, we assume the resource CREATES new resources, and treat it as a PUT to a new URI.

The latter way will feel very familiar if you’ve used Rails before, except that you need to pick the new URI before the request body is accepted! This trips developers up frequently, but is easy to work around. Since we already know how to allocate unique identifiers using monotonic_time, constructing a new unique URI should be straightforward.

Our first step is to tell Webmachine that our resource doesn’t exist, that POST means creating a new resource, and that it’s okay to allow POST when the resource doesn’t exist:

  # When we accept POST, our resource is missing!
  def resource_exists(req_data, state) do
    {false, req_data, state}
  end

  # Accepting POST means creating a resource.
  def post_is_create(req_data, state) do
    {true, req_data, state}
  end

  # Allow POST to missing resources
  def allow_missing_post(req_data, state) do
    {true, req_data, state}
  end

Now we should pick the URI where our new tweet will live. Whether or not we allow fetching that new URI is another question entirely! We might revisit that later in the tutorial.

  # Generate the path for the new resource, populating the ID in the
  # state as well.
  def create_path(req_data, {_id, attrs}) do
    new_id = System.monotonic_time
    {'/tweets/#{new_id}', req_data, {new_id, attrs}}
  end

If our application were backed by a database like PostgreSQL, we could use an SQL query here to fetch the next ID in the table’s sequence. Instead, we just call monotonic_time. Note how we capture the generated ID in the resource state for when we insert the tweet into the ETS table.

We’re submitting application/json from the Ajax request, but Webmachine doesn’t know that it’s ok to accept it, or what to do with it when it arrives. Similar to content_types_provided, we can specify this with the content_types_accepted callback.

  # We take JSON
  def content_types_accepted(req_data, state) do
    {[{'application/json', :from_json}], req_data, state}
  end

Finally, we parse the incoming JSON, extract the fields, and put the data in the ETS table.

  def from_json(req_data, {id, attrs}) do
    # We use a try here so that our pattern match throws if we fail to
    # decode or extract something from the request body.
    try do
      # Parse the request body, extracting the attributes of the tweet
      # First fetch the request body
      req_body = :wrq.req_body(req_data)
      # Second, decode the JSON and destructure it
      {:struct, [{"tweet", {:struct, attrs}}]} = :mochijson2.decode(req_body)
      # Now fetch the message and avatar attributes from the JSON
      {"message", message} = List.keyfind(attrs, "message", 0)
      {"avatar", avatar} = List.keyfind(attrs, "avatar", 0)
      # Finally construct the data to go into ETS
      new_attrs = [avatar: avatar, message: message, time: :erlang.timestamp]
      # Insert into ETS and return true
      :ets.insert(:tweets, [{id, new_attrs}])
      {true, req_data, {id, new_attrs}}
    rescue
      # If we threw from the above block, we should fail the request
      # from the client. MatchError could be raised from our
      # pattern-match when decoding JSON, or from :mochijson2
      # itself. CaseClauseError is raised by :mochijson2 alone when 
      # we get bad JSON.
      err in [MatchError, CaseClauseError] ->
        {false, req_data, {id, attrs}}
    end
  end

Before our new resource will work, however, we need to dispatch to it! Since we wanted to use the same URI as the TweetList resource, we need to make sure that only POST requests make it to the Tweet resource. This is where a route guard comes in. Route guard functions take one argument, the req_data we’ve been passing around, and should return a boolean. If the route is a 4-tuple, with the second element being a route guard function, that guard will be tested before dispatching is done (but after the path has matched).

# --- lib/tweeter.ex ---
    # Some configuration that Webmachine needs
    web_config = [ip: {127, 0, 0, 1},
                  port: 8080,
                  dispatch: [
                    # Note the guard function in the second position
                    {['tweets'], &(:wrq.method(&1) == :POST), Tweeter.Resources.Tweet, []},
                    {['tweets'], Tweeter.Resources.TweetList, []},
                    {[], Tweeter.Resources.Assets, []},
                    {[:'*'], Tweeter.Resources.Assets, []}
                  ]]

Now reload mix and see if you can post a tweet! (You might need to reload the page after posting too.)

Up next

In our next and final installment, we’ll learn how to deliver live updates to the client.

Webmachine in Elixir Tutorial, Part 3

2016-07-18T00:00:00-05:00

Last time we started serving static files from the priv directory, supporting validation caching, conditional requests, and compression. But since static files are pretty boring, let’s populate our application’s page with some dynamic content.

Dynamic content

Instead of the static “tweets” in the homepage, let’s replace them with some content injected via Ajax.

First, remove all of the <li> elements from the <ul> in priv/www/index.html (shown below):

<div id="content">
  <ul id="tweet-list" class="tweet-list">
  <!-- Remove all the "li" elements here -->
  </ul>
</div>

To generate the dynamic content, we need to keep the base data somewhere that we can fetch at runtime, we need to generate some JSON for the browser, and we need to render that JSON into HTML.

Let’s start by storing our tweets somewhere. Normally one would use a real database and probably pull in Ecto to handle it. Since this is just a tutorial, we’ll use ets (Erlang Term Storage). Create the ets table and populate it when the application starts up in the start/2 function in tweeter.ex.

  # Monotonic time gives us unique, increasing integers to use as
  # identifiers.
  defp now, do: :erlang.monotonic_time
  # The timestamp is a standard Erlang 3-tuple format expected by
  # Webmachine
  defp timestamp, do: :erlang.timestamp

## --- inside start/2

    # Create a public ets table to store our tweets
    :ets.new(:tweets, [:public, :ordered_set, :named_table,
                       read_concurrency: true, write_concurrency: true])

    # Insert a bunch of tweets
    :ets.insert(:tweets,
                [{now, [avatar: "http://upload.wikimedia.org/wikipedia/en/thumb/f/f4/The_Wire_Jimmy_McNulty.jpg/250px-The_Wire_Jimmy_McNulty.jpg",
                         message: "Pawns.",
                         time: timestamp]},
                 {now, [avatar: "http://upload.wikimedia.org/wikipedia/en/thumb/1/15/The_Wire_Bunk.jpg/250px-The_Wire_Bunk.jpg",
                         message: "A man's gotta have a code.",
                         time: timestamp]},
                 {now, [avatar: "http://upload.wikimedia.org/wikipedia/en/thumb/f/f4/The_Wire_Jimmy_McNulty.jpg/250px-The_Wire_Jimmy_McNulty.jpg",
                         message: "You boys have a taste?",
                         time: timestamp]}
                ])

If you refresh your browser now, you won’t see anything because we deleted the content without repopulating it. Instead, start up iex and view the table using :ets.i(:tweets):

$ iex -S mix
Erlang/OTP 18 [erts-7.2.1] [source] [64-bit] [smp:8:8] [async-threads:10] [hipe] [kernel-poll:false] [dtrace]

Compiled lib/tweeter.ex
Interactive Elixir (1.2.3) - press Ctrl+C to exit (type h() ENTER for help)
iex(1)> :ets.i(:tweets)
<1   > {-576460751446910149,
 [{avatar,<<"http://upload.wikimedia.org/wiki  ...
<2   > {-576460751446908987,
 [{avatar,<<"http://upload.wikimedia.org/wiki  ...
<3   > {-576460751446908195,
 [{avatar,<<"http://upload.wikimedia.org/wiki  ...
EOT  (q)uit (p)Digits (k)ill /Regexp -->q
:ok

For some reason, the monotonic time returns a negative number on my machine, but it should be sufficient to ensure uniqueness and ordering. Now we should get that content into the browser, so it’s time to make a new resource! Open up lib/tweeter/resources/tweet_list.ex and paste in the boilerplate:

defmodule Tweeter.Resources.TweetList do
  # Basic initialization
  def init(_), do: {:ok, []}

  # Boilerplate function, which we should inject later
  def ping(req_data, state), do: {:pong, req_data, state}
end

I’ve chosen a list this time for the resource state, because we have a list of tweets to send to the browser. Let’s tell Webmachine we want to serve JSON and how to render it:

def content_types_provided(req_data, state) do
  # Provide JSON!
  {[{'application/json', :to_json}], req_data, state}
end

def to_json(req_data, state) do
  # We assume we have already fetched the tweets from ets into the state:
  tweet_list = for {_id, attributes} <- state, do: {:struct, attributes}
  # We could use Poison or jsx here, but mochijson2 is included with
  # mochiweb.
  {:mochijson2.encode({:struct, [tweets: tweet_list]}), req_data, state}
end

Before we actually fetch the data from ETS, let’s hook up our new resource and see that it’s working.

# -- lib/tweeter.ex
    # Some configuration that Webmachine needs
    web_config = [ip: {127, 0, 0, 1},
                  port: 8080,
                  dispatch: [
                    {['tweets'], Tweeter.Resources.TweetList, []},
                    {[], Tweeter.Resources.Assets, []},
                    {[:'*'], Tweeter.Resources.Assets, []}
                  ]]

Now we should be able to bounce our server and see the JSON via curl:

$ curl -i http://localhost:8080/tweets
HTTP/1.1 200 OK
Server: MochiWeb/1.1 WebMachine/1.10.9 (cafe not found)
Date: Wed, 16 Mar 2016 15:49:36 GMT
Content-Type: application/json
Content-Length: 13

{"tweets":[]}

Now let’s go back and fetch the tweets from ETS properly:

# Fetches the data from ets, even though our resource "always"
# exists.
def resource_exists(req_data, _state) do
  {true, req_data, :ets.tab2list(:tweets)}
end

After bouncing our server, I get this response:

$ curl -i http://localhost:8080/tweets
HTTP/1.1 500 Internal Server Error
Server: MochiWeb/1.1 WebMachine/1.10.9 (cafe not found)
Date: Wed, 16 Mar 2016 15:51:11 GMT
Content-Type: text/html
Content-Length: 1166

<html><head><title>500 Internal Server Error</title></head><body><h1>Internal Server Error</h1>The server encountered an error while processing this request:<br><pre>{error,{exit,{json_encode,{bad_term,{1458,141859,956053}}},
             [{mochijson2,json_encode,2,
                          [{file,"src/mochijson2.erl"},{line,181}]},
              {mochijson2,'-json_encode_proplist/2-fun-0-',3,
                          [{file,"src/mochijson2.erl"},{line,199}]},
              {lists,foldl,3,[{file,"lists.erl"},{line,1262}]},
              {mochijson2,json_encode_proplist,2,
                          [{file,"src/mochijson2.erl"},{line,202}]},
              {mochijson2,'-json_encode_array/2-fun-0-',3,
                          [{file,"src/mochijson2.erl"},{line,189}]},
              {lists,foldl,3,[{file,"lists.erl"},{line,1262}]},
              {mochijson2,json_encode_array,2,
                          [{file,"src/mochijson2.erl"},{line,191}]},
              {mochijson2,'-json_encode_proplist/2-fun-0-',3,
                          [{file,"src/mochijson2.erl"},{line,199}]}]}}</pre><P><HR><ADDRESS>mochiweb+webmachine web server</ADDRESS></body></html>

Woops! We got that timestamp but didn’t make it something that makes sense in JSON. Let’s turn it into a numerical timestamp for the client (microseconds since the epoch, basically).

def to_json(req_data, state) do
  # We assume we have already fetched the tweets from ets into the state:
  tweet_list = for {_id, attributes} <- state do
    {:struct, put_in(attributes, [:time], convert_timestamp(attributes[:time]))}
  end
  # We could use Poison or jsx here, but mochijson2 is included with
  # mochiweb.
  {:mochijson2.encode(tweet_list), req_data, state}
end

defp convert_timestamp({mega, sec, micro}) do
  mega * 1000000 * 1000000 + sec * 1000000 + micro
end

Let’s try fetching our resource again:

$ curl -i http://localhost:8080/tweets
HTTP/1.1 200 OK
Server: MochiWeb/1.1 WebMachine/1.10.9 (cafe not found)
Date: Wed, 16 Mar 2016 15:51:57 GMT
Content-Type: application/json
Content-Length: 534

{"tweets":[{"avatar":"http://upload.wikimedia.org/wikipedia/en/thumb/f/f4/The_Wire_Jimmy_McNulty.jpg/250px-The_Wire_Jimmy_McNulty.jpg","message":"Pawns.","time":1458141859956053},{"avatar":"http://upload.wikimedia.org/wikipedia/en/thumb/1/15/The_Wire_Bunk.jpg/250px-The_Wire_Bunk.jpg","message":"A man's gotta have a code.","time":1458141859956054},{"avatar":"http://upload.wikimedia.org/wikipedia/en/thumb/f/f4/The_Wire_Jimmy_McNulty.jpg/250px-The_Wire_Jimmy_McNulty.jpg","message":"You boys have a taste?","time":1458141859956056}]}

Great! Now we can hook this up to our front-end with a little JavaScript. We’re going to keep it simple by using jQuery instead of a framework.

// priv/www/js/app.js
$(document).ready(function() {
  var generateTweet = function(tweet) {
    return "<li><div class='avatar' style='background: url(" +
           tweet.avatar +
           "); background-size: auto 50px; background-position: center center;'></div><div class='message'>"
           + tweet.message + "</div></li>";
  };

  $.ajax({
    url: '/tweets',
    success: function(d) {
      if(d.tweets) {
        var tweetList = $('#tweet-list');

        d.tweets.reverse().forEach(function(i) {
          tweetList.append(generateTweet(i));
        });
      }
    }
  });
});

And hook up our JavaScript code at the bottom of index.html:

    </article>
    <!-- This is probably bad form, but it's a demo. -->
    <script type="application/javascript" src="http://code.jquery.com/jquery-1.12.1.min.js"></script>
    <script type="application/javascript" src="js/app.js"></script>
  </body>
</html>

Refresh your browser and see the Ajax populate the tweet-list!

Up next

Now that we’ve got some dynamic content being displayed, in the next installment we’ll allow the browser to post new tweets and insert them into our ets table.

Webmachine in Elixir Tutorial, Part 2

2016-07-14T00:00:00-05:00

Last time, we got our project set up and serving some simple dynamic content. In this installment, we’ll show how to serve static files via Webmachine so we can discuss lots of its best features.

Serving static files

Most times you would let a web-server like Apache or nginx serve your static files, but for our tutorial it’s nice to serve our content directly via Webmachine. By doing so, we can demonstrate several important features related to the dispatcher, content-negotiation, and conditional requests. Basically, everything you’d expect out of a well-configured web-server but in a resource module! First make a priv directory (where OTP apps store non-code files) and we’ll put our design assets in there. For now, we’ll just copy files from the webmachine-tutorial repo.

$ mkdir -p priv/www/css priv/www/img
$ curl -o priv/www/css/master.css https://raw.githubusercontent.com/cmeiklejohn/webmachine-tutorial/bf86b8230259ed710bf1ab3f32a5c64bfb9f03bc/priv/www/css/master.css
$ curl -o priv/www/index.html https://raw.githubusercontent.com/cmeiklejohn/webmachine-tutorial/bf86b8230259ed710bf1ab3f32a5c64bfb9f03bc/priv/www/index.html
$ curl -o priv/www/img/noise.png https://raw.githubusercontent.com/cmeiklejohn/webmachine-tutorial/bf86b8230259ed710bf1ab3f32a5c64bfb9f03bc/priv/www/img/noise.png

Let’s make a new resource module called Tweeter.Resources.Assets and fill out the boilerplate. For our resource state, we’ll use a map this time, but we’ll probably change it to a struct later.

defmodule Tweeter.Resources.Assets do
  # Basic initialization
  def init(_), do: {:ok, %{}}

  # Boilerplate function, which we should inject later
  def ping(req_data, state), do: {:pong, req_data, state}
end

Now we need to think about a few things, namely, how to determine which file is being requested, what media type it is, and then how to read it from the filesystem out to the client. Let’s start from the end, assuming we’ve already determined the correct file to read. We’ll make a body-producing function that simply reads the file and sends it to the client. This is not the most efficient way – sendfile() or other streaming would be better – but we are serving small files so it won’t be too bad.

  # Body-producing function
  def produce_resource(req_data, %{filename: filename} = state) do
    {File.read!(filename), req_data, state}
  end

That was easy! Continuing backwards through our list, let’s determine the media type of the file and point it at our body-producing function using the content_types_provided Webmachine callback. This callback tells Webmachine what media types you provide, and what to call to produce each one. Since ours is just reading a file from the filesystem, we’ll call produce_resource, but vary the type it produces.

  # Content-negotiation callback
  def content_types_provided(req_data, state) do
    filename = case :wrq.disp_path(req_data) do
                 '' -> 'index.html'
                 f  -> f
               end
    media_type = :webmachine_util.guess_mime(filename)
    {[{media_type, :produce_resource}], req_data, state}
  end

This is the first time we’ve used a Webmachine library function in a resource. :wrq.disp_path gives us the portion of the path that the dispatcher matched against. So at the root URL, this will be the empty string, otherwise, it’ll be a partial path to some file, like css/master.css. Then :webmachine_util.guess_mime is used to guess what a proper media type will be. For fun, let’s try that function from the shell via iex -S mix.

iex(1)> :webmachine_util.guess_mime('foo.png')
'image/png'
iex(2)> :webmachine_util.guess_mime('application.js')
'application/x-javascript'
iex(3)> :webmachine_util.guess_mime('home.html')
'text/html'
iex(4)> :webmachine_util.guess_mime('module.erl')
'text/plain'

Now that we have a body producing function, and the correct MIME type, let’s find the file on the filesystem, via one of the most important callbacks resource_exists. Obviously, if the file doesn’t exist in our static assets, we should return a 404 Not Found, and this is also a perfect place to populate the state with an absolute path to the requested file.

  # Find the file!
  def resource_exists(req_data, state) do
    priv_dir = Path.join :code.priv_dir(:tweeter), "www"
    absolute_path = Path.join(priv_dir, :wrq.disp_path(req_data)) |> Path.expand
    {File.regular?(absolute_path),
     req_data,
     %{state | filename: absolute_path}}
  end

Before we move on, there’s some repeated functionality with content_types_provided, and we have a minor bug too – at the root path we want to serve index.html. Let’s extract that shared functionality into a new function.

  # Find the file!
  def resource_exists(req_data, state) do
    # Find the root of our static files, add the identified path
    file_path = Path.join [:code.priv_dir(:tweeter), "www", identify_file(req_data)]
    # Compute the full path
    absolute_path = Path.expand file_path
    {File.regular?(absolute_path),
     req_data,
     Map.put(state, :filename, absolute_path)}
  end

  # Content-negotiation callback
  def content_types_provided(req_data, state) do
    media_type = req_data |>
      identify_file |>
      String.to_char_list |>
      :webmachine_util.guess_mime
    {[{media_type, :produce_resource}], req_data, state}
  end

  # Identifies the file we're trying to serve, normalizing path
  # segments
  defp identify_file(req_data) do
    # Getting the path tokens removes any duplicate slashes
    case :wrq.path_tokens(req_data) do
      # At the root path (no tokens), we want to serve index.html
      [] -> ["index.html"]
      # Otherwise serve the path they asked for
      toks -> toks
    end |> Path.join
  end

To get this resource to actually serve content, we now need to hook it up to the dispatcher. Let’s edit tweeter.ex again, replacing our Hello resource with Assets.

    # Some configuration that Webmachine needs
    web_config = [ip: {127, 0, 0, 1},
                  port: 8080,
                  dispatch: [
                    {[], Tweeter.Resources.Assets, []},
                    {[:*], Tweeter.Resources.Assets, []}
                  ]]

Note the special path segment :*. This tells the Webmachine dispatcher to match any number of trailing path segments. Kill/restart your mix process and refresh the page!

Reducing waste

This strategy of reading a file from disk and sending it to the client is as old as the web itself, but there’s much more we can do! HTTP includes caching in the protocol, and it’d be pretty inefficient for a client to fetch unchanged design assets every time they refresh the page.

Let’s add some simple validation caching to our assets resource. We can start by using the last_modified callback.

  # Last-Modified date
  def last_modified(req_data, %{filename: filename} = state) do
    mtime = File.stat!(filename, time: :universal).mtime
    {mtime, req_data, state}
  end

This is pretty simple: we read the file statistics, pulling out the mtime field which represents when it was last modified. We can use the File.stat! function instead of its safe equivalent because of the flow of Webmachine’s decision graph. That is, we know that last_modified will not be called if resource_exists returns false.

We can go even further by using entity tags, or “ETag” for short. These are usually a hash string of various aspects of the file’s metadata. Since we might be doing that File.stat! call in multiple places, let’s put it in resource_exists while we’re at it and save the result.

  # Find the file!
  def resource_exists(req_data, state) do
    # Find the root of our static files, add the identified path
    file_path = Path.join [:code.priv_dir(:tweeter), "www", identify_file(req_data)]
    # Compute the full path
    absolute_path = Path.expand file_path
    state = Map.put(state, :filename, absolute_path)
    # Return true if it exists and read the file info into the state
    # for future callbacks
    if File.regular?(absolute_path) do
      state = Map.put(state, :fileinfo, File.stat!(absolute_path))
      {true, req_data, state}
    else
      {false, req_data, state}
    end
  end

  # Last-Modified date
  def last_modified(req_data, %{fileinfo: fileinfo} = state) do
    {fileinfo.mtime, req_data, state}
  end

  # ETag
  def generate_etag(req_data, %{fileinfo: fileinfo} = state) do
    hash = {fileinfo.inode, fileinfo.mtime} |>
      :erlang.phash2 |>
      :mochihex.to_hex
    {hash, req_data, state}
  end

We use the built-in :erlang.phash2 function to compute the ETag, but you should probably use a better hash in other resources.

Finally, I noticed that our CSS and HTML, although small, are still multiple kilobytes. We can reduce the transmission time significantly through compression, using the encodings_provided callback. Somewhat similar to content_types_provided, it returns a list of pairs, where the first is an encoding and the second is a fn that performs the encoding.

  # Compression selection
  def encodings_provided(req_data, state) do
    {[{'identity', &(&1)}, # identity function!
      {'gzip', &:zlib.gzip/1},
      {'deflate', &:zlib.zip/1}],
     req_data, state}
  end

Note that this is a case again where Webmachine requires character lists and not binaries (single-quoted strings). Now that our compression is in place, I see the index.html file going from ~1KB to 560B, and the CSS file from 6.7KB to ~1KB. Nice bandwidth savings!

Up next

In the next installment, we’ll learn start serving some dynamically-generated content from a resource.

Webmachine in Elixir Tutorial, Part 1

2016-07-07T00:00:00-05:00

Webmachine is an Erlang project I’ve known and used for years, especially thanks to my experience at Basho. One of the things that made me proud to use it was its opinionated nature about HTTP, namely, that in most cases you don’t have to manually set status codes or supply response headers. Instead, you supply predicate functions that answer very specific questions about your HTTP resource, and the Webmachine FSM calls those to determine how to respond to each request. This leads to a very declarative style, which is a great fit for functional programming, and it becomes simple to extend your resource with more capabilities.

In contrast, the trend in the Elixir community is to use Plug, often in conjunction with Phoenix. Plug is very much in the style of Rack and WSGI which came before it, and I’ve ranted about the problems with those before (and will not repeat here).

That said, someone has “ported” Webmachine to Elixir and provided a complicated macro-based DSL with which to declare resources. When I ported Webmachine to Ruby, I eschewed DSLs for a simple inherit-and-override model which was incredibly successful and easy to understand, in my opinion, not to mention faster and more efficient than hacks using instance_eval.

In this post, I hope to convince you that Elixir’s simplicity and direct interaction with Erlang libraries makes it possible to use Webmachine in your Elixir project, without a rewrite or translation layer.

Before I dive into it, you should be aware some caveats.

Most Elixir projects use Cowboy or Elli for the webserver. Unfortunately, Webmachine is only compatible with mochiweb; although previously efforts have been made to connect Yaws and Cowboy, those have never completed.

There’s some cruft in Webmachine, simply because it was started before 2009. I hope it won’t be too onerous to work around.

At the time of writing, I’m running Elixir 1.2.3 atop Erlang/OTP 18.2.1 on Mac OS/X, both installed via homebrew.

I’ll assume you know a little bit about Elixir syntax and idioms, and won’t add too much discussion of the basics.

Getting Started

Let’s start by making a new project with mix, and then we’ll add Webmachine and a basic resource that returns some HTML. I’ll be roughly following the tutorial that Chris Meiklejohn and I gave to LambdaJam in 2013, which is a very basic Twitter clone called “Tweeter”.

$ mix new tweeter --sup
* creating README.md
* creating .gitignore
* creating mix.exs
* creating config
* creating config/config.exs
* creating lib
* creating lib/tweeter.ex
* creating test
* creating test/test_helper.exs
* creating test/tweeter_test.exs
$ cd tweeter

Now we can add Webmachine to the project and compile. First, edit mix.exs to add Webmachine.

defmodule Tweeter.Mixfile do
  use Mix.Project

  def project do
    [app: :tweeter,
     version: "0.0.1",
     elixir: "~> 1.2",
     build_embedded: Mix.env == :prod,
     start_permanent: Mix.env == :prod,
     deps: deps]
  end

  def application do
    # Add :webmachine here
    [applications: [:logger, :webmachine],
     mod: {Tweeter, []}]
  end

  defp deps do
    # Add webmachine dep here
    [{:webmachine,
      git: "https://github.com/webmachine/webmachine.git",
      branch: "master"}]
  end
end

Fetch the dependencies and compile!

$ mix deps.get
* Getting webmachine (https://github.com/webmachine/webmachine.git)
Cloning into '/Users/scribb201/dev/tweeter/deps/webmachine'...
remote: Counting objects: 3172, done.
remote: Compressing objects: 100% (4/4), done.
remote: Total 3172 (delta 1), reused 0 (delta 0), pack-reused 3168
Receiving objects: 100% (3172/3172), 2.89 MiB | 146.00 KiB/s, done.
Resolving deltas: 100% (1767/1767), done.
Checking connectivity... done.
A new Hex version is available (v0.11.1), please update with `mix local.hex`
Running dependency resolution
Dependency resolution completed successfully
  mochiweb: v2.12.2
* Getting mochiweb (Hex package)
Checking package (https://s3.amazonaws.com/s3.hex.pm/tarballs/mochiweb-2.12.2.tar)
Fetched package
Unpacked package tarball (/Users/scribb201/.hex/packages/mochiweb-2.12.2.tar)

$ mix compile
==> mochiweb (compile)
Compiled src/reloader.erl
Compiled src/mochiweb_websocket.erl
Compiled src/mochiweb_util.erl
Compiled src/mochiweb_socket.erl
Compiled src/mochiweb_session.erl
Compiled src/mochiweb_response.erl
Compiled src/mochiweb_socket_server.erl
Compiled src/mochiweb_multipart.erl
Compiled src/mochiweb_io.erl
Compiled src/mochiweb_mime.erl
Compiled src/mochiweb_http.erl
Compiled src/mochiweb_headers.erl
Compiled src/mochiweb_request.erl
Compiled src/mochiweb_echo.erl
Compiled src/mochiweb_cover.erl
Compiled src/mochiweb_cookies.erl
Compiled src/mochiweb_base64url.erl
Compiled src/mochiweb_acceptor.erl
Compiled src/mochiweb.erl
Compiled src/mochiutf8.erl
Compiled src/mochiweb_html.erl
Compiled src/mochitemp.erl
Compiled src/mochilogfile2.erl
Compiled src/mochilists.erl
Compiled src/mochinum.erl
Compiled src/mochijson.erl
Compiled src/mochihex.erl
Compiled src/mochiglobal.erl
Compiled src/mochijson2.erl
Compiled src/mochifmt_std.erl
Compiled src/mochifmt_records.erl
Compiled src/mochifmt.erl
Compiled src/mochiweb_charref.erl
==> webmachine (compile)
Compiled src/wrq.erl
Compiled src/wmtrace_resource.erl
Compiled src/webmachine_util.erl
Compiled src/webmachine_sup.erl
Compiled src/webmachine_router.erl
Compiled src/webmachine_perf_log_handler.erl
Compiled src/webmachine_resource.erl
Compiled src/webmachine_mochiweb.erl
Compiled src/webmachine_multipart.erl
Compiled src/webmachine_logger_watcher_sup.erl
Compiled src/webmachine_logger_watcher.erl
Compiled src/webmachine_log.erl
Compiled src/webmachine_error_log_handler.erl
Compiled src/webmachine_error.erl
Compiled src/webmachine_error_handler.erl
Compiled src/webmachine_deps.erl
Compiled src/webmachine_dispatcher.erl
Compiled src/webmachine_app.erl
Compiled src/webmachine_access_log_handler.erl
Compiled src/webmachine.erl
Compiled src/webmachine_request.erl
Compiled src/webmachine_decision_core.erl
Compiled lib/tweeter.ex
Generated tweeter app
Consolidated List.Chars
Consolidated String.Chars
Consolidated Collectable
Consolidated Enumerable
Consolidated IEx.Info
Consolidated Inspect

My First Resource

Now that we’ve verified we can build everything, let’s hook up the webserver and a basic resource to serve some content. To do that, we need to start the server as a child of our application supervisor. Luckily, we generated a supervisor when we ran mix new. Edit lib/tweeter.ex to look like this:

defmodule Tweeter do
  use Application

  def start(_type, _args) do
    import Supervisor.Spec, warn: false

    # Some configuration that Webmachine needs
    web_config = [ip: {127, 0, 0, 1},
                  port: 8080,
                  dispatch: []]

    # Add the webmachine+mochiweb listener
    children = [
      worker(:webmachine_mochiweb, [web_config],
             function: :start,
             modules: [:mochiweb_socket_server])
    ]

    opts = [strategy: :one_for_one, name: Tweeter.Supervisor]
    Supervisor.start_link(children, opts)
  end
end

Now run the app and go to http://127.0.0.1:8080/.

$ mix run --no-halt

You should get a very “Web 1.0” 404 page. Hit Ctrl-C a ENTER in the shell to exit the Mix process. Let’s add a “Hello, World” resource so we can get something other than the 404 page. Following Elixir conventions, we’ll build out our source tree to separate concerns.

$ mkdir -p lib/tweeter/resources
$ $EDITOR lib/tweeter/resources/hello.ex

Below is what we’ll put into hello.ex, along with a little explanation of why.

defmodule Tweeter.Resources.Hello do
  # Initializes the resource's state, which is nothing right now
  def init(_) do
    {:ok, nil}
  end

  # Required callback, but almost never overridden! Most people use
  # service_available/2 instead. In Erlang, we'd do this to avoid it:
  #
  #    -include_lib("webmachine/include/webmachine.hrl").
  #
  # Maybe later we'll make a macro that automatically includes it.
  def ping(req_data, state) do
    {:pong, req_data, state}
  end

  # Default body callback, producing HTML.
  def to_html(req_data, state) do
    {"<html><body>Hello, World!</body></html>", req_data, state}
  end
end

That isn’t too bad, right? You do have some strange 3-tuple return values. What are they?

The 3-tuple return values thread the request/response data and the resource state through the functions, while the first element of each is the callback-specific return value. So, for ping/2 we need to return :pong for success, and to_html/2 needs to return the response body. This pattern will repeat as we add more functionality to our resources.

Now let’s hook it up to the webserver in tweeter.ex:

    # Some configuration that Webmachine needs
    web_config = [ip: {127, 0, 0, 1},
                  port: 8080,
                  dispatch: [
                    {[], Tweeter.Resources.Hello, []}
                  ]]

What we’ve modified here is the “dispatch list”, or “routes” as they are called in other frameworks. The [] in the first position says we’ll match the root URL, or /. The second position is the module name of our resource. The third position is any additional arguments to initialize our resource (passed to init/1 on startup). Now we can try it out! Run mix run --no-halt again and refresh your browser. You should see “Hello, World!” on the screen.

Up next

In the next installment, we’ll learn some more parts of Webmachine resources to serve up some static content.

In Search of the Software Ursatz: Part 1 - Introduction

2014-03-13T00:00:00-05:00

This is the first in a series based on the talk I gave at Code PaLOUsa 2014 entitled “In Search of the Software Ursatz”.

It has been a long time since I wrote on this blog, but it has been an equally long time (or more) that I have been thinking about the topic of this series of posts.

What I want introduce to you, dear reader, is a set of musical theories that have been influential in my thought process as a musician and a programmer, in the hopes that they bring about deeper insights to all of us. This has been a difficult thing to begin writing and talking about. The concepts are very abstract, and connecting them concretely to the work we do requires both a strong gut feeling and occasional leaps of faith.

The ideas I will try to construct are also in their infancy; indeed, I proposed the talk to Code PaLOUsa with very ambitious goals, but I haven’t yet connected all of the dots. Luckily, the title of the talk and these posts are “In Search of…” not “I Found It!”. The musical theories I will discuss herein took the author over thirty years – essentially his entire career – to develop; I am just in the beginning stages of developing my theory.

That said, I apologize if it seemed through the title or the talk abstract that I have a Grand Unified Theory of Software Design. That is not the case. Instead, I will look at some important questions and point in some directions I think we could pursue in the future.

In Search of:

To begin this discussion, I’d like to outline what it is exactly that I am seeking.

First, there has been a trend in recent years to see code as craftsmanship. This seems to mean not just the creation of code to do a job, but a skilled work that also requires aesthetics beyond the measurable aspects of the product. What does it mean to be a software craftsperson? How does that quality reflect in the products of the crafters? I’d like to know the answers, or at least better understand the questions.

Second, I’d like an intuitive technique for analyzing the structure of software. By intuitive I mean that its interpretation is obvious to the experienced practitioner, and draws from a deep understanding of software construction by the analyst.

Third, I’d like a subjective means of critical comparison of software designs. That is, I’m not interested in performance comparisions, SLOC counts, but what makes the software what it is. For example, what are the defining aspects of the functional program versus the object-oriented program and why is one subjectively better in various circumstances. How do the surface features of a program convey or obscure its meaning?

Fourth, I’m looking for the why not the how. We have many Turing-complete languages and rich tools that can express essentially the same computation. We have a wealth of information on the Internet (e.g. stackoverflow) that can tell us how to accomplish things. Given so many choices, what are the designs that win out and what makes them better than others? What makes them tick?

Why care?

Those are great goals to achieve, but why do I care about them?

If we accept the idea that we should strive to be software craftspersons, we need a framework for critical thought about our craft that goes beyond surface details. We can only improve our craft if we deeply understand the things we create and turn a critical eye to our own work.

As Rich Hickey has so eloquently put it, there is a strong distinction between simplicity and ease. Simple things require deep understanding to wield, easy things are often canned solutions that are quickly outgrown.

Finally, I believe that the software systems we build reflect – and in some cases “leak” – the foundations on which they are built. Understanding those foundations and the interaction between different layers is essential to building successful, well-crafted software.

A Tale of Two Pieces

Below are two works from the Tonal period (an umbrella term encompassing works from approximately 1600 to 1850). The first is Prelude no. 1 in C from The Well-Tempered Clavier, Book 1 by Johann Sebastian Bach. Many of you will know this piece, even if you are not a musician, as it has become very popular in weddings and television spots in recent years.

The second will be less familiar to most, except the pianists. It is Etude in F Major, Op. 10 no. 8 by Frederik Chopin.

Would you believe, aside from the fact that both of these pieces were intended for keyboard instruments, that they have the same fundamental structure? They sound very different on the surface, but use the same techniques, in different combinations, for expounding upon the deep structure of Tonal music.

That realization is the genius of the theories of Heinrich Schenker, who I will discuss in more depth in the next post.