We are made in NY

Can you spot the AppNexus shark? (hint after the jump)

AppNexus is proud to be made and coded in NYC. We are also developing around the world in eight cities, five countries, and three continents*.


Engineering @ Scale Series: Kickoff Event Jan. 30

Join us on January 30 at 6:30pm for the kickoff of our Engineering @ Scale speaker series at our NYC headquarters. The first talk, “Rapid Iteration With Python: Scaling AppNexus,” will be presented by Optimization and Analytics Director Dave Himrod and Engineering Manager Steve Kannan.

At AppNexus, we’ve experienced explosive growth over the last three years. That growth has required us to rapidly scale and iterate on the optimization tools we use for big data exploration and aggregation. Python’s versatility lets us use it for both offline analytical tasks and production system development, which is harder to do with many more complicated programming languages. Using it for both lets us bridge the gap between prototypes and production by relying on the same code libraries and frameworks, thereby tightening our innovation loop. We’d like to share our best practices and lessons learned when iterating and scaling with Python.

Join us for beers, pizza, and Python!

And of course, the event is free.



A Hadoop Success Story: Horizontally Scaling Our Data Pipeline

With 2000% growth in 2011, a data pipeline bursting at the seams, and a 24/7/365 uptime requirement, AppNexus engineers faced a task likened to changing the engine of a 747 in midflight. To scale the pipeline and data reporting horizontally, we adapted a variety of technologies, with Hadoop, HBase, and Hive composing the core elements.

At a presentation for the NYC Hadoop Meetup on February 28, 2012, Director of Engineering Ersin Yilmaz and Senior Software Engineer Sateesh Lakkarsu shared lessons learned about hardware application, monitoring, rollback, failover nodes, controlling memory allocation, and day-to-day fire drills. A recording of the presentation is below:


What we do for fun… redux

Here’s a little ditty Rick and Brett whipped up for AppNexus Summit NYC 2012. Enjoy!


Intern Series: Diary of an Ad Server Intern

These days, the guys and gals on the data pipeline team find themselves in an interesting position: wrangling 20TB of billing data and records into databases and data warehouses every single day. Our optimization team couldn’t be more pleased: the more data they have to work with, the better and more fine-tuned their bidding algorithms can be. However, this also presents an immense technical challenge for the rest of the company. Not only do we store all of this data, but we also provide reports for clients that let them know how their campaigns are doing on our platform, and we ship these reports once an hour. Finally, all of this data has to be redundant. No one would be happy if a datacenter’s worth of logs went missing.

To solve the storage and network transport problems induced by our growth, we built a system called Packrat. Its function is simple: receive data in rows, maybe store a copy, and then ship it out elsewhere. Packrat is developed and maintained by the ad server team, since we interface so closely with our core ad server and bidding engine, but Packrat’s real purpose is to move the data to the people who can deal with it effectively: the data team. To further our data redundancy effort, each of our three global datacenters restreams all of its data to another. As you might imagine, we transmit a tremendous amount of data all over the world, and network bandwidth is a huge factor in capacity planning and our overall growth.
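
The "receive, maybe store, ship" flow described above can be sketched in a few lines of Python. This is only an illustration and not Packrat's actual code: the function names (`relay_rows`, `_flush`), the batch size, and the idea that a row is a single string are all assumptions made for the example.

```python
from typing import Callable, Iterable, List, Optional

Row = str  # assumption: one row is one line of delimited text


def _flush(
    batch: List[Row],
    ship: Callable[[List[Row]], None],
    store: Optional[Callable[[List[Row]], None]],
) -> int:
    """Optionally store a copy of the batch, then ship it onward."""
    if store is not None:
        store(list(batch))  # keep a local copy for redundancy
    ship(list(batch))       # send the batch to its destination
    return len(batch)


def relay_rows(
    rows: Iterable[Row],
    ship: Callable[[List[Row]], None],
    store: Optional[Callable[[List[Row]], None]] = None,
    batch_size: int = 1000,
) -> int:
    """Receive rows, group them into batches, and relay each batch.

    Returns the total number of rows relayed.
    """
    batch: List[Row] = []
    total = 0
    for row in rows:
        batch.append(row)
        if len(batch) >= batch_size:
            total += _flush(batch, ship, store)
            batch = []
    if batch:  # relay whatever is left over at the end
        total += _flush(batch, ship, store)
    return total
```

Passing the `ship` and `store` steps in as callables keeps the sketch independent of any particular network or storage layer.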

When I started at AppNexus, I was told that one of my big projects for the summer would be compressing large portions of our data before it crosses the wire.
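
To give a feel for the idea, here is a minimal sketch of compressing a batch of rows before sending it and restoring it on the other side. It uses zlib from Python's standard library purely as a stand-in; the excerpt does not say which compression scheme was actually used, and the function names and row format are assumptions made for the example.

```python
import zlib
from typing import List


def compress_batch(rows: List[str], level: int = 6) -> bytes:
    """Join rows with newlines and compress the result with zlib.

    Assumes rows are non-empty and contain no newline characters.
    """
    payload = "\n".join(rows).encode("utf-8")
    return zlib.compress(payload, level)


def decompress_batch(blob: bytes) -> List[str]:
    """Reverse compress_batch: decompress and split back into rows."""
    text = zlib.decompress(blob).decode("utf-8")
    return text.split("\n") if text else []
```

Log-style rows tend to repeat the same fields over and over, which is why batching many rows together before compressing usually pays off more than compressing each row on its own.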
