March 13, 2008

More Twitter weirdness

Twitter commonly has the problem of duplicate tweets. That is, if you post a message, it shows up twice. After a little while, the dupe disappears, but if you delete the dupe manually, the original is gone too.

I presume what’s going on is that tweets are cached, the tweets are eventually batched to disk, and they don’t always get deleted from cache until some time after they’re persisted. If you happen to check the page of your recent tweets inbetween — boom, you get two hits. But what I don’t understand is why the two versions have different timestamps.

Presumably, this could be explained at a MySQL User Conference session next month, one of whose topics will be Intelligent caching strategies using a hybrid MemCache / MySQL approach. I’m so glad they don’t use stupid strategies to do this …

Of course, caching weirdness is just one of many reasons Twitter needs to be rearchitected.

Edit: Here’s an interesting write-up of Twitter’s scaling strategies as of April, 2007. Twitter wrote its own queueing technology called Starling, and later open-sourced it. Hat tip to @Tricon.

Categories: Cache, MySQL, OLTP, Specific users

Subscribe to our complete feed!

Comments

3 Responses to “More Twitter weirdness”

Frank on March 14th, 2008 2:46 am

I can only imagine the extent to which Twitter applies creativity to scaling issues. I missed it last year but planning to attend this year. Not because I believe in Rails but because there should be some valuable lessons to be learned from their experience.
Jay Pipes on March 17th, 2008 2:16 pm

Hi Curt! Believe it or not, Twitter’s architecture is being discussed by Blaine Cook, one of it’s architects, at the MySQL conference:

http://en.oreilly.com/mysql2008/public/schedule/detail/631

Cheers,

Jay
Curt Monash on March 18th, 2008 8:47 am

Jay,

You just repeated a link I had in the original post — and the main subject of the post was duplication of information.

😀

Best,

CAM

Leave a Reply

Search our blogs and white papers

Monash Research blogs

DBMS 2 covers database management, analytics, and related technologies.
Text Technologies covers text mining, search, and social software.
Strategic Messaging analyzes marketing and messaging strategy.
The Monash Report examines technology and public policy issues.
Software Memories recounts the history of the software industry.

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.

Links
- Monash Research
- White Papers
Admin
- Log in

More Twitter weirdness

Comments

Search our blogs and white papers

Monash Research blogs

User consulting

Vendor advisory

Monash Research highlights

Recent posts

Categories

Date archives

Admin