I’ve been working on an interesting application at work that needs to be fast, the faster the better in fact. I wrote a couple of quick and dirty implementations in my scratchpad in Eclipse and figured I could get about fifty operations per second (a database UPDATE is involved in every operation, among other things). Anyway, I went on to develop a full implementation and then ran a full test of about 100,000 operations. Instead of taking the expected half hour or so (100,000 operations / 50 per second ≈ 33 minutes), the run took about 7 hours. I was getting about 4 operations per second of throughput, which was obviously a huge disappointment. The pseudocode I wrote originally looked something like this:
Connection c = DriverManager.getConnection(cs);
String q = "UPDATE mytable SET x = 1 WHERE id = ?";
PreparedStatement p = c.prepareStatement(q);
for (int i = 0; i < ids.length; i++) {   // ids holds the numeric primary key values
    p.setLong(1, ids[i]);
    p.executeUpdate();
}
and it worked well. I made a single change during development: instead of using the ‘id’ column of the database table (a numeric 9 byte primary key, and thus the clustered index for the table), I used a 13 byte varchar column as the identifier, which had a nonclustered index. My code looked like this:
Connection c = DriverManager.getConnection(cs);
String q = "UPDATE mytable SET x = 1 WHERE y = ?";
PreparedStatement p = c.prepareStatement(q);
for (int i = 0; i < somearray.length; i++) {   // somearray holds the 13 byte varchar identifiers
    p.setString(1, somearray[i]);
    p.executeUpdate();
}
The nonclustered index performed just as well as the clustered index: in my testing, an UPDATE statement using the varchar column as the constraint ran just as fast as one using the primary key / clustered index. That makes sense, because an index seek (which I learned about in my database design class this semester) on a 9 byte / 72 bit numeric value (I used a precision of 19 digits) should perform about the same as an index seek on a 13 byte / 104 bit varchar column. So then I executed the finished program (not the test) and brought up SQL Profiler (a tool that ships with SQL Server that can debug, troubleshoot, monitor, and measure your application’s SQL statements and stored procedures). It quickly became clear what the problem was. Here’s the SQL created by the prepareStatement() method:
create proc #jtds000001 @P0 varchar(4000) as UPDATE mytable SET x = 1 WHERE y = @P0
and then the SQL created by the executeUpdate() method:
exec #jtds000001 N'005QDUKS1MG8K'
See the problem? The JTDS driver turned my 13 byte varchar parameter into a 4000 byte varchar parameter (the maximum size for the type) and prefixed the value with ‘N’, which marks it as a Unicode data type. Since the Unicode parameter no longer matches the non-Unicode ‘y’ column, SQL Server has to convert each value before comparing, so the query processor abandoned the index seek on ‘y’ and did an index scan instead.
Here’s where it gets fun. Microsoft SQL Server uses a B-tree index structure (also on Wikipedia), which is similar to a B+tree except that search key values can only appear once in the tree. Objects are stored in SQL Server as collections of 8KB pages, and (because of the class I’ve been taking) I now know that you can compute the approximate number of disk I/Os for an index seek as:
log_{n/2}(k)
where n is the number of keys per node and k is the number of search keys. So with one million search keys and 8KB pages in SQL Server, an index on a 13 byte key would give a tree with about 615 keys per node (~8000 / 13 ≈ 615). Thus the index seek in my system was costing about log_{615/2}(1000000) ≈ 2.4 node accesses (one node access ~= one disk I/O). An index scan, on the other hand, has to walk the leaf pages: 1,000,000 keys at ~615 keys per 8KB page is about 1,626 pages, and on average we’ll find the value about halfway through, so ~813 node accesses. That is dramatically more I/O and obviously the cause of the problem.
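Spelled out, the seek estimate is just a change of base, and the scan estimate is simple division (a back-of-the-envelope sketch under the same assumptions: half-full nodes, one node access per disk I/O):

\log_{615/2}(10^6) = \frac{\ln 10^6}{\ln 307.5} \approx \frac{13.82}{5.73} \approx 2.4 \quad \text{I/Os per seek}

\frac{10^6 \text{ keys}}{615 \text{ keys/page}} \approx 1626 \text{ pages}, \qquad \frac{1626}{2} \approx 813 \quad \text{I/Os per scan, on average}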
Moral of the story: watch out for char / varchar constraint parameters when using JTDS and a PreparedStatement. Also, indexes are A Good Thing™.
Updated 12/04/2005: Brian Heineman (one of the maintainers of the JTDS project) points out that this is a feature, not a bug. He also notes that you can work around the issue by appending:
sendStringParametersAsUnicode=false;
to your database connection string (I tested it out and it works just as advertised). Since the real issue is that JTDS can’t tell whether the String instance I’m sending is Unicode or not, and so defaults to a Unicode string, the other workaround is to use the setBytes() method of the PreparedStatement and send the byte[] representation of the String. From my example above:
p.setBytes(1, somearray[i].getBytes());
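Put together, here’s a rough sketch of the loop with the connection string workaround applied (the server and database names are placeholders; the property syntax follows the JTDS URL format):

// localhost/mydb are placeholder names; the appended property tells JTDS
// to send String parameters as varchar instead of Unicode nvarchar
String cs = "jdbc:jtds:sqlserver://localhost/mydb;sendStringParametersAsUnicode=false";
Connection c = DriverManager.getConnection(cs);
PreparedStatement p = c.prepareStatement("UPDATE mytable SET x = 1 WHERE y = ?");
for (int i = 0; i < somearray.length; i++) {
    p.setString(1, somearray[i]);   // now sent as varchar, so the seek on 'y' is preserved
    p.executeUpdate();
}

The trade-off is that the property applies to every String parameter on that connection, which would be wrong for columns that really do hold Unicode data; the setBytes() approach works statement by statement instead.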