Update on Performance Issues of PLoS Websites
Performance of the websites hosted on Topaz has increased over the last two weeks with a variety of patches ported to the production servers. We still have an outstanding memory problem that requires a restart of the Topaz applications three times a day (these restarts usually occur around midnight, 8am and 4pm with a duration of less than 10 minutes). I feel that we’re close to diagnosing the memory problem which is the last performance hurdle.
We’re waiting for the Topaz developers to finish two outstanding tickets before we create a new release candidate (RC 0.8.2.2 for those that are counting). This release candidate will contain all of the patches, even better caching features and XSL/CSS bug fixes. We’ll test the release candidate later this week and the production servers will run on this RC no later than Friday, March 28.
We also have a patch for search and will test the patch tomorrow. If testing goes well, then we will be able to place basic and advanced search back on the journal websites by the end of the week.
And now for something completely different…. We had a problem with one of our production servers Friday morning which caused an extended downtime. The Fedora server mysteriously stopped all network traffic and then rebooted. Russ and Josh are bringing the servers down for diagnostic tests and maintenance from 9-9:30pm PST. During this time, the journal websites will display the “undergoing site maintenance” page.
OOME is the acronym of the month. We’ve diagnosed the problems with the out of memory error. It turns out that many of the heavy lifting Java instances in the TOPAZ publishing application are session-scoped. What does this mean?!? For every user, a session is kept until a timeout period is reached, then it is assumed that the user is no longer visiting the site. The session is discarded and the Java garbage collection frees the held memory.
This was particularly problematic for the article XSL transformation of the XML to HTML. Each user was creating there own article XSL transform which was a) really really big and b) staying in memory for a really really long time. We’ve already changed the production settings so that sessions are kept for 10 minutes rather than a really really long time and this should have immediate impact on the garbage collection of the memory.
The developers are also changing each of the Java instances from session scope to request scope so they are only available as long as the requested object is available. We’re testing the article XSL and preliminary results look promising.
More to follow as testing continues….
When I try to submit a comment through
http://www.ploscompbiol.org/feedbackCreate.action
I get
”
Not Found
The requested URL /journals/compbiol/feedback.action was not found on this server.
”
Best,
Anders
Hi Anders,
Thanks for pointing out this problem. Turned out that the URL used in the feedback form was missing a slash. Russ has put a fix in place.
Rich
Just thought I’d mention, I got the following link (note the : with no port)… it obviously didn’t work…
http://www.plosone.org:/user/secure/editProfile.action?tabId=pre
Also, adding links in comments or notes of an article is broken when there’s a single quote in the url…
With no way to edit a comment, that means, you post, and there it is, broken… (don’t think there’s even a preview option for comments, is there?)
Thanks,
Steve
Hi Richard,
Is anyone working on getting CiteULike to work with PLoS Computational Biology?
CiteULike is a great tool. And supporting it would get PLoS compbiol papers more exposure.
Best,
Anders