Tuesday, 28 October 2008

RDFa - so, WTF?!

As I write, I'm in my hotel room at the International Semantic Web Conference 2008 in Karlruhe. We're only half way through, but its already been a thought provoking and eye opening two days. One of the big topics I've been bottoming out is what exactly RDFa is, what it can do for us/you, and what are its problems. The thoughts in my head were all started with an excellent tutorial on Sunday by Michael Hausenblas. The answers to all three aren't easy. So I thought I'd try and make sense of it - for my benefit and yours - in this blog post.

Adding Semantic Meaning to Pages

We all want to get more from our web pages. As documents, they're pretty good, but we want to extract more meaning from them. This would allow loads more interactivity and interlinking than is currently possible. For example, one-click adding of info to address books and calendars, showing more in-depth information alongside articles, and automatic related linking. And these are the really simple basic ambitions.

Good web developers should already be using semantic HTML - where as far as possible the use of tags matches the real structure of the data you're representing. But this isn't enough. We can see its a list, but a list of what?!

So, there are two core technologies which are both attempting to solve this problem. The first is Microformats, the second is RDFa.

Microformats make use of existing HTML tags and attributes to assign extra meaning to HTML documents. This generally means adding special class names to class attributes to allow a Microformat parser to understand the content of the tags. There are several common Microformats for marking up data such as calendar events, contact information, geographical information, social relationships, copyright information, reviews etc.

Yahoo (via Peter Mika of Yahoo SearchMonkey) says that around 2% of pages on the web contain Microformats. That's pretty good for an emerging technology. They even have a nice logo - one of which adorns the lid of my laptop. But Microformats have their issues.

Firstly, theres a fairly limited (although useful) set of things which you can markup with Microformats. If you want to markup something new, you have to suggest a new proposal, and have it approved and ratified by the Microformats community, before having the parsers implement it - often slighly differently.

Secondly, Microformats have some minor accessibiliy issues. They make use of something they call the abbr design pattern, where the title attribute of an abbr tag is often used to convey non-human readable date time information. This can be read out by some screenreaders, appear in tool tips, and generally misuses the abbr tag. Not ideal.

Another Solution?

So how to get get round these issues? Another solution is RDFa. RDFa is "RDF in Attributes". As the name suggests RDFa adds more attributes to XHTML, and these attributes are designed perfectly to hold real, proper RDF data. Parsers then have a really simple job to distil pure RDF straight out of your otherwise beautiful XHTML document.

As RDFa is built on the foundations of RDF, you can use any RDF ontology in your document, and if theres not one which suits, you can make your own. Its naturally extensible without having to ask the permission of a central community, and hope parser makers follow your reccomendation.

Hang on Hang on! RDF..what now?

To understand RDFa, you need to know what RDF is. I'll over-simplify deliberately here, but RDF is most commonly seen as an XML format which is used to describe relationships between things, or add properties to things. RDF documents most often look like XML, and contain what are called Triples. A triple is a set of three statements that tell a little story.

  • Dave likes Cats,
  • Simon's nickname is "Si",
  • Chris Martin is married to Gwyneth Paltrow,
  • U2 released the album Achtung Baby
Sounds pretty simple eh? Well it kinda is. These three statements in a Triple are called the Subject (what's it about), Predicate (the kind of data we're adding to the Subject) and the Object (the actual data). Ideally Subjects will always be URIs, preferably URLs which means they're addressable on the web. Predicates should also be URIs. This is really important. Predicates are actually defined in Ontologies. This means for every RDF Triple, we have a deep, unambiguous meaning for the relationship described in it. Finally, the Object is the last part. It can be a string literal - i.e. "Si" in the Triple "Simon's nickname is Si" - or it can also be a URI - so in the Triple "Chris Martin is married to Gwyneth Paltrow", both Chris Martin and Gwyneth Paltrow should be represented as URIs - perhaps their Wikipedia entries.

Right, that's enough RDF theory. If you're interested it it, go read more on the web.

Back to RDFa

To recap, RDFa lets you embed real RDF directly inside XHTML documents in new and extended attributes. But, wait a sec - you can't just make up new attributes, or move attributes onto tags which can't support them. Correct. That's why RDFa can currently only be used in XHTML 1.1 documents. For a page which contains RDFa to validate it has to use the XHTML 1.1 doctype. This new doctype...

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">

is the clever bit that means we can use the 'resource', 'property' and 'about' attributes on more tags than was possible before. The second part is that because we're in XHTML, we can add XHTML namespaces to our document. Just like in pure RDF (i.e. RDF/XML), this is where we import the Ontologies we want to use to describe the things in our document.



I don't want to go into too much detail of RDFa syntax here - there are plenty of examples on the web - I'll just point to the best ones. You can learn RDFa for yourself.
Good. Lots of bedtime reading for you to do! Seriously the best way to get to grips with RDFa is to have a go at marking up a document yourself. Its not that hard once you get used to the syntax, and learn the common RDF predicates.

Instead of teaching the world RDFa, I want to concentrate now on the practicalities of RDFa. testing, validation and data extraction.

Serving and validating RDFa

Let's deal with the utopian ideals first. If you're going to put RDFa into your document, it should be served by your server with the mime type "application/xhtml+xml". Also, you should use the XML+RDFa doctype mentioned earlier. You should also ensure you put a version attribute of "XHTML+RDFa 1.0" on your root html node.

Ok ok ok. If you do all those things, your page with RDFa in it will validate. But we all know that serving pages with 'application/xhtml+xml' causes issues with older browsers. Is this an issue? Well, according to Michael Hausenblas, RDFa interpreters are beginning to understand this, and should now attempt to parse documents containing RDFa even if the mime-type smells a bit wrong - 'text/html' for example. This should mean that you can just start embedding RDFa into your page, serving it under an old mime-type to keep IE6 and below happy, while still allowing people to distill juicy semantic goodness from your documents. However, if you're a bit cleverer, you could use content negotiation to serve your page to older browsers using the 'text/html' mime-type, and 'application/xhtml+xml' for user agents like shiny new browsers which can support it'

The mime-type is one thing, but what about that crazy ass new doctype! Well, firstly you'll notice it's XHTML 1.1. Since we're at v1.1, theres no such thing as strict, transitional and frameset. Its all strict baby. This means if you want to do RDFa properly, all your documents need to be properly XHTML 1.1 complient. Thats not such a big ask if you've been doing your job properly for the last two years or so.

Well, once again, all is not lost. It depends on wether or not you care about validation. The only way to validate a page with RDFa in it, is to use the correct, new doctype. But standards complient browsers should just ignore any attributes they find which they don't understand, and just get on with things. So, the theory is, that you can just go ahead and stick RDFa into your pages, and nothing will break - visually at least. The only thing that will stop working is validation.

Certainly I've tried RDFa in a HTML4.01 Strict document, and while it no longer validated (" there is no attribute 'ABOUT'" etc), it still displayed perfectly in Firefox 3 (as you'd hope) and the Operator plugin manages to get all that juicy data without a hitch.

So again, the theory is, just use RDFa loosely, and if you don't care about it causing errors in your documents, you'll be fine. Michael Hausenblas again says he speak to many organisations who's pages have 200 errors, so another 30 don't really matter.

I disagree with this. I think validation is important. I also think that if you get used to having errors due to RDFa then you'll be blind to the errors you really ought to fix. However, with bleeding-edge technologies like this, you have to be pragmatic. In this case I think it's probably such a benefit to have RDFa, that validation errors should be stomached for now. HOWEVER, if possible you should provide a programmatic way to remove all the RDFa statements such that the page will otherwise validate.

I'm going to try and do this in my apps using helpers in my MVC Views which will output nothing if a no-rdfa flag is set, perhaps in the query string, but otherwise will put RDFa into the page. That way, you can have a core page which validates against your older but widely supported doctype, and you only break it with the RDFa bits. Just make sure all your links to validators have the ?rdfa=false flag set! This is untries and untested in practice, but seems like an acceptable solution to allow us to begin using RDFa in older documents now.

Developing and Testing RDFa

Its one thing to talk about it, and do some demos, but what about developing in the real world. Well, its kinda easy with RDFa, just like it is with RDF!

First, get yourself Firefox and the Operator plugin. This baby will light up when it finds microformats or RDFa in the page, and allow you to inspect it. Think of it as Firebug for Semantic Data.

Then, start using the W3C's Validator. If you're using RDFa in an old doctype, you can force it to use the new XHTML + RDFa doctype, which at least will show up any errors in your RDF syntax, if not the extracted meaning.

Finally, you'll be wanting the W3C's RDFs distiller. This will parse your document, and extract pure RDF/XML from it. Its brilliant, and will show you the power of the monster you've created. Forget Microformats and GRDDL and creating XSL for every Microformat you use, as we're building on the extensability and structure of RDF, the parser has all the info it needs to make full sense of your data - all on its own!

Reading and using RDFa

Now you've put all this in your document, you want to ensure you get it out. There are a number of parsers for most popular languages - but the most interesting for me is Javascript.

The key benefit of in-page semantic markup is that, yes, machines can read it - but that machines can read it for the direct benefit of the user. What I mean is that suddenly the page should come alive with calls-to-action, extra data and interactivity which just wasn't there before. Now only 20% of the world use Firefox - and very very few of them have Operator installed. MS IE8 will have Accelerators which use a modified set of Microformats to provide follow on actions for the user. This is a clear user benefit, and one we need to use RDFa to achive if its to have any real success.

My proposed solution is a javascript-based toolset for exposing and using semantically embedded data on a page. Basically the same as Operator, but in JS, cross-browser and with incredible user interface elements. Site owners could then include this in their page. When a page loads it would quietly interrogate the dom for Microformats and RDFa. When it found some, a button somewhere in the header might begin to glow. When clicked, it might either open a panel and show you all the data in one place, with relevent onward actions - OR it might make the semantically marked up parts of the page glow, and offer on-hover contextual user actions.

If we do this - suddenly theres a clear user benefit - and our bosses will suddenly take more interest in this Semantic data thing. As a side effect, machines across the world suddenly get access to all this juice data which has been hidden from them for so long.

What goes in RDFa and what doesn't?

In theory, RDFa allows you to take any RDF triple and embed it contextually in a page. But do we really want to do this?
  • It makes the page bigger - bad for mobile devices which are unlikely to want it
  • The more RDFa you use, the more likely your HTML structure might be compromised by bad markup
  • And isn't that what we have full-fat RDF for?!
On the latter point, YES! we still have full RDF. Also, unless your insane, you'll subscribe to the software principle of Don't Repeat Yourself (DRY). So there's a tension here. we don't want hardcore RDF in our html pages, and we don't want to repeat everything. What to do?

After talking a lot about this at ISWC2008, we think there are some guidelines to follow.
  1. You should use RDFa for simple things that a machine and a user might want to know
  2. You don't put complex RDF in RDFa - you still use RDF/XML for that.
  3. You SHOULD repeat yourself for the simple data you put in RDFa - DO replicate this in RDF/XML. We don't want machines to have to look in two places - but this is the only time you violate DRY
  4. Acceptable RDFa things are from ontologies like Dublin Core, FOAF, SIOC, and Geo - plus the simple bits from more complex Ontologies like Music, Programmes etc.
  5. Focus on things like document metadata (creator, relationships, document meaning etc) not on hardcore URI to URI mappings. Do that in pure RDF/XML.
Phew. Its too early for that to be a tested definitive list - but it feels right. RDFa is more useful when you think of it like a highly structured Microformat rathern than actual RDF.

Conclusions

RDFa is now an official W3C recommendation. This means its time to play. The biggest problem is its never been tried and tested. Its been used on the common browsers, but we're messing with HTML here, and how older devices and browsers might handle it is anyones guess. The biggest one being mobile phones and other portable devices.

Only one way to find out though - GET PLAYING! Start using really simple Dublin Core metadata in your page and blog to markup titles, descriptions, people, meanings tags etc - and see what happens - see if anyone complains, and try and fix the problems as they arise.

This is cool and new - but with RDFa and Microformats taking off (we hope) the practical web is going to be a much better place!

Sunday, 26 October 2008

Excellent preso on Webapp Security by Simon Willison

Web Security Horror Stories
View SlideShare presentation or Upload your own. (tags: security csrf)

Monday, 29 September 2008

CSS Systems

Here's an excellent presentation from Natalie Downe which she gave at BarCampLondon08 which I sadly had to miss.

I love the way she's thinking about really how to structure her CSS - its something some of us do, but not all of us, and not enough.


CSS Systems
View SlideShare presentation or Upload your own. (tags: barcamplondon5 html)

I'm in a Band!

Check us out! Then come see our gigs!

Wednesday, 3 September 2008

is exploring all the cool places he's going to visit in NZ on Google Earth. Its like being here. But more rubbish.

Tuesday, 5 August 2008

My preso to the BBC's Semantic Web interest group


Honeypot to Semantic Web interest group at the BBC

From: sicross, 1 day ago








SlideShare Link

Installing the Facebook Open Platform

Facebook is cool. It's a little past its best now (it may be seen as 2007's YoYo) but the technology underneath it is proven, scalable and solves many of the problems any site wanting to introduce some 'magic social dust' contains.

Now, Facebook have released their Facebook Open Platform. This looks interesting. Not sure what I can do with it yet, but its a nice thing to start messing around with - particularly the FBML and FBJS implementations - FBML especially as it's a lovely thing which we can use for all kinds of simple site layout things.

It wasn't totally easy getting it up and running, but here's my method which you're welcome to follow if you'd like.

Background

I'm running Apache on CentOS with PHP 5.1.6 already installed and working with Apache. In this tutorial, I'll assume you have:
  • Apache 2+
  • PHP 5+ working with Apache
  • CentOS
  • FTP/sFTP access to your server
  • command line access with sudo permissions where needed (for restarting Apache and editing httpd.conf for example)
  • MySQL 5+ working with PHP

Getting Started

  1. Grab the code from Facebook: http://developer.facebook.com/fbopen/
  2. Unzip
  3. FTP to your server and create a folder in your webroot like 'fbop'. This will be where your instace of the platform lives i.e http://www.example.com/fbop/
  4. Upload the contents of the 'html' folder in the unzipped fb-open-platform folder into your new 'fbop' folder on your web server. If you now go to http://www.example.com/fbop/fbopentest/fbml.php you'll see a ton of errors. That's okay, we'll fix them in a bit.
  5. Now, choose where you want your facebook open platform libraries to live. Sensibly, this will be away from your webroot, but with all your other php libraries. I've chosen /var/www/fblib/lib. Go ahead and create this folder and upload the contents 'lib' folder from your unzipped fb-open-platform folder.
You should now have the following files uploaded to your webserver in roughly the following ways.

  • /var/www/fblib/lib/
    • api/
    • common.php
    • core/
    • display/
    • .....
  • /var/www/html/fbop/
    • api/
    • canvas.php
    • common/
    • fbml/
    • fbopentest/
    • js/
So far so good. But we still have all those errors at http://www.example.com/fbop/fbopentest/fbml.php

Configuring Apache

Facebook have been a bit wierd and rather than telling PHP where to look for includes like most apps, they've set a custom Apache variable which PHP then uses to find some files. This isn't easy info to find out by Googling, but its very simple to implement.
  1. Open your httpd.conf file.
  2. At the bottom, or near your other non-standard config changes, add the line...
    SetEnv PHP_ROOT "/var/www/fblib"
  3. This sets a new variable constant in Apache called 'PHP_ROOT'. From within PHP this is now accessible using the $_SERVER["PHP_ROOT"] variable. The lack of this variable being present was that cause of all those errors we've been seeing.
  4. Restart Apache. For me on CentOS, this means running $sudo /sbin/service httpd restart
  5. You should now be able to test if this has worked by creating a new PHP file, and adding (inside you php braces, this line... print_r($_SERVER); . Opening that in a browser will list all the server variables available to PHP. Magically, one of them should be PHP_ROOT with the value you specified earlier. This is good.
Now, going to http://www.example.com/fbop/fbopentest/fbml.php will give us no errors! Sadly it won't do much else.

Configuring the database

Just like Facebook itself, running apps on the Platform requires the Platform to keep a list of all the applications running on it. In the Open Platform, this is held in a MySQL database. Helpfully, they provide you with enough data to get you started, you just have to load it into MySQL.
  1. Connect to MySQL using your favourite client - I'll use phpMyAdmin
  2. Create a user which which we'll give to the Platform so it can access the database
  3. In the SQL Tab, copy and paste the contents of the file /fbopen_data_dump where fb-open-platform is the archive you downloaded and unzipped from Facebook
  4. Run the Query
You should see a database called 'fbopentest' has been created with 12 tables. If you can't see this, the query may have errored. Have a look at your debug and see why. In my case, phpMyAdmin threw an error at the comments which had three dashes preceeding them '---' instead of the usual two '--'. Odd, but thats textual SQL for you. It never works first time. Changing those in the three lines in which it appeared got me up and running.

Now we have to configure the Platform to talk to the database.
  • On your web server, edit a file in the fb open platform lib directory - in our example, I mean this file: /var/www/fblib/lib/core/init.php
  • Change $DB_USERNAME to match the user you created. In my case it was 'fbopenplatform'
  • Change $DB_IP to match IP address of your database server. In most cases, this will just be 'localhost'
  • Change $DB_PASSWORD to match the user's password you created earlier. I'm not going to tell you what mine was ;-)
  • Save the file.
To test if this has been successful,

Hold up. You with me?

Before we go any further, lets test that the basics of the facebook open platform are up and running. In a browser, go to your http://www.example.com/fbop/fbopentest/ folder. You should see a nice list of test file we can use. At present, the most interesting one is http://www.example.com/fbop/fbopentest/fbml.php. You'll see that it loads, and you can see all the code if you view source, but its not parsing it into HTML from FBML. That's because we haven't installed the FBML parsing libraries yet!

This is a little more tricky for the novice as we're not messing about with PHP anymore - but proper unix and C++ stuff, but lets have a go. I would however, ensure you have a backup of your server or virtual machine. If this screws up, it has the capacity to ruin your install of a number of other things as Sean B discovered.

Installing the FBML Parser


Back soon to finish this off.

Sunday, 27 July 2008

How to Quit Twitteriffic

Twitterific rocks. Its simple, cool, and once you have Growl installed, its not as annoying. Still, sometimes it goes wrong and you need to quit it. This isn't obviously apparent as applications which sit in the menubar don't show up in the Force Quit menu (which you get by pressing Apple+Alt+Esc).

Normally you can quit Twitterific using the Quit Twitterific command in the 'spanner' or 'wrench' menu - i.e. Settings.

But, when Twitterific crashes, or any other app for that matter, you need the Activity Monitor. For those of us who are new to Mac OS X and come from Windows, thats the same as the Task Manager you can get by pressing Ctrl+Alt+Delete. In a Mac, its a standalone app. Search for it in Spotlight: "Activity Monitor".

There, you can find all the services running, including Twitterific, and close it there.

Friday, 25 April 2008

More on Grids

The more I read about this subject, the more interesting it becomes. Try this presentation by Khol Vlnh at SXSW Interactive last year. Well, that's why we use grids.

Then try Mark Boulton's post about Blueprint. Not sure I agree with all of this tho (as per my previous post).

And then theres this.... http://webpatterns.org/. Its not there yet, but looks very cool.

Wrestling with CSS Frameworks

Say you're working on a project where you need to standardise your entire layout to a grid system, and you want your entire organisation to conform to this in a single way in code? Not easy. Everyone does their layout differently, but it looks the same! Why do this, why not code the same way to make things look the same. The best way, the right way.

Granted this doesn't work for decor, typography etc, but for layout it does. This is where some CSS Frameworks come it. The main ones being YUI Grids and Blueprint CSS.

On the face of it, they looks great - especially Yahoo's Grids Builder interface. Build a page in 8 clicks, copy paste the code. Done. How easy was that!

BUT. The world is never so rosy. What about semantics? Even back in 2006, Elliot Swan talked about how un-semantic the markup is when produced using these frameworks. The principle objection is that you have to have these ugly class names like "span-6 append-3 last" all over your beautiful document structure which relate not to the type of content, but purely to the layout. Not Cool.

Or is it? Here comes pragmatism v.s. purity again. In an ideal world, what we need in CSS is class-based inheritance. So you can define a class like "menu", give it some decor code, and then tell it to inherit another class like 'span-6' which makes it fit your grid. How cool would that be. You could program your CSS just like an app. You'd never have to repeat yourself in CSS declarations, your markup could be clean, single-class-per-element, and semantic.

However, the world isn't like that. The only way to get an element to work like this it to give multiple classnames to an element -- OR -- manually build your site's css to match the grid based on your semantic classnames - meaning that just like we do now, we have to re-define the grid every time.

I'm not decided on this yet, but I'm leaning towards the pragmatism of ugly grid-based classnames ALONGSIDE nice semantic classnames too. Something like this... class="5cols menu" where '5cols' is your grid CSS, and 'menu' is obviously your semantic one.

Why do I think that? Well, layout is already implied by the position of the content in the DOM - or it should be in 90% of cases. This means that layout CSS, specifically grid and only grid layout CSS is already implied in the DOM, so why do you have to be semantic with it? Other CSS stuff like microlayouts (borders, padding), decor and typography CSS for example DO belong under semantic classname definitions.

Its a delecate balance between purity, and a pragmatic 'lets get things done' attitude. For the latter, CSS Grid frameworks make a lot of sense. However, when used carefully, they can live alongside pure semantic markup.

At least this is the best we can do until css inheritance comes along -- which'll be never.

Want another opinion? Have a read of Nate Koechley.

Friday, 18 April 2008

CakePHP Tree Behaviour

Its often really useful to be able to represent data in a tree, or self-referential way - like product categories, music genres, organisational structures etc. Often, that takes quite a bit of code. However thanks to CakePHP, we can do it in far less lines of code, thanks to a behavior var $actsAs = array('Tree');.

First create a db table as normal with an id and title. Then add three special fields: 'parent_id', 'lft' and 'rght' - all Integers. Cake uses these to manage the hierarchical relationships between each row, and it does this all behind the scenes. Next, add the behaviour tag to your model. My example is for Genres. and its as easy as this... in genre.php...


var $actsAs = array('Tree');


Some of the clever stuff happens in the Controller to make nice URL's so lets look at that now.

What I want is a URL structure like this www.example.com/genres/genre1/genre2/genre3 etc where genre 3 is the child of genre 2 which is the child of genre 1 etc. To get this behaviour, we first add a line to our app/config/routes.php file.


Router::connect('/genres/*', array('controller' => 'genres', 'action' => 'index'));


Its a very simple line which routes any sub URL to the index action of the genre controller. Then we can do all the goodness there. So, in our genres controller, we need to get to work building a clever index action.

First, we get the current URL, split it into an array on every slash using explode. If there are no other URL paths, we assume we're at the route and list all top level genres accordingly. If there are other paths, then we can lookup current genre (the last element in the array) from the database.


function index(){
$url = $this->params['url']['url'];
$genreUrl = explode('/',$url);
if (count($genreUrl)>1){
$genreCurrent = $genreUrl[count($genreUrl)-1];
if($genre = $this->Genre->findByTitle($genreCurrent)){
pr($genre);
}else{
$this->Session->setFlash('There such genre as ' . $genreCurrent);
$this->redirect(array('action' => 'index'));
}
}else{
$genres = $this->Genre->findAll("parent_id IS NULL");
pr($genres);
}
}


There are some problems with this so far. It only works if all your genre names are unique, and worse, allows you to specify any path (like www.example.com/genres/blah/this/is/wrong/validGenreName) where validGenreName is valid at any level, but the path to it does not reflect it's true higherachy. To fix this, we're now going to parse every genre given in the URL, and check to see if the one before it, and the one after it in the DB match those in the URL. This way we can ensure the URL is a true reflection of the database structure.

I've got to go now, but I'll come back and finish this post when I get another chance.

There's more info on Tree Behaviour in the CakePHP 1.2 manual.

Text to Speech

Now this is incredible.

http://www.cereproc.com/demo.html

And this... https://www.cepstral.com/demos/

Text to speech has come on a long way from the Stephen Hawking-esq stuff of the 80's.

Wanna have a real play? Make George W Bush say some dodgy stuff in this demo.

Thursday, 10 April 2008

HABTM Relationship building in CakePHP

Yeah, I know its not as good as rails, but i'm trying to get to grips with it anyway: CakePHP. I'm gonna start using this blog as a way of keeping a note of things I learn as I build my first apps - mainly for my own benefit, but also to help the active and very friendly CakePHP community. Documentation for 1.2 is currently shocking, so it takes a lot of trawling to get what you need - so I need my own reference.

That said, heres my first nugget of gold.

Has And Belongs To Many (HABTM) relationships are great, but manipulating those relationships is hard. If you want to create a new object, and save relationships to it a the moment of creation, I found it hard to work out how to do both at the same time. Turns out its all about how you push data into $this->data and then pass this to be saved.

I needed to create a new Bulletin object, tie the current logged in user to it, and fill it with 5 (currently random) news stories which already exist. First I found the user_id of the current user and added that relationship to the Bulletin object using...

$this->Bulletin->create();
$user = $this->Auth->user('id');
$this->data->Bulletin->user_id = $user;

Then I got an array of the stories I wanted to attach (currently 5 random ones). Then, heres the tricky part, I had to extract their arrays, as the data findAll() returns can't be shoved into $this->data. Instead, $this->data wants an array of IDs which it can save. To do this, I did it messily....

$stories=array();
$this->Bulletin->Story->recursive = 0;
$stories = $this->Bulletin->Story->findAll(null, null, 'RAND()', 5);
$story_ids = array();
foreach($stories as $story){
$story_ids[] = $story["Story"]['id'];
}
$this->data->Story->Story = $story_ids;

Now that $this->data contains everything we want, we can just save it as normal by passing $this->data to the $this->save() method.

if($this->Bulletin->save($this->data)){
$this->Session->setFlash('A new bulletin has been generated for you');
$this->redirect('/bulletins/view/'.$this->Bulletin->id);
}else{
$this->Session->setFlash("There was an error building your bulletin");
$this->redirect('/bulletins');
}

Voila. This works nice. I hit a url like /bulletin/generate, and the system generates a random Bulletin object for me, and redirects me to its view page. Its a bit hacky tho, so any suggestions would be greatfully received.

Friday, 4 April 2008

Draggable EPG

So I've been playing with re-creating a google maps style draggable interface for time-based data - anything you can plot with time on the X axis. Looking around for data to try, I thought I'd give 'schedule' data a try.

The cool stuff here is the JS/CSS stuff I've built which allows you to drag around an infinite-size surface, and the JS which interprets the users interactions, and loads in new data accordingly. It seems nice, and has got some nice noises from the guys at the BBC - we're thinking of ways to use it, perhaps the Olympics data, perhaps Glastonbury stages etc - who knows.

I'll post more about how it works later, and its still a work in progress - the code is rubbish and needs improving, but its a start. Take a look... http://www.simoncross.com/epg

At Over the Air

So, today, I'm here... http://overtheair.org/blog/ Which is nice. I'm not staying overnight tho, thats for the proper geeks.

We're here with a bunch of people from the BBC including Jason Quinn, Fraser Pearce, Tristan Fearne, Matt Wood, Paul Clifford, Chris Yanda et al.

We're planning on going to a few sessions, taking a few pics, and doing some mobile web hacking. How much of each we'll see.

Its just about to get started, so I'll keep blogging when theres stuff to say.

Wednesday, 26 March 2008

GSV: Pan & Scan

Wanna make your own Google Maps style viewer for massive images? Checkout Pan & Scan. I'm currently trying to build a draggable electronic programme guide interface, and this has given me loads of ideas....

Principally that rather than relying on widths and floats, I need to absolutely position everything to make it work - this is because a) you can't reliably apply heights and widths to inline elements (and inline-block in css is not properly supported), b) floats will always wrap inside a div with overflow:hidden set and c) if you want an infinitely draggable surface, you need to go back in time as well as forward, and absolute positioning is the only way to position things at minus (-ve) positions i.e. earlier than the on-load view.

Thanks Michal Migurski!

Freesat is nearly here

So the TV aerial on my new house is rubbish. Watching Eastenders which freezes every 3 seconds is more annoying than watching the real thing. I invited a company to come round and take a look. Even thought I'd told them the aerial was on the 4th floor, they still turned up, told me it was 'too high' and that I now needed to pay them a £30 callout charge. If I really wanted them to fix it, it'll be £280 - and that I might consider Freesat from Sky. On commission I wonder?

I'd already beaten them to it on this one. Freesat from Sky just needs a sky box, dish, and a £20 card you can order from Sky, and that's it. I didn't have a sky dish, so I brought one off Ebay and fitted it myself. Seriously, it was easy. Thing is, Freesat from Sky is rubbish.

You only get a small subset of the freeview channels (no Dave, E4 or More4) plus lots of rubbish, and worst of all, you can't customise the EPG to only show the 10 channels you can get worth watching.

Help is nearly at hand. Aware that Freesat from Sky is rubbish, some broadcasters including the BBC and ITV have got together in a freeview-style consortium to develop their own free satellite platform called, you guessed it, Freesat.

Now Freesat looks really promising. Firstly, as their not ties to now-ancient BSkyB hardware, they can inist every box is up to date. The big news is that ALL boxes will be HD-compatible - so its going to be the quickest and cheapest way to get BBC, ITV and eventually C4 HD services. No Subscription. Very quickly, there will be boxes with Broadband connectivity, Hard Drives and more - and when you have the bandwidth of a satellite feed, storage of a hard drive, and back-channel of the Internet, you have the single best home entertainment platform on the market - its going to rock.

We've already been doing a lot of work at the BBC about what we can do on this platform, and we're really excited about it.

Now we have to sit tight until the Spring when it launches. I for one will be getting a Freesat box to replace my crappy Sky box - even though it'll have to sit next to my Virgin V+ box. For now that is.....

Google Earth Flight Simulator

So if Google Earth wasn't good enough, they then released Google Sky. But they got bored of that, and now, hidden away inside the latest version of Google Earth is an awesome flight simulator. No, seriously, they're just taking the p*** now.

Its well worth trying out. Download Google Earth, install it, and when its up and running hit Option + Apple + A on a mac, or Ctrl+Alt+A on a PC.

And let the magic begin....