Really deep linking: Url + regex
secretGeek .:dot Nuts about dot Net:.
home .: about .: sign up .: sitemap .: secretGeek RSS

Really deep linking: Url + regex

Here's today's boil the ocean scheme idea — and it's not a new idea by any means. I'm sure it has occurred to many people at many times, I've just never seen it written down. (links to further reading welcome)

Just say you want to bookmark a particular paragraph on a particular web document.

Perhaps you want to perform the electronic equivalent of using a highlighter pen to point out a particular fragment of a document in its original context.

You can give the uri of the page, but you can't give a specific link to the actual paragraph you are interested in.

(Sure, if the author of the document provided a named anchor tag for the paragraph then you are in luck. But only semantic web fanboys and egomaniacs go to this kind of extreme)

What I'm thinking is that, instead, you could provide a little extra bit of regex-goodness in the url. Say for example you said:

href='http://secretGeek.net/regexUri.asp --highlight "Say for example you said:"'

...then this could act as a pair of instructions to the browser: the first one is "get from 'http://secretGeek.net/regexUri.asp' and the second instruction is "highlight the text that matches this regex: "Say for example you said:" (excluding the quotes). The instructions are separated by spaces, qualified by quotes and so on. Click on the link and the page is shown, with the relevant text highlighted.

Okay — i'm thinking four different things at once here:

  1. Why stop at highlight? what other commands could the new commandline accept?
  2. "Major security flaws waiting to happen"
  3. Some people, when confronted with a problem, think "I know, I'll use regular expressions." (etc...)
  4. Subtle differences in implementations between browsers.... NO!!!1!

And of course, as always, I'm hearing the voice of some strange ever-present character in my head (who sounds a lot like comic-book-store-guy from the simpsons) and he alternates between:

  • You can already do that in firefox with addins/greasemonkey, idiot. And
  • 'course, there'a an emacs command to do that

Thoughts? Additions? Subtractions? Jakob?





'Joseph Cooney' on Wed, 08 Jul 2009 09:17:49 GMT, sez:

You could, of course implement it unilaterally on your site in javascript. When it saw that kind of URL it could run a regex over the content of the page and do the highlighting of any matching elements.

Of course creating the URL in the first place would rely on end users learning regular expressions. While Jeff Friedl would probably love this to come to pass it doesn't seem likely to happen in the short to medium term.



'Dave' on Wed, 08 Jul 2009 13:18:28 GMT, sez:

A few random comments:

Since the space character has to be Url-encoded in links, you're relying on all browsers correctly splitting the Url and the "highlight" before making any requests - which is not backwards compatible with older browsers. And would result in a bunch of "href='http://secretGeek.net/regexUri.asp+--highlight+%22Say+for+example+you+said%3A%22%27"" requests and related server 404 errors.

I would propose re-using the existing fragment separator ('#') and making an additional constraint that the "highlight" is Url-encoded. That way, older browsers will just ignore it, while newer browsers can add the highlighting logic.

Of course, then we would have to add some way for those newer browsers to know what's a regular fragment and what is the new "highlight" fragment. Maybe it's treated as a regular fragment if it exists in the document, and as a highlight if the string doesn't match any element id's?



'shulmang' on Wed, 08 Jul 2009 14:17:04 GMT, sez:

while it's always nice to puzzle over a regex now and then, perhaps XPath would be good here? However, with XPath you would be specifying the markup rather than the content which may be less enduring.



'mat roberts' on Wed, 08 Jul 2009 14:53:37 GMT, sez:

I saw a related idea for permanent url linking on the horror a while ago http://www.codinghorror.com/blog/archives/000068.html
Not quite the same kind of thing your proposing...but



'Tom' on Wed, 08 Jul 2009 19:04:05 GMT, sez:

Adobe has actually had this built into their browser plugin for a little bit: http://partners.adobe.com/public/developer/en/acrobat/PDFOpenParameters.pdf

Of course, it's not always reliable and gets tripped up sometimes.



'Thom Lawrence' on Thu, 09 Jul 2009 04:33:54 GMT, sez:

There was/is an attempt to create a standard for this based on XPath:

http://www.w3.org/TR/WD-xptr



'David Andersson' on Fri, 07 Aug 2009 04:27:39 GMT, sez:

I've thought of this idea for a long time and think that properly executed it can be a hit. Myself though, I have still to figure out a good solution for it. But I often find I want to e-mail/message snippets to friends and colleagues with a highlight on the page.



'Textmarker' on Wed, 02 Sep 2009 13:56:57 GMT, sez:

Done! http://beta.marker.to/



'lb' on Wed, 02 Sep 2009 19:11:25 GMT, sez:

@TextMarker
interesting stuff.
It worked ok for me, though it didn't load any style sheets (which use relative urls) and relative links were also broke.
Plus, not seeing the original page's url anywhere was a bit disconcerting.
see http://beta.marker.to/25463
(I'd like it if there was an 'about' link at beta.marker.to ;-) I want to see who the geniuses were behind it)




name


website (optional)


enter the word:
 

comment (HTML not allowed)


All viewpoints welcome. But the right to delete any post for any reason is reserved. Don't make me do it. Comments may be republished, emailed to your loved ones or printed and used as toilet paper. Who reads this legal bit anyhow?

TimeSnapper is a life analysis system that stores and plays-back your computer use. It makes timesheet recording a breeze, helps you recover lost work and shows you how to sharpen your act.

TimeSnapper won last year's Developer Competition at Larkware.com, and is used by over 10,000 people.

Articles

What then, is b? What then, is b?
SQLike: A simple editor SQLike: A simple editor
Yet Another BizPlan Generator. Yet Another BizPlan Generator.
HOT GUIDS: A hot or not site for guids HOT GUIDS: A hot or not site for guids
How does life get better? One tiny hack at a time. How does life get better? One tiny hack at a time.
24 things to do, and 100 things *not* to do (yet) for building a MicroISV 24 things to do, and 100 things *not* to do (yet) for building a MicroISV
Venture capital won't kill Jeff Atwood, it will only make him Jeffer. Venture capital won't kill Jeff Atwood, it will only make him Jeffer.
A handy workflow image for newbie mercurial users A handy workflow image for newbie mercurial users
Fractal Feedback, a diversion into recreational programming Fractal Feedback, a diversion into recreational programming
Hump-Jumping: How the Education of Computer Science can be Saved, err, maybe. Hump-Jumping: How the Education of Computer Science can be Saved, err, maybe.
Suggested User Experience Improvements for DiffMerge Suggested User Experience Improvements for DiffMerge
SQL Style Extensions for C# SQL Style Extensions for C#
The Movie Hollywood (And My Wife) Doesn't Want You To See: Weekend at Jacko's The Movie Hollywood (And My Wife) Doesn't Want You To See: Weekend at Jacko's
Sysi: the ultimate administrators toolkit Sysi: the ultimate administrators toolkit

Archives .: secretGeek :: Complete Archives
TimeSnapper -- Automated Screenshot Journal TimeSnapper.com    
Version 3.3: true productivity boost

Next Action NextAction
Managing the top of your mind

World's Simplest Code Generator (html edition) World's Simplest Code Generator

25 steps for building a Micro-ISV 25 steps for building a Micro-ISV
3 minute guides -- babysteps in new technologies: powershell, JSON, watir, F# 3 Minute Guide Series
Universal Troubleshooting checklist Universal Troubleshooting Checklist
Top 10 SecretGeek articles Top 10 SecretGeek articles
ShinyPower (help with Powershell) ShinyPower
Now at CodePlex

Realtime CSS Editor, in a browser RealTime Online CSS Editor
Gradient Maker -- a tool for making background images that blend from one colour to another. Forget photoshop, this is the bomb. Gradient Maker


[powered by Google] 


How to be depressed How to be depressed
You are not inadequate.



Recommended Reading

The Best Software Writing I
The Business Of Software (Eric Sink)

Recommended blogs

Jeff Atwood
Joseph Cooney
Phil Haack
Scott Hanselman
Julia Lerman
Rhys Parry
Joel Pobar
OJ Reeves
Eric Sink

Aggregated Links

proggit
dzone
hacker news
dot net kicks

Human Link Machines

interesting finds
a continuous learner's weblog
arjan's world
weekly link post

LinkedIn profile
LogEnvy - event logs made sexy
ShuffleText - fuzzy search for .net
PC Smart Buys - Computer Hardware in Australia
 
home .: about .: sign up .: sitemap .: secretGeek RSS .: © Leon Bambrick 2006 .: privacy

home .: about .: sign up .: sitemap .: RSS .: © Leon Bambrick 2006 .: privacy