Naidheachdan - News


Index
2020 January February March April May June July August September October November December
2019 January February March April May June July August September October November December
2018
January February March April May June July August September October November December
2014 January February March April May June July August September October November December
2013 January February March April May June July August September October November December
2012 January February March April May June July August September October November December
2011
January February March April May June July August September October November December
2010 January
February March April May June July August September October November December
2009 April May June July August September October November December


July 2020
Maybe a four year interval is the new normal for updating the news page! Lots has happened since the last update.

We’ve done a complete re-design of the way we store the linguistic data, making it much more compartmentalised. For example, we previously stored both GOC and non-GOC spellings in the same fields but we’re now starting to tease these apart, which means that eventually we should be able to modify the Hunspell spellchecker to do either GOC or continue to cover both GOC and traditional spellings. We’ve also started using standard Part-of-Speech tags developed by Edinburgh Uni - while that’s not visible to dictionary users as such, in future it might drive a grammar checker and suchlike. In the same vein, we’ve come up with some hyphenation rules and are also adding this data as we go along, so hopefully some time we’ll be able to expand the spellcheckers with hyphenation rules too. While we’re at it, we’ve also developed 27 subject categories and are looking to add something to the interface in future that will allow domain specific searches, for example if a users wants to specifically find legal terms only. In the wake of this, we’re also working on filtering out place names by default. Don’t worry, we’re not taking them out of the dictionary but you’ll have to tick a box to include them. There are almost 8,000 place names by now and there will be many more in the future and they’ve begun to clutter up the search results considerably (e.g. if you search for baile or dùn)

The maps now have got a few new pin colours too - one to mark data from Irish and Manx and another for linguistic data gleaned from place names (see for example geala, a word for a leech attested only in Dumfries place names). We include this data because in some cases, words given in dictionaries are from dialects that have been extinct for so long that only place name data can help clarify their region of origin. Similarly, including some limited data from Irish and Manx often nicely rounds out the map data, often providing the South-Western corner of the arc of peripheral dialects, like the word sioc.

What else? Apart from fixing a host of bugs, Will also made the layout more flexible so it works better when you resize the window, especially on mobile devices. And the pièce de résistance, he spent a lot of time on making the alphabetic search order much better. Previously we’d been relying on a system sorting algorithm which, well, it works for English. But from the Gaelic point of view it was really annoying because it would put spaces ahead of hyphens, which meant that if you did a search for cailleach, Cailleach Bheurra and cailleach bhuan would come in front of cailleach-baic because that’s the way English sorts stuff. But now if you choose to sort results alphabetically, it ignores hyphens and apostrophes, treats à/á like a and ignore caps:



May 2018
Crikey. You turn your head for a while and four years pass. It's been a complicated few years. Either way, time for a wee update:

We've added most of Dwelly's images to the respective entries. The reason some are missing is because we've got a display problem in the really short entries - the dictionary pushes the image into the abbreviated display when you search and since that's both ugly and not great (it would make mobile searches much more data-hungry for example), we've left out a couple of dozen. But none of them involve Dwelly's complex schematics like bàta or taigh, so the really important ones are all there. We've also started using some images in the new half of the dictionary, to begin with for cattle earmarks but we'll add more as time passes. Certainly for those which describe things which are hard to imagine if you've never seen them.

100,000 entries! Well, by now there's another 700 or so but back in April, our intrepid editor Susan Harris, a Gaelic learner from Portland, made the 100,000th entry in the dictionary. Eaglais Chàrnaich it happened to be. Before you start making comparisons with Dwelly (who has just under 78,000 entries), remember that a Dwelly entry is always a headword whereas in the Faclair Beag, an entry can also be an expression or sample sentence. It's a bit difficult to count headwords in the Faclair Beag but the number is somewhere between 50-60,000. Give or take. So not bad for a wee dictionary. In terms of votes (the pins on the maps), we're also approaching a milestone, not many missing before we hit 400,000. What I like most about them is that now when there's a debate about who uses which word, in quite a few cases you can now finally back up your claims with actual data beyond personal experience which - as we all know - is always limited. The one that came as a particular surprise to me was the fact that bùrn is not so much a Lewis word but Lewis just being the western end of an arc of bùrn-sayers that used to include Ross and down the eastern edge all the way to East Perthshire. Of course there aren't many speakers left in many of those dialects so perhaps today it IS just a Lewis word. But still.

The dictionary continues to feed other projects like the spellchecker - where we hope the next edition later this year should also sport hyphenation and perhaps even a small, experimental thesaurus based on the Faclair Beag data. Keep your ears peeled. Also in the mix are (I just realized I'd never mentioned this) word games for mobile phones such as Wordumz (a cross between Tetris and Scrabble) and Webfeud (think Scrabble with a slightly different board).

Last but not least you'll have noticed a blatant piece of advertising for Michael's latest venture - a Gaelic t-shirt shop. Well, technically he's been carrying the idea around for more than a decade and technically, there are more languages in the shop than just Gaelic. But it's mainly Gaelic. After the first design range, the word maps, one of Michael's old school friends who can do amazing things with a pen has joined the venture and we're working on a cartoon range and other design ranges are sure to follow. We hope it's not too distracting and we promise we won't go down the route of flashy adds for stuff you don't really need! Currently on sale online and at the Gaelic Bookshop.

March 2014
¿Qué pasa? Not a bad question. Quite a few things actually. The dictionary continues to grow and now has just over 60,000 entries. Not bad, even if we say so ourselves. As a result of this growth, there are of course knock-on effects. The most immediate is that the upcoming version of the Dearbhair Beag (version 2.8) will contain 877,365 forms. The single biggest collection of Gaelic words ever. And further down the line, this growth also fills out other word-based resources, such as the files for the Gaelic version of Scrabble3D - which incidentally now offers you games at three levels: basic (the 2000 most commonly used words), advanced (witht all words in the Faclair Beag) and crazy (with all of Dwelly’s words) - and a few other things, such as GCompris, a suite of educational games for children which also include word games.

A partial extract of our data is also helping two other great projects, CLILSTORE and Wordlink, improve their search functionalities with regard to other Gaelic dictionaries which don’t have an inbuilt lemmatizer (sorry, geek-speak, a tool that leads you from thaighean to the entry for taigh). So small as Gaelic may be, in terms of dictionary resources, we’re probably punching above our weight in some ways. None of the Irish dictionaries for example have a lemmatizer. So there :)

We’re also about to enter into a collaboration with the Gaelic Corpus project at Edinburgh University who are going to use out lexical database to help speed up the tagging of the corpus. So another nice side effect of the Faclair Beag.

In other news we are continuing with the cleanup of the dictionary and the to-do list is fortunately shrinking. Just this month I finally completed replacing all the placeholders which were inherited from the Faclair nan Gnàthasan-cainnte (things like [N] or [ADJ]) with full examples. Yes, there were indeed thousands of them. Hence the dram I’m having just now! Well, cup of darjeeling, but I might have a dram later on...

If you have any great ideas on what one could do with such a wordlist, do let us know, we don’t bite!

October 2013
So, what’s new I hear you asking? Not a massive amount. We’re slightly past 54,000 entries, which is nice and we’re looking forward to the 55,000 milestone. One of the things that have kept Michael busy this summer was the remake of the iGàidhlig website, a one-stop shop for Gaelic software. Anything from predictive texting to digital whiteboards, advice on how to type accents to installing LibreOffice or Microsoft Office in Gaelic. Apart from being a lot more graphic, the new site also is bilingual, which is nice if you want to point IT support at the site (since they often don’t speak Gaelic). Give it a go, it’s braw!

As we’re heading into the long, dark nights of the Scottish winter, why not put aside the sudoku and help us try and figure out some of these words which we can’t quite figure? Go on, you know you like a challenge!

Still having intermittent problems with the hosting, I’ll spare you the details but we are trying to resolve the issue. Bear with us!

April 2013
Under the 'hood

2012 was a pretty horrendous year in terms of workload, both for Will and me so apart from adding steadily to the dictionary (there are now just over 50,000 entries - entry 50,000 having been dìthean-caorach), there hasn't been a huge amount of stuff apart from the predictive texting and the maps. Ok, so maybe not so minor. Anyway, we're getting back into the swing and fixed some under-the-hood stuff that won't affect you as dictionary users on the whole but that will make life easier for the editor. One visible change is that the sound player now only appears when there actually is a sound file (that had been annoying many people, sorry about that).

We've got plans for a lot of stuff, like better fuzzy searches (the current system is just too Anglo-centric), an RSS feed, maybe a tooltip dictionary tool... we'll see, time and work permitting!

Minor Update: We're now also showing negative votes, that is, words where a votes has stated that they do not know a word in question. There aren't anywhere near as many of them as there are positive votes but it can still be useful to see that a word is not used by people or in a certain are. In the word druid for example.

November 2012

Guess what? Yup, we've used and abused the Faclair Beag again, this time to create a predictive texting tool for Gaelic. In a way it's a totally logical step and in a way slightly left-field so let me expand a little on the convoluted history.

There are two main strands to this. Anyone who has ever tried to text in Gaelic knows what a chore it is to do it letter by letter. It's slow at best, even slower if you're dìcheallach and try to put in the accents. And rather frustrating because, depending on your phone, the system will keep trying to "correct" your Gaelic to English so if you're now careful, tha very easily becomes the. So given the enduring popularity of texting, a language in a technologised country today cannot really afford not to have predictive texting. You'd think so, wouldn't you? Seems like arts projects are SO much more popular when it comes to funding... but anyway. So I've always kept an eye open for an opportunity.

One thing I did not want to do is fall into the trap that the Irish did some years ago with Téacs. It was a good idea in the sense that predictive texting was needed for Irish. But the mistake was to go with a custom built solution. Doing something like this means two things: 1) You tie yourself to a development process to ensure that new phones are properly supported, bugs are fixed and new features (if needed) developed and 2) You tie the end-user to your specific tool.

Is that so bad? Yes and no. The problem with Téacs was that it only offered Irish. A bit duh because we all know that one thing bilingual people do is switch between languages a lot. So what about English? And the second problem was that they didn't maintain the program properly which meant that it very soon became very out-of-date. So their list of supported phones in 2012 looks rather embarrassing.

So I knew that if this was going to work long-term, it would most likely mean joining a bigger project which was open to new languages. And in July last year my periodid trawling of the web for such a project finally threw up a result - a project called Adaptxt which had just gone Open Source.

This is where the other strand of the story comes in. At the most basic level, predictive texting just relies on a wordlist but many such tools, including Adaptxt, are much smarter than that and try to rank words according to how common they are (so you're not offered interregnum when you're looking to type internet). And in the case of Adaptxt, they also try to get smart about predicting the next word you're likely to type. In the case of Gaelic this means that if you type Bha, you're offered mi as a likely next word. Sure, Adaptxt is capable of learning but who wants to train their phone from scratch every time they buy a new one?

Back in 2009 I was in Dublin on a research trip courtesy of Bòrd na Gàidhlig to look into speech and language technology for Gaelic. It just so happened that a guy called Kevin Scannell (who is a professor of computing at the University of St Louis and into Irish and Irish software big time) was in town and we met up in the Club Chonradh na Gaeilge (highly recommended!). The evening's a bit of a blur but I walked away with loads of notes on things like "lexical databases" and "web crawlers" and whatnot. Shortly thereafter, Kevin successfully reeled me into the Firefox translation project and stuff just snowballed from there in terms of Gaelic software translation. Another spinoff was the Faclair Beag itself. Not so much the idea of it but how we did it. So instead of just doing a really primitive "list" based dictionary (e.g. with on one side and dog on the other), we got kinky and took onboard much of what Kevin had extolled. As a result, the Faclair Beag not only knows that cù = dog but also that is the nominative singular of a masculine noun, that chù is the lenited form, that coin is the genitive singular of a masculine noun... and so on. This means it's not only real smart when it comes to finding the right word if you put on something else that the citation form (i.e. if you put in conaibh instead of for example) but it also allows us to build lots of sexy stuff on the back of it.

Because marrying this with Adaptxt was beyond my ken of the sgoil-dhubh of programming, I got in touch with Kevin and asked if he'd be up for doing a joint project to create state of the art predictive texting for Irish and Scottish Gaelic - which he was! And in the spirit of Goidelic brotherhood, we also decided to do Manx Gaelic at the same time. So we took the Gaelic data from the Faclair Beag, Kevin then ranked each word using a massive text corpus he has (so for example, conaibh is out, but tha is right there at the top) and sent me that data. I then invested a certain amount of sweat, coffee and patience and to cut this long story short, here we are. So, interested? You can just search for "Adaptxt" in Google Play (the installation isn't hard) but if you want a step by step illustrated guide, here's one here I did earlier. An dòchas gun còrd e ribh!


October 2012
DRINK ME, EAT ME!
The Faclair has gone Lewis Carroll. All entries have been replaced with white rabbits. Nah, don't panic. Will has improved the use of the screen size by making the column width adjust to the size of your screen and/or window. So if you're on a tiny 11" monitor you probably won't see that much difference but say you're on a massive 70" monster, you'll find that most entries will display over three lines max, saving you a lot of vertical scrolling:

Alternatively, it also works if you resize your browser window, for example if you want to have it side by side with a text document you're working on. Like this for example:

Enjoy - and keep the ideas coming!



April 2012
Maps, glorious maps

For all those who have been wondering about why we're collecting all these votes, there's finally an answer! They feed our map tool. Like traditional dialect maps, these give you an indication of where words are used. Like these two (one of my all-time favourites):

 
Not all of them are quite that detailed yet but we're working on it!

So how do you view the maps? Easy, just search for a word and click on the blue underlined word, for example in the above case, feum and mand. The map will come up and display any votes and also a link to the Help page for the maps (which has more detailed info). Oh, and while most votes are in Scotland, there are some cropping up abroad, especially in Nova Scotia!

Mobile Version
There's now also a mobile version of the Faclair at www.faclair.com/m - same as the desktop version really but we've collapsed the Advanced Search feature and used a smaller logo to save space on screen. It's specifically for mobile phones but if you're on a slow connection, there's no reason why you couldn't use it on a desktop too but note that if you're a user with voting rights, you can't get that feature in the mobile version.

You can also put a link on your mobile phone's desktop now to get to it real quick. You need to do the following:
1) Android Phones
    a) Bookmark the page in the phone's default browser
    b) Go to your Bookmarks and press and hold and when the options come up, tell it to Add shortcut to Home. That's it.
   
2) iPhones
    a) Go to the page and tap the Bookmark icon on screen.
    b) When the menu comes up, press Add to Home Screen. That's it.
 



2011 September
A Gaelic Scrabble
Before you ask what Scrabble has to do with the Faclair - it's yet another one of those interesting uses you can put a database of words to. With a bit of tidying up (to remove names and other proper nouns), it's not that hard to build a dictionary file for something like Scrabble. Want to have a go?



2011 June
Am Faclair Beag on LearnGaelic
Well, who would have thought that? After lots of meetings and more draft documents flitting backwards and forwards through cyberspace, MG Alba have bought, that's right, bought, a license to use our dictionary data on their new LearnGaelic website.

Though Tahiti is still not an option, this is certainly a welcome development indeed!



2011 January
Check your spelling, sir?
One of the first spin-offs we've been working on is a selection of Gaelic spellcheckers. There's a couple out there already but, well, let's just say they're not being maintained well.

So, by using the database in the Faclair, we've been able to join up with an Open Source project called Hunspell and script druid called Kevin Scannell to create spellchecking tools which will work in Mozilla Firefox and Thunderbird, Opera and LibreOffice/OpenOffice. If you're using the Gaelic version of Firefox/Thunderbird and LibreOffice, the spellcheckers already come bundled with the software but if you're using the English version, you can get the Mozilla spellchecker here and the LibreOffice one here (also works in OpenOffice).

By co-operating with other projects in this way, we can ensure that both the software and the spellchecking dictionary will be maintained properly and regulary, which means:

Oh and they're all free of charge!



2010 August
The Faclair at Rannsachadh na Gàidhlig in Aberdeen
We kind of left it a bit late registering but ended up doing a paper nonetheless. Well.. I say paper. It was mostly a presentation really on the timeline of our two dictionaries, starting with the digitisation of Dwelly's, the birth of the Faclair Beag and the planned spin-off projects, such as spellcheckers and predictive texting and so on. Perhaps not high-brow academic as such but I feel it was a worthwhile paper nonetheless because it shows what you can do with a properly built lexical database - even a relatively simple one.

It was well received and one of the member of the audience made me laugh, he came up to me and said "Don't take this the wrong way - but only a German could have done this". He then explained that it was the clear sense of direction of the dictionary project, its execution and logical progression which had prompted him to make this amusing compliment. Ach well, the Gàidheileamailtich score again :)

If you want to see the presentation (it's in Gaelic), you can get the PDF here.



2009 April
A dictionary is born!
We always said that Dwelly-d would be just the start and so, mar a chanas iad, 's e gnìomh a dhearbhas - here you are. It's called the Faclair Beag because, well, it's kinda small still even though we have big plans for it. So bear with us for now if you find gaps - but other than that, we hope you find it useful in using or learning Gaelic!