Date: 18/01/2020 22:36:31
From: mollwollfumble
ID: 1487981
Subject: Newspaper archive search?

I love the Trove search of Australian newspapers. https://trove.nla.gov.au/newspaper/search?adv=y

Is there anything similar for other counties? (Any countries other than UK, don’t care about UK)
Where I can search multiple newspapers in multiple states simultaneously, for as far back as the earliest newspapers in that country.

Reply Quote

Date: 19/01/2020 00:09:18
From: SCIENCE
ID: 1487999
Subject: re: Newspaper archive search?

google has something like this

Reply Quote

Date: 19/01/2020 07:02:28
From: mollwollfumble
ID: 1488006
Subject: re: Newspaper archive search?

SCIENCE said:


google has something like this

This one? Ta, hadn’t seen it before.
https://news.google.com/newspapers

Try Nevada Daily Mail
I can browse by year. https://news.google.com/newspapers?nid=M3zsPnPgUlUC

I can search by topic. eg. https://www.google.com/advanced_search?q=indian+%22Nevada+Daily+Mail%22+site:news.google.com/newspapers

Search by topic doesn’t give results from Nevada Daily Mail, and I can’t limit it to any time.

OK, it looks like Google Newspapers gives images of newspapers, one newspaper at a time, but not searchable text. Good start, though.

Reply Quote

Date: 19/01/2020 07:07:06
From: Divine Angel
ID: 1488007
Subject: re: Newspaper archive search?

Trove is the best in the world. I know of no other countries with anything remotely similar.

Reply Quote

Date: 19/01/2020 07:19:23
From: mollwollfumble
ID: 1488010
Subject: re: Newspaper archive search?

Divine Angel said:


Trove is the best in the world. I know of no other countries with anything remotely similar.

If only, if only, they had a good image to text converter.

Reply Quote

Date: 19/01/2020 07:53:40
From: Tamb
ID: 1488011
Subject: re: Newspaper archive search?

mollwollfumble said:


Divine Angel said:

Trove is the best in the world. I know of no other countries with anything remotely similar.

If only, if only, they had a good image to text converter.


Morning all.
Trove may not have an image to text converter but Microsoft does. https://www.techwalla.com/articles/how-to-convert-an-image-to-text

Reply Quote

Date: 19/01/2020 08:47:19
From: The Rev Dodgson
ID: 1488013
Subject: re: Newspaper archive search?

mollwollfumble said:


(Any countries other than UK, don’t care about UK)

LOL

I’m the same with San Marino.

Reply Quote

Date: 19/01/2020 11:29:41
From: SCIENCE
ID: 1488092
Subject: re: Newspaper archive search?

Tamb said:


mollwollfumble said:

Divine Angel said:

Trove is the best in the world. I know of no other countries with anything remotely similar.

If only, if only, they had a good image to text converter.


Morning all.
Trove may not have an image to text converter but Microsoft does. https://www.techwalla.com/articles/how-to-convert-an-image-to-text

ain’t nothin’ new, this kind of thing’s been around for ages, some people think it sounds a bit rough and uncultivated but we reckon it’s nice and earthy

Reply Quote

Date: 19/01/2020 11:31:30
From: Tamb
ID: 1488093
Subject: re: Newspaper archive search?

SCIENCE said:


Tamb said:

mollwollfumble said:

If only, if only, they had a good image to text converter.


Morning all.
Trove may not have an image to text converter but Microsoft does. https://www.techwalla.com/articles/how-to-convert-an-image-to-text

ain’t nothin’ new, this kind of thing’s been around for ages, some people think it sounds a bit rough and uncultivated but we reckon it’s nice and earthy

It’s a bit kludgy but it works.

Reply Quote

Date: 19/01/2020 12:54:28
From: mollwollfumble
ID: 1488114
Subject: re: Newspaper archive search?

> https://www.techwalla.com/articles/how-to-convert-an-image-to-text

But does it work in Arabic?
Seriously though, I need to look into this because all the image to text converters I’ve seen so far are crap.
The ideal way is to pre-program the newspaper font into the converter – then ONLY use that font when reading the newspaper.

> https://news.google.com/newspapers

Since this only works for browse newspaper one by one, I’ve made a list of all (?) notable early newspapers. To make it onto the list, the newspaper has to be:

Reply Quote

Date: 19/01/2020 13:12:04
From: The Rev Dodgson
ID: 1488123
Subject: re: Newspaper archive search?

mollwollfumble said:


> https://www.techwalla.com/articles/how-to-convert-an-image-to-text

But does it work in Arabic?
Seriously though, I need to look into this because all the image to text converters I’ve seen so far are crap.
The ideal way is to pre-program the newspaper font into the converter – then ONLY use that font when reading the newspaper.

> https://news.google.com/newspapers

Since this only works for browse newspaper one by one, I’ve made a list of all (?) notable early newspapers. To make it onto the list, the newspaper has to be:

  • First published before 1900 (except for Nigeria and Kenya)
  • Published for at least 50 years
  • If in a foreign language, only one per country (except Japan)


What are all those damned UK papers doing on that list?

Reply Quote

Date: 19/01/2020 14:51:57
From: mollwollfumble
ID: 1488188
Subject: re: Newspaper archive search?

The Rev Dodgson said:


mollwollfumble said:

> https://www.techwalla.com/articles/how-to-convert-an-image-to-text

But does it work in Arabic?
Seriously though, I need to look into this because all the image to text converters I’ve seen so far are crap.
The ideal way is to pre-program the newspaper font into the converter – then ONLY use that font when reading the newspaper.

> https://news.google.com/newspapers

Since this only works for browse newspaper one by one, I’ve made a list of all (?) notable early newspapers. To make it onto the list, the newspaper has to be:

  • First published before 1900 (except for Nigeria and Kenya)
  • Published for at least 50 years
  • If in a foreign language, only one per country (except Japan)


What are all those damned UK papers doing on that list?

Well at least I left “The Times” off.

Reply Quote

Date: 19/01/2020 15:40:49
From: dv
ID: 1488209
Subject: re: Newspaper archive search?

Jesus H … there have been free image-to-text converters online since the last millennium

Reply Quote

Date: 19/01/2020 20:33:34
From: mollwollfumble
ID: 1488298
Subject: re: Newspaper archive search?

dv said:


Jesus H … there have been free image-to-text converters online since the last millennium

Yes, but they’re all “artificial intelligence” style image to text conversion. Which means, no control of fonts, and hence no friggin way to improve the output.

eg. The first newspaper article from a random search of Trove contains a passage which converts to text as:

The original is:

And even Blind Freddy can read that as: Fresh beef and mutton, 1s. 6d,; bacon 2s.; butter (salt) 1s. 4d.; fresh 2s. 6d.; kangaroo 1s.; pork (fresh) 1s. 6d.; salt ditto 6d.; cheese 2s.; coffee 1s. 6d.; etc.,

If, if ever you find even one image-to-text converter program that allows me to input my own font(s) then let me know immediately.

Reply Quote

Date: 19/01/2020 21:03:39
From: ruby
ID: 1488300
Subject: re: Newspaper archive search?

mollwollfumble said:


dv said:

Jesus H … there have been free image-to-text converters online since the last millennium

Yes, but they’re all “artificial intelligence” style image to text conversion. Which means, no control of fonts, and hence no friggin way to improve the output.

eg. The first newspaper article from a random search of Trove contains a passage which converts to text as:

  • Fresh berf and mutton, Is. Gd.; bacon 2s . ; Lult-r(#i!i) Is. 4 d .; fresh 2s. (id .; ian sron I s .; pn;k (fresh) Is. (Id.- salt ditto G d.; Vle-ese 2 s.; cufiec Is-Gd.; raisins Is.; dried fruits is, Gd.; rice fid .; potatoes 4 d ,; teu 7 s .; sugar is. 4 d .; flour 6 d ; soap Is. 6 d .; starch 2* ; gunpowder 4spvr

The original is:

And even Blind Freddy can read that as: Fresh beef and mutton, 1s. 6d,; bacon 2s.; butter (salt) 1s. 4d.; fresh 2s. 6d.; kangaroo 1s.; pork (fresh) 1s. 6d.; salt ditto 6d.; cheese 2s.; coffee 1s. 6d.; etc.,

If, if ever you find even one image-to-text converter program that allows me to input my own font(s) then let me know immediately.

Sounds like you could get involved with correcting stuff in Trove-
https://help.nla.gov.au/trove/using-trove/getting-to-know-us/trove-is

Image to text can do good things, but there’s the need for humans to get in too. I got involved with a project that had old ship logs with weather observations, they got volunteers to read the old entries which were in often beautiful writing, and to type out all the info so they could crunch the numbers. Beautifully preserved ships books are great, but you can’t feed them into the computer without a bit of help.

I was on Trove a few days ago, searching for new stuff on my grandparents. I’d found an article my grandfather wrote on his experiences at Poziers in WW1 a few months ago, to the delight of my family who had now known about it. I found another one from him written while on leave in Paris, which needs much more editing (I may go back and do it). Amazing to read him, over 100 years later. Found other stuff from both sides of the family, insights into times gone by, like long description of a wedding. There is fantastic stuff to be found in the old papers, worth just diving in for a lucky dip at times.

Reply Quote

Date: 19/01/2020 23:49:04
From: mollwollfumble
ID: 1488388
Subject: re: Newspaper archive search?

mollwollfumble said:


> https://www.techwalla.com/articles/how-to-convert-an-image-to-text

But does it work in Arabic?
Seriously though, I need to look into this because all the image to text converters I’ve seen so far are crap.
The ideal way is to pre-program the newspaper font into the converter – then ONLY use that font when reading the newspaper.

> https://news.google.com/newspapers

Since this only works for browse newspaper one by one, I’ve made a list of all (?) notable early newspapers. To make it onto the list, the newspaper has to be:

  • First published before 1900 (except for Nigeria and Kenya)
  • Published for at least 50 years
  • If in a foreign language, only one per country (except Japan)


It looks like none of the newspapers on that list, not a single one, is accessible through the https://news.google.com/newspapers website. I’ve drawn a complete blank on all the UK, other Europe, USA, and other Americas lists.

So scrap that.

Gutenberg website doesn’t do newspapers.

What about Wikipedia’s list of online newspapers?
Well at least three are there, behind a paywall :-(

Reply Quote