Aspose.OCR for Java 24.3.0 - Release Notes

Deprecation warning

What was changed

Key | Summary | Category
OCRJAVA-356 | The library can now extract texts in 133 languages and automatically detect Extended Latin and Cyrillic texts without specifying the language. | Enhancement
OCRJAVA-356 | Some legacy language codes were deprecated. See details in the Deprecated APIs section. | Enhancement

Public API changes and backwards compatibility

This section lists all public API changes introduced in Aspose.OCR for Java 24.3.0 that may affect the code of existing applications.

Added public APIs:

No changes.

Updated public APIs:

The following public APIs have been updated in this release:

Language enumeration

Aspose.OCR for Java greatly expands the number of supported languages. The enumeration now lists 135 values: 133 languages based on Latin, Cyrillic, Chinese and Indic scripts, plus two auto-detect modes (Extended Latin and Cyrillic):

Value | Alphabet
Language.ExtLatin | All supported Latin characters and diacritics (auto-detect)
Language.Cyrillic | All supported Cyrillic characters (auto-detect)
Language.Afr | Afrikaans
Language.Aln | Albanian
Language.Awa | Awadhi
Language.Azb | Azerbaijani (Azeri)
Language.Bcl | Bikol
Language.Bel | Belarusan (Belorussian)
Language.Bem | Bemba (Chibemba)
Language.Bew | Betawi
Language.Bgc | Haryanvi
Language.Bho | Bhojpuri
Language.Bhr | Malagasy
Language.Bjj | Kanauji
Language.Bos | Bosnian
Language.Bul | Bulgarian
Language.Cat | Catalan
Language.Ccx | Zhuang
Language.Cdo | Min Dong
Language.Ceb | Cebuano
Language.Ces | Czech
Language.Che | Chechen
Language.Cmn | Mandarin (Chinese)
Language.Cpx | Pu-Xian
Language.Dan | Danish
Language.Deu | German
Language.Dhd | Dhundari
Language.Diq | Dimli
Language.Doc | Dong
Language.Eng | English
Language.Est | Estonian
Language.Fin | Finnish
Language.Fra | French
Language.Gan | Gan
Language.Gax | Oromo
Language.Gbm | Garhwali
Language.Glg | Galician
Language.Glk | Gilaki
Language.Guz | Gusii
Language.Hak | Hakka
Language.Hau | Hausa
Language.Hbs | Serbo-Croatian (Latin)
Language.Hil | Hiligaynon
Language.Hin | Hindi
Language.Hmn | Hmong
Language.Hne | Chattisgarhi (Laria, Khaltahi)
Language.Hrv | Croatian
Language.Hsn | Xiang
Language.Hun | Hungarian (Magyar)
Language.Ilo | Ilocano
Language.Ind | Indonesian
Language.Ita | Italian
Language.Kaz | Kazakh
Language.Kbd | Kabardian
Language.Kfy | Kumauni
Language.Kin | Rwanda
Language.Kln | Nandi
Language.Kmr | Kurdish (Kurmanji)
Language.Knc | Kanuri
Language.Knn | Konkani
Language.Kon | Kikongo
Language.Latin | Latin
Language.Lav | Latvian
Language.Lit | Lithuanian
Language.Lmn | Lamani (Lambadi)
Language.Lnc | Occitan
Language.Luo | Luo
Language.Mag | Magahi
Language.Mai | Maithili
Language.Mak | Makassar (Makasar)
Language.Mar | Marathi
Language.Mer | Meru
Language.Min | Minangkabau
Language.Mly | Malay (Melayu)
Language.Mnp | Min Bei
Language.Mtq | Muong
Language.Mtr | Mewari
Language.Mui | Musi
Language.Mup | Malvi
Language.Nan | Min Nan
Language.Nbl | Ndebele
Language.Nds | Low German
Language.Nep | Nepali
Language.Nld | Dutch
Language.Nor | Norwegian
Language.Nso | Sotho (Northern)
Language.Nya | Chichewa (Chewa, Nyanja)
Language.Pag | Pangasinan
Language.Pam | Kapampangan
Language.Pcc | Bouyei (Buyi, Giáy)
Language.Plm | Palembang
Language.Pol | Polish
Language.Por | Portuguese
Language.Quc | K'iche'
Language.Qxa | Quechua
Language.Rjb | Rajbanshi
Language.Ron | Romanian
Language.Ruf | Luguru
Language.Rus | Russian
Language.Rwr | Marwari
Language.Sas | Sasak
Language.Slk | Slovak
Language.Slv | Slovene (Slovenian)
Language.Sna | Shona (Karanga)
Language.Som | Somali
Language.Sot | Sotho (Southern)
Language.Spa | Spanish
Language.Srp | Serbian (Cyrillic)
Language.Srr | Serer-Sine
Language.Ssw | Swati (Swazi)
Language.Suk | Sukuma
Language.Sun | Sundanese (Sunda)
Language.Swe | Swedish
Language.Swh | Swahili
Language.Tgl | Tagalog (Pilipino)
Language.Toi | Tonga
Language.Tsn | Tswana
Language.Tso | Tsonga
Language.Tuk | Turkmen
Language.Tum | Tumbuka
Language.Tur | Turkish
Language.Ukr | Ukrainian
Language.Umb | Umbundu
Language.Vie | Vietnamese
Language.Vmw | Makua (Makhuwa)
Language.Wal | Wolaytta
Language.War | Waray-Waray
Language.Wbr | Wagdi
Language.Wtm | Mewati
Language.Wuu | Wu (Changzhou)
Language.Xho | Xhosa
Language.Yao | Yao
Language.Yor | Yoruba
Language.Yue | Cantonese
Language.Zul | Zulu

Removed public APIs:

No changes.

Deprecated APIs

The following public APIs have been marked as deprecated and will be removed in the 25.1.0 (January 2025) release:

Language.None

Use Language.ExtLatin to automatically detect any Latin characters and diacritics.

Language.Chi

Use one of the following languages depending on the actual language of the text:

  • Language.Cmn (Mandarin)
  • Language.Wuu (Wu)
  • Language.Yue (Cantonese)
  • Language.Nan (Min Nan)
  • Language.Hsn (Xiang)
  • Language.Hak (Hakka)
  • Language.Gan (Gan)

Language.Cze

Use Language.Ces for recognizing Czech texts.

Language.Dum

Use Language.Nld for recognizing Dutch texts.

Language.Srp_hrv

Use Language.Hbs for recognizing Serbo-Croatian texts (based on the Latin alphabet).

Language.Rum

Use Language.Ron for recognizing Romanian texts.
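
When auditing an existing code base for the deprecated values listed above, it may help to keep the replacements in one lookup table. The following is a minimal sketch only: DeprecatedLanguageCodes and its methods are hypothetical names, not part of Aspose.OCR, and the helper works on the constant names as plain strings so that it runs without the library on the classpath. Language.Chi has no single successor, so the sketch refuses to pick one and returns the candidates instead.

```java
import java.util.List;
import java.util.Map;

/**
 * Hypothetical migration helper (NOT part of Aspose.OCR).
 * Maps the names of Language values deprecated in 24.3.0 to the
 * replacement names given in the release notes.
 */
final class DeprecatedLanguageCodes {

    // Deprecated name -> replacement candidates, in the order the notes list them.
    private static final Map<String, List<String>> REPLACEMENTS = Map.of(
            "None", List.of("ExtLatin"),
            "Chi", List.of("Cmn", "Wuu", "Yue", "Nan", "Hsn", "Hak", "Gan"),
            "Cze", List.of("Ces"),
            "Dum", List.of("Nld"),
            "Srp_hrv", List.of("Hbs"),
            "Rum", List.of("Ron"));

    private DeprecatedLanguageCodes() {
    }

    /** True if the given constant name is on the deprecation list. */
    static boolean isDeprecated(String name) {
        return REPLACEMENTS.containsKey(name);
    }

    /** All replacement candidates; a name that is not deprecated maps to itself. */
    static List<String> candidatesFor(String name) {
        return REPLACEMENTS.getOrDefault(name, List.of(name));
    }

    /** The single replacement, or an exception when the choice depends on the text (Chi). */
    static String replacementFor(String name) {
        List<String> candidates = candidatesFor(name);
        if (candidates.size() != 1) {
            throw new IllegalArgumentException(
                    name + " has several possible replacements: " + candidates);
        }
        return candidates.get(0);
    }
}
```

For example, DeprecatedLanguageCodes.replacementFor("Cze") returns "Ces", while DeprecatedLanguageCodes.candidatesFor("Chi") returns the seven Chinese language names so that the caller can choose the one matching the actual text. Assuming Language is an ordinary Java enum, the returned name could then be converted with Language.valueOf(name).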

Examples

The code samples below illustrate the changes introduced in this release:

Auto-detect Extended Latin characters

import com.aspose.ocr.*;
import java.util.ArrayList;

// Initialize Aspose.OCR recognition API
AsposeOCR api = new AsposeOCR();
// Add image to the recognition batch
OcrInput source = new OcrInput(InputType.SingleImage);
source.add("image.png");
// Specify recognition language: auto-detect Latin characters and diacritics
RecognitionSettings recognitionSettings = new RecognitionSettings();
recognitionSettings.setLanguage(Language.ExtLatin);
// Extract text from image
ArrayList<RecognitionResult> results = api.Recognize(source, recognitionSettings);
// Print the text recognized in the first (and only) image of the batch
System.out.println(results.get(0).recognitionText);