The organisation has also built a "general-purpose, multi-language political data-mining system" to search "massive datasets" for specific terms.
The "Syria Files" relate to emails dated August 2006 to March 2012, according to an announcement by WikiLeaks.
"This extraordinary data set derives from 680 Syria-related entities or domain names, including those of the ministries of presidential affairs, foreign affairs, finance, information, transport and culture."
WikiLeaks added it is "statistically confident that the vast majority of the data are what they purport to be."
The whistleblowers' website said that stories relating to the data will be published by WikiLeaks as well as a number of other publishers, including ARD in Germany, Associated Press in the US, L’Espresso in Italy and Owni in France.
In a press conference WikiLeaks outlined a new "data-mining system" which can be used to search for terms in documents, attachments and all file names within the attachments, and can also exclude certain terms.
Journalism.co.uk understands the new "search interface" will not be publicly available, but for use by WikiLeaks and journalists at partner news outlets.
Free daily newsletter
- 4 approaches to building collaborative data infrastructures for journalism
- New, year-long project from the Guardian documents knife crime in the UK
- One year since launch, the Refugee Journalism Project is hoping to expand across the UK
- Tip: Advice for finding the story in your data set
- A new dashboard from the FT helps editors identify and promote relevant archive stories