The organisation has also built a "general-purpose, multi-language political data-mining system" to search "massive datasets" for specific terms.
The "Syria Files" relate to emails dated August 2006 to March 2012, according to an announcement by WikiLeaks.
"This extraordinary data set derives from 680 Syria-related entities or domain names, including those of the ministries of presidential affairs, foreign affairs, finance, information, transport and culture."
WikiLeaks added it is "statistically confident that the vast majority of the data are what they purport to be."
The whistleblowers' website said that stories relating to the data will be published by WikiLeaks as well as a number of other publishers, including ARD in Germany, Associated Press in the US, L’Espresso in Italy and Owni in France.
In a press conference WikiLeaks outlined a new "data-mining system" which can be used to search for terms in documents, attachments and all file names within the attachments, and can also exclude certain terms.
Journalism.co.uk understands the new "search interface" will not be publicly available, but for use by WikiLeaks and journalists at partner news outlets.
Free daily newsletter
- A new dashboard from the FT helps editors identify and promote relevant archive stories
- New analytics tool Kaleida shows what stories and topics matter to readers
- Tip: Bookmark this collection of data storytelling tools
- ProPublica is collaborating with newsrooms to create a national database for hate crimes and bias incidents in the US
- Report: Technology trends journalists should watch in 2017