The organisation has also built a "general-purpose, multi-language political data-mining system" to search "massive datasets" for specific terms.
The "Syria Files" relate to emails dated August 2006 to March 2012, according to an announcement by WikiLeaks.
"This extraordinary data set derives from 680 Syria-related entities or domain names, including those of the ministries of presidential affairs, foreign affairs, finance, information, transport and culture."
WikiLeaks added it is "statistically confident that the vast majority of the data are what they purport to be."
The whistleblowers' website said that stories relating to the data will be published by WikiLeaks as well as a number of other publishers, including ARD in Germany, Associated Press in the US, L’Espresso in Italy and Owni in France.
In a press conference WikiLeaks outlined a new "data-mining system" which can be used to search for terms in documents, attachments and all file names within the attachments, and can also exclude certain terms.
Journalism.co.uk understands the new "search interface" will not be publicly available, but for use by WikiLeaks and journalists at partner news outlets.
Free daily newsletter
- ProPublica is collaborating with newsrooms to create a national database for hate crimes and bias incidents in the US
- Report: Technology trends journalists should watch in 2017
- How Reuters trains its journalists to work with new technologies and collaborate in the newsroom
- 'The story doesn't end with a spreadsheet' – Advice for journalists working with data
- Data, video, sponsored content and more: Survey highlights what publishers will be prioritising in 2017